← Back to Skills Marketplace
silingyuan0

minimax-speech

by silingyuan0 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
102
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install minimax-speech-generate
Description
MiniMax 语音合成技能 - 支持同步/异步文本转语音(T2S)、音色克隆(Voice Clone)、音色设计(Voice Design)、音色查询与删除。使用模型 speech-2.8-hd,输出 mp3/wav/pcm 格式音频文件到本地。
Usage Guidance
This package appears to implement the advertised TTS and voice-clone features, but the manifest is incomplete and the source/origin is unknown. Before installing: 1) Do not use a production/broadly-permissioned API key — create a restricted/test key or billing limits. 2) Verify the API domains (api.minimaxi.com / minimax.io) and the provider's legitimacy; ask the author for a homepage or repository. 3) Ensure Python and the 'requests' library are available; the manifest should declare this dependency. 4) Understand privacy: cloning uploads your reference audio (potentially sensitive) to a third-party service — get consent from speakers. 5) Prefer running first in an isolated environment (container or VM) to observe network traffic and behavior. 6) Ask the publisher to correct the registry metadata to declare MINIMAX_API_KEY and MINIMAX_REGION and to provide verifiable source code hosting; absence of these is the main reason this skill is flagged as suspicious.
Capability Analysis
Type: OpenClaw Skill Name: minimax-speech-generate Version: 1.0.0 The skill bundle provides a legitimate implementation of the MiniMax Speech API for text-to-speech, voice cloning, and voice design. The core logic in `scripts/speech.py` correctly interacts with official MiniMax endpoints (api.minimaxi.com and api.minimax.io) using standard practices, such as environment variables for API keys and base64 encoding for audio data. No indicators of data exfiltration, malicious execution, or prompt injection were found.
Capability Assessment
Purpose & Capability
The skill's code and SKILL.md implement TTS, voice cloning, voice design, list/get/delete voice operations using a MiniMax HTTP API — that matches the described purpose. However, the registry metadata lists no required environment variables or dependencies, while both SKILL.md and scripts/speech.py require MINIMAX_API_KEY (and optionally MINIMAX_REGION) and the Python 'requests' library. The missing manifest declarations and unknown source/homepage are an incoherence and reduce traceability.
Instruction Scope
Runtime instructions and the script stay within the stated purpose: they call MiniMax API endpoints, write output audio files locally, and read reference audio files for cloning. The SKILL.md does instruct providing MINIMAX_API_KEY and MINIMAX_REGION (which is consistent with the code). There are no instructions to read unrelated user files or system secrets beyond the API key or to send data to unexpected endpoints. Note: cloning uploads base64-encoded audio to the third-party API (privacy/legal implication).
Install Mechanism
This is instruction-only with a bundled Python script (no install spec). That is lower risk, but the script depends on the 'requests' package which is not declared in metadata. There is no automated installer; users will need to ensure Python and requests are present. No arbitrary downloads or extract/install steps are present.
Credentials
The functionality legitimately requires a MINIMAX_API_KEY and optionally MINIMAX_REGION; these are referenced in SKILL.md and enforced by the code. However the registry metadata claims 'Required env vars: none', which is inconsistent and misleading. No other credentials are requested. The script reads local audio files when cloning (expected) and writes output files — this is proportional but users should be aware that audio data is uploaded to the remote service.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or system configurations, and does not persist credentials itself. Autonomous invocation is allowed by default (normal), but there is no elevated persistence or privileged behavior in the package.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install minimax-speech-generate
  3. After installation, invoke the skill by name or use /minimax-speech-generate
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of the MiniMax speech synthesis skill. - Supports synchronous and asynchronous text-to-speech (TTS) using the speech-2.8-hd model. - Includes features for voice cloning, voice design, voice listing, querying, and deletion. - Outputs audio files in mp3, wav, or pcm formats. - Provides a command-line interface for all supported features. - Requires MiniMax API key and region configuration.
Metadata
Slug minimax-speech-generate
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is minimax-speech?

MiniMax 语音合成技能 - 支持同步/异步文本转语音(T2S)、音色克隆(Voice Clone)、音色设计(Voice Design)、音色查询与删除。使用模型 speech-2.8-hd,输出 mp3/wav/pcm 格式音频文件到本地。 It is an AI Agent Skill for Claude Code / OpenClaw, with 102 downloads so far.

How do I install minimax-speech?

Run "/install minimax-speech-generate" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is minimax-speech free?

Yes, minimax-speech is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does minimax-speech support?

minimax-speech is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created minimax-speech?

It is built and maintained by silingyuan0 (@silingyuan0); the current version is v1.0.0.

💬 Comments