← 返回 Skills 市场
minimax-speech
作者
silingyuan0
· GitHub ↗
· v1.0.0
· MIT-0
102
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install minimax-speech-generate
功能描述
MiniMax 语音合成技能 - 支持同步/异步文本转语音(T2S)、音色克隆(Voice Clone)、音色设计(Voice Design)、音色查询与删除。使用模型 speech-2.8-hd,输出 mp3/wav/pcm 格式音频文件到本地。
安全使用建议
This package appears to implement the advertised TTS and voice-clone features, but the manifest is incomplete and the source/origin is unknown. Before installing: 1) Do not use a production/broadly-permissioned API key — create a restricted/test key or billing limits. 2) Verify the API domains (api.minimaxi.com / minimax.io) and the provider's legitimacy; ask the author for a homepage or repository. 3) Ensure Python and the 'requests' library are available; the manifest should declare this dependency. 4) Understand privacy: cloning uploads your reference audio (potentially sensitive) to a third-party service — get consent from speakers. 5) Prefer running first in an isolated environment (container or VM) to observe network traffic and behavior. 6) Ask the publisher to correct the registry metadata to declare MINIMAX_API_KEY and MINIMAX_REGION and to provide verifiable source code hosting; absence of these is the main reason this skill is flagged as suspicious.
功能分析
Type: OpenClaw Skill
Name: minimax-speech-generate
Version: 1.0.0
The skill bundle provides a legitimate implementation of the MiniMax Speech API for text-to-speech, voice cloning, and voice design. The core logic in `scripts/speech.py` correctly interacts with official MiniMax endpoints (api.minimaxi.com and api.minimax.io) using standard practices, such as environment variables for API keys and base64 encoding for audio data. No indicators of data exfiltration, malicious execution, or prompt injection were found.
能力评估
Purpose & Capability
The skill's code and SKILL.md implement TTS, voice cloning, voice design, list/get/delete voice operations using a MiniMax HTTP API — that matches the described purpose. However, the registry metadata lists no required environment variables or dependencies, while both SKILL.md and scripts/speech.py require MINIMAX_API_KEY (and optionally MINIMAX_REGION) and the Python 'requests' library. The missing manifest declarations and unknown source/homepage are an incoherence and reduce traceability.
Instruction Scope
Runtime instructions and the script stay within the stated purpose: they call MiniMax API endpoints, write output audio files locally, and read reference audio files for cloning. The SKILL.md does instruct providing MINIMAX_API_KEY and MINIMAX_REGION (which is consistent with the code). There are no instructions to read unrelated user files or system secrets beyond the API key or to send data to unexpected endpoints. Note: cloning uploads base64-encoded audio to the third-party API (privacy/legal implication).
Install Mechanism
This is instruction-only with a bundled Python script (no install spec). That is lower risk, but the script depends on the 'requests' package which is not declared in metadata. There is no automated installer; users will need to ensure Python and requests are present. No arbitrary downloads or extract/install steps are present.
Credentials
The functionality legitimately requires a MINIMAX_API_KEY and optionally MINIMAX_REGION; these are referenced in SKILL.md and enforced by the code. However the registry metadata claims 'Required env vars: none', which is inconsistent and misleading. No other credentials are requested. The script reads local audio files when cloning (expected) and writes output files — this is proportional but users should be aware that audio data is uploaded to the remote service.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or system configurations, and does not persist credentials itself. Autonomous invocation is allowed by default (normal), but there is no elevated persistence or privileged behavior in the package.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install minimax-speech-generate - 安装完成后,直接呼叫该 Skill 的名称或使用
/minimax-speech-generate触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
- Initial release of the MiniMax speech synthesis skill.
- Supports synchronous and asynchronous text-to-speech (TTS) using the speech-2.8-hd model.
- Includes features for voice cloning, voice design, voice listing, querying, and deletion.
- Outputs audio files in mp3, wav, or pcm formats.
- Provides a command-line interface for all supported features.
- Requires MiniMax API key and region configuration.
元数据
常见问题
minimax-speech 是什么?
MiniMax 语音合成技能 - 支持同步/异步文本转语音(T2S)、音色克隆(Voice Clone)、音色设计(Voice Design)、音色查询与删除。使用模型 speech-2.8-hd,输出 mp3/wav/pcm 格式音频文件到本地。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 102 次。
如何安装 minimax-speech?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install minimax-speech-generate」即可一键安装,无需额外配置。
minimax-speech 是免费的吗?
是的,minimax-speech 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
minimax-speech 支持哪些平台?
minimax-speech 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 minimax-speech?
由 silingyuan0(@silingyuan0)开发并维护,当前版本 v1.0.0。
推荐 Skills