Total downloads: 157
Favorites: 1
Current installs: 1
Versions: 4
Install in OpenClaw
/install minimax-tokenplan-tts
Description
Generate speech audio from text using MiniMax speech-2.8-hd model. Supports multiple voice options, speed/pitch/volume control, WAV file output with automati...
Security recommendations
This skill appears to implement the claimed MiniMax TTS functionality, but take these precautions before installing/using it:
- Do NOT paste your API key into the top of the scripts. Instead pass it on the command line or modify the code to read MINIMAX_API_KEY from the environment (safer than hard-coding).
- The streaming script disables SSL verification (accepts any certificate). This makes the API key vulnerable to interception on hostile or misconfigured networks. If you plan to use streaming, edit stream_play.py to enable certificate verification (remove the lines that set check_hostname=False and verify_mode=CERT_NONE) or ensure you use a trusted network.
- The SKILL.md metadata references a download URL (clawhub.ai) rather than the MiniMax domain; prefer obtaining code from trusted sources or verify the included files' contents. Since the package already includes scripts, prefer using the bundled code rather than invoking any external installer from unknown hosts.
- Avoid leaving secrets in code or in version-controlled files. If you must store credentials locally, use environment variables or a secrets manager and confirm the script reads them securely.
- Review and test on an isolated environment first (or with a non-production API key) before allowing the agent to call this skill autonomously.
If the maintainer can: (1) remove the hard-coded API_KEY placeholder and read MINIMAX_API_KEY from env by default, and (2) enable proper SSL verification in the websocket client, the remaining concerns would be largely addressed and my confidence would increase.
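The first recommendation (read MINIMAX_API_KEY from the environment, with --api-key as an override) could look roughly like the sketch below. The function names `resolve_api_key` and `build_parser` are hypothetical and do not come from the bundled scripts; only the `--api-key` flag and the MINIMAX_API_KEY variable are named in the review above.

```python
import argparse
import os


def resolve_api_key(cli_value=None, env=None):
    """Return the key from --api-key if given, otherwise from MINIMAX_API_KEY.

    Hypothetical helper: the shipped scripts use a top-of-file constant instead.
    """
    env = os.environ if env is None else env
    key = (cli_value or env.get("MINIMAX_API_KEY", "")).strip()
    if not key:
        # Fail early rather than sending an empty bearer token.
        raise SystemExit("No API key: set MINIMAX_API_KEY or pass --api-key")
    return key


def build_parser():
    """Minimal argument parser exposing only the --api-key override."""
    parser = argparse.ArgumentParser(description="TTS wrapper (illustrative)")
    parser.add_argument("--api-key", default=None,
                        help="overrides the MINIMAX_API_KEY environment variable")
    return parser
```

With this arrangement no secret needs to be written into a source file; a caller would run `resolve_api_key(build_parser().parse_args().api_key)` once at startup.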
Capability assessment
Purpose & Capability
Name/description (MiniMax TTS) align with the included scripts and declared binaries (python3, ffplay) and the single required credential (MINIMAX_API_KEY). Minor inconsistency: the SKILL.md/registry declares MINIMAX_API_KEY as required, but the shipped scripts default to a top-of-file API_KEY constant and only accept an override via --api-key rather than automatically reading the MINIMAX_API_KEY env var — the README even tells the user to edit the scripts to paste the key. This is bad practice but consistent with a simple wrapper script.
Instruction Scope
SKILL.md instructs the agent/user to obtain the API key and to edit the two scripts directly (paste the key and base URL) and then delete the init section. Encouraging manual insertion of secrets into code files is insecure. The instructions also reference IDENTIY.md (not present in the bundle) and recommend always using streaming playback in webchat — giving the agent a persistent preference but not itself malicious. The scripts perform network calls only to the documented MiniMax endpoints; there is no evidence they read unrelated system files or environment variables. However, stream_play.py explicitly disables SSL certificate verification (ssl.verify_mode = CERT_NONE and check_hostname = False), which weakens transport security and could expose the API key to a man-in-the-middle.
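For comparison, a TLS context that keeps certificate verification enabled can be built with the standard library alone. This is a sketch of the general technique, not the contents of stream_play.py; the helper names are hypothetical. Passing the resulting context to the WebSocket client (for example through the `ssl=` argument of `websockets.connect`) is an assumption about how the script would be wired.

```python
import ssl


def make_verified_context():
    """Build a client TLS context with verification left on.

    ssl.create_default_context() already sets check_hostname=True and
    verify_mode=CERT_REQUIRED, so nothing needs to be overridden.
    """
    return ssl.create_default_context()


def is_verifying(ctx):
    """Return True only if the context checks both certificate and hostname."""
    return ctx.check_hostname and ctx.verify_mode == ssl.CERT_REQUIRED
```

A context modified with `check_hostname = False` and `verify_mode = ssl.CERT_NONE`, as described above, would make `is_verifying` return False.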
Install Mechanism
Registry reports no install spec but SKILL.md metadata contains an install entry pointing to https://clawhub.ai/skills/minimax-tokenplan-tts (kind: download). There is no evidence the skill automatically downloads arbitrary code at runtime in the included files, but the presence of a download install URL in metadata (not a well-known release host) is a minor red flag. The actual deliverable is an instruction-only skill with bundled scripts (no opaque remote install required).
Credentials
Only one credential (MINIMAX_API_KEY) is declared, which is appropriate for a TTS integration. But the scripts do not automatically use the MINIMAX_API_KEY env var — they rely on a hard-coded placeholder and instruct users to embed the key in the scripts or pass --api-key. That increases risk of accidental secret leakage. Additionally, the stream websocket code disables TLS verification, which combined with a bearer token increases the risk of credential exposure via MITM. No other unrelated credentials or config paths are requested.
Persistence & Privilege
The skill does not request always:true and does not try to modify other skills or system-wide settings. It writes TTS output to ~/.openclaw/media/minimax/tts/ (declared filesystem write permission), which is proportional to its purpose. It can be invoked autonomously by the agent (default), which is expected behavior for a skill.
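To illustrate how writes can stay confined to the declared directory, the sketch below builds an output path under ~/.openclaw/media/minimax/tts/ and rejects anything that resolves outside it. The timestamped file name and both helper names are assumptions made for illustration; the review does not say how the skill names its files.

```python
from datetime import datetime
from pathlib import Path

# Directory named in the review; everything else here is illustrative.
TTS_DIR = Path.home() / ".openclaw" / "media" / "minimax" / "tts"


def output_path(base_dir=TTS_DIR, now=None):
    """Return a timestamped .wav path inside base_dir (naming is an assumption)."""
    now = now or datetime.now()
    return Path(base_dir) / f"tts_{now:%Y%m%d_%H%M%S}.wav"


def ensure_inside(path, base_dir=TTS_DIR):
    """Resolve path and raise ValueError if it escapes base_dir."""
    base = Path(base_dir).resolve()
    target = Path(path).resolve()
    if base != target and base not in target.parents:
        raise ValueError(f"{target} is outside {base}")
    return target
```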
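How to use
- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install minimax-tokenplan-tts
- Once installation finishes, call the Skill by name or trigger it with /minimax-tokenplan-tts
- Provide the inputs required by the Skill's parameter description to get structured output.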
Version history
v1.0.1
- Updated skill version to 1.0.1.
- Refined the initialization instructions for API Key and BASE_URL: users now need to manually fill both generate.py and stream_play.py, rather than loading from multiple config locations.
- Clarified the requirement to confirm network region and voice selection during setup.
- No functional code or API changes; only documentation updated for clearer setup steps.
v0.9.2
**Major update: Adds real-time streaming TTS playback feature.**
- Added stream_play.py for real-time, low-latency streaming TTS playback via WebSocket and ffplay.
- Updated documentation to recommend streaming playback (via stream_play.py) as default for webchat/immediate response scenarios.
- Streaming playback starts audio as soon as the first data packet arrives; no file generation needed unless explicitly requested.
- ffplay and websockets are now listed as requirements for the streaming feature.
- Updated capabilities and metadata to reflect streaming playback and expanded usage notes.
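The streaming behaviour described for v0.9.2 (audio starts as soon as the first packet arrives, with no intermediate file) can be sketched by piping chunks into ffplay's standard input. This is not the code in stream_play.py. It assumes each packet carries hex-encoded audio, and the function names are hypothetical; the ffplay options used (`-nodisp`, `-autoexit`, `-loglevel`, `-i pipe:0`) are standard.

```python
import subprocess

# Read audio from stdin, show no window, and exit when the input ends.
FFPLAY_CMD = ["ffplay", "-nodisp", "-autoexit", "-loglevel", "quiet", "-i", "pipe:0"]


def feed_chunks(hex_chunks, sink):
    """Decode each hex chunk and write it to sink right away.

    Returns the number of bytes written. Assumes hex-encoded packets.
    """
    total = 0
    for chunk in hex_chunks:
        data = bytes.fromhex(chunk)
        sink.write(data)
        sink.flush()  # hand the bytes to the player without waiting for more
        total += len(data)
    return total


def play_stream(hex_chunks):
    """Start ffplay and stream decoded chunks into it as they arrive."""
    proc = subprocess.Popen(FFPLAY_CMD, stdin=subprocess.PIPE)
    try:
        return feed_chunks(hex_chunks, proc.stdin)
    finally:
        proc.stdin.close()
        proc.wait()
```

Because `feed_chunks` accepts any iterable, `hex_chunks` can be a generator that yields packets as the WebSocket delivers them, so playback begins before the full response is available.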
v0.9.1
minimax-tokenplan-tts 0.9.1
- Added CHANGELOG.md file.
- Updated homepage metadata URL to point to the official MiniMax API reference.
- Bumped version to 0.9.1.
- No functional/code changes included.
v0.9.0
- Initial public release of minimax-tokenplan-tts (v0.9.0)
- Generates speech audio from text using the MiniMax speech-2.8-hd model
- Supports multiple voice options covering 40+ languages and 300+ voices
- Provides speed, pitch, and volume control for generated speech
- Outputs result in WAV format with automatic HEX decoding
- Designed as the preferred tool for all TTS (text-to-speech) requests
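The "automatic HEX decoding" step can be pictured as follows: the audio arrives as a hexadecimal string, is converted to raw bytes, and is written out as a .wav file. This is a minimal sketch under that assumption; the helper names and the RIFF/WAVE header check are illustrative and are not taken from generate.py.

```python
def decode_hex_audio(hex_audio):
    """Convert a hex-encoded audio payload into raw bytes."""
    return bytes.fromhex(hex_audio.strip())


def save_wav(hex_audio, path):
    """Decode the payload, check that it looks like a WAV file, and write it.

    Returns the number of bytes written.
    """
    data = decode_hex_audio(hex_audio)
    # A WAV file starts with "RIFF", four size bytes, then "WAVE".
    if data[:4] != b"RIFF" or data[8:12] != b"WAVE":
        raise ValueError("decoded payload is not a WAV file")
    with open(path, "wb") as fh:
        fh.write(data)
    return len(data)
```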
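Metadata
FAQ
What is minimax-tokenplan-tts?
Generate speech audio from text using MiniMax speech-2.8-hd model. Supports multiple voice options, speed/pitch/volume control, WAV file output with automati... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 157 total downloads to date.
How do I install minimax-tokenplan-tts?
Run the command "/install minimax-tokenplan-tts" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra configuration, although the skill requires a MiniMax API key (MINIMAX_API_KEY) to run.
Is minimax-tokenplan-tts free?
Yes. minimax-tokenplan-tts is completely free, released under the MIT-0 license, and can be freely downloaded, installed, and used.
Which platforms does minimax-tokenplan-tts support?
minimax-tokenplan-tts is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed (darwin, linux, win32).
Who develops minimax-tokenplan-tts?
It is developed and maintained by k.x. (@4833675); the current version is v1.0.1.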