
mmVoiceMaker

Name: mmVoiceMaker
Author: blue-coconut

By Haolan He · GitHub · v1.0.1

cross-platform ⚠ suspicious

Total downloads: 812

Install in OpenClaw

/install mm-voice-maker

Description

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...

Security recommendations

Key things to consider before installing or using this skill:
- Metadata mismatch: the registry claims 'no required env vars' but the skill and its check script require MINIMAX_VOICE_API_KEY (and optionally MINIMAX_API_BASE). Do not proceed without confirming where that API key comes from and what permissions it has.
- Network calls: the code will call https://api.minimaxi.com (or whatever MINIMAX_API_BASE you set). Verify the API provider is legitimate and that sending audio and transcripts to it matches your privacy policy.
- Run checks in a safe workspace: follow the instructions to run python check_environment.py and run the CLI from an isolated folder or container so the tool's temp files (./audio/tmp/) cannot overwrite important data.
- Review included code if you can: because the package includes Python scripts, inspect scripts/ for any unexpected endpoints, logging of secrets, or upload routines before providing your API key.
- If unsure, run in an isolated environment (VM/container) and/or provide a least-privilege API key (if the provider supports scoping) or a test account.

Resolve the metadata inconsistency with the skill author (who/what is the MiniMax service and why were required env vars omitted) before trusting it with sensitive inputs.

Functional analysis

Type: OpenClaw Skill
Name: mm-voice-maker
Version: 1.0.1

The mm-voice-maker skill bundle is a legitimate and well-documented integration for the MiniMax Voice API and FFmpeg audio processing. It provides a comprehensive CLI (mmvoice.py) and a suite of Python scripts for high-quality text-to-speech, voice cloning, and audio editing. The code uses standard libraries and follows safe practices for executing external commands via subprocess.run, avoiding shell injection risks. The SKILL.md instructions provide clear, safety-conscious guidelines for the AI agent, such as requiring user confirmation before deleting temporary files and emphasizing gender-matching for voice selection. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found.
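The subprocess.run practice mentioned above can be illustrated with a minimal sketch. This is not the skill's own code: the function names build_convert_command and convert are hypothetical, and the resampling task is only an assumed example. It shows the general pattern of passing FFmpeg arguments as a list instead of a shell string.

```python
import subprocess
from pathlib import Path


def build_convert_command(src: Path, dst: Path, sample_rate: int = 44100) -> list[str]:
    """Build an FFmpeg argument list that resamples src into dst (hypothetical helper)."""
    return [
        "ffmpeg",
        "-y",                      # overwrite dst without prompting
        "-i", str(src),            # input file, passed as a single argument
        "-ar", str(sample_rate),   # output audio sample rate in Hz
        str(dst),
    ]


def convert(src: Path, dst: Path) -> None:
    # No shell=True: each list element reaches FFmpeg verbatim, so shell
    # metacharacters in a file name are never interpreted by a shell.
    subprocess.run(build_convert_command(src, dst), check=True)
```

Because the file name travels as one list element, a name such as "a; rm -rf x.wav" is treated as a literal path and never as a second command.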

Capability assessment

⚠ Purpose & Capability

The name and description (MiniMax TTS + FFmpeg) match the included code and docs: the code implements TTS, cloning, design, and FFmpeg-based audio processing. However, the registry metadata claims no required environment variables or credentials, while both SKILL.md and the code clearly require MINIMAX_VOICE_API_KEY (and optionally MINIMAX_API_BASE). This omission is an inconsistency: anyone running the skill legitimately needs the API key and FFmpeg, so the declared metadata is incorrect or incomplete.

ℹ Instruction Scope

Runtime instructions tell the agent to run check_environment.py, create and validate segments.json, save intermediate and final audio files under the agent's current working directory, and to contact the MiniMax API. The instructions therefore require filesystem write access in the agent's cwd and network access to api.minimaxi.com. Nothing in the instructions directs broad or unrelated data collection, but the agent will create potentially many temp files (./audio/tmp/) and is told to persist temp files until the user confirms — be careful to run in a safe directory and review produced files before deleting.

✓ Install Mechanism

There is no external install step or remote download in the skill bundle — the Python source files are included in the package. That reduces supply-chain risk relative to arbitrary remote installs. The code expects typical Python dependencies (requests, websockets) and FFmpeg, but there is no automatic installer; the user/agent must install those separately.
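Since nothing installs the dependencies automatically, a quick preflight check can help before running the skill. The sketch below is an assumption-laden illustration, not the bundled check_environment.py: the function name missing_dependencies is hypothetical, and the module and tool names are taken from the review text above.

```python
import importlib.util
import shutil


def missing_dependencies(modules=("requests", "websockets"), tools=("ffmpeg",)):
    """Return names of Python modules and executables that cannot be found."""
    # find_spec returns None when a top-level module is not importable.
    missing = [m for m in modules if importlib.util.find_spec(m) is None]
    # shutil.which returns None when the executable is not on PATH.
    missing += [t for t in tools if shutil.which(t) is None]
    return missing


if __name__ == "__main__":
    problems = missing_dependencies()
    print("Missing:", ", ".join(problems) if problems else "none")
```

The check only locates the modules and the executable; it does not import them or verify their versions.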

⚠ Credentials

Although the registry lists no required env vars, both SKILL.md and check_environment.py require MINIMAX_VOICE_API_KEY (and support MINIMAX_API_BASE). The skill will read that environment variable and use it to authenticate to the MiniMax API. This mismatch between declared and actual required credentials is a material inconsistency and should be resolved before use. Aside from the API key and FFmpeg, no unrelated credentials are requested.
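To make the credential requirement concrete, here is a hedged sketch of how a script might read these two variables. It is not taken from the skill: load_minimax_settings and mask are hypothetical names, and falling back to https://api.minimaxi.com is an assumption based on the host named in the security recommendations.

```python
import os

# Assumed default, based on the host named in the security review.
DEFAULT_API_BASE = "https://api.minimaxi.com"


def load_minimax_settings(env=os.environ):
    """Read the variables the skill expects; raise if the key is absent."""
    api_key = env.get("MINIMAX_VOICE_API_KEY", "").strip()
    if not api_key:
        raise RuntimeError("MINIMAX_VOICE_API_KEY is not set")
    api_base = env.get("MINIMAX_API_BASE", DEFAULT_API_BASE).rstrip("/")
    return api_key, api_base


def mask(secret, visible=4):
    """Hide all but the last few characters so the key stays out of logs."""
    if len(secret) <= visible:
        return "*" * len(secret)
    return "*" * (len(secret) - visible) + secret[-visible:]
```

Printing mask(api_key) instead of the raw key is one way to confirm which key is loaded without exposing it.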

ℹ Persistence & Privilege

The skill is not marked always:true and does not request elevated platform privileges. It does write temporary and output audio files into the agent's working directory and suggests manual cleanup (rm -rf ./audio/tmp/). That file-writing behavior is expected for audio production but means you should avoid running it from sensitive system directories and confirm file locations before running destructive cleanup commands.
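As a more cautious alternative to a bare rm -rf, the cleanup could be guarded so that it only removes a directory inside the current workspace and only after confirmation. The following is a sketch under stated assumptions: remove_tmp_dir is a hypothetical helper, not part of the skill, and the audio/tmp path is the one named in the review.

```python
import shutil
from pathlib import Path


def remove_tmp_dir(tmp_dir="audio/tmp", workspace=None, confirm=input):
    """Delete tmp_dir only if it sits inside workspace and the user agrees."""
    root = Path(workspace or Path.cwd()).resolve()
    target = (root / tmp_dir).resolve()
    # Refuse anything that resolves to the workspace itself or escapes it.
    if target == root or root not in target.parents:
        raise ValueError(f"refusing to delete outside workspace: {target}")
    if not target.is_dir():
        return False
    answer = confirm(f"Delete {target}? [y/N] ")
    if answer.strip().lower() != "y":
        return False
    shutil.rmtree(target)
    return True
```

Resolving both paths before comparing them means a value such as "../outside" is rejected instead of silently deleting a sibling directory.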

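How to use

Make sure OpenClaw is installed (locally or via Docker).
Enter the install command in the chat box: /install mm-voice-maker
Once installation finishes, invoke the skill by name or trigger it with /mm-voice-maker.
Provide the required inputs described in the skill's parameter documentation to get structured output.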

Version history

v1.0.1

MiniMax Voice Maker 1.0.1
- Introduces comprehensive voice synthesis, voice cloning, voice design, and audio post-processing capabilities, powered by MiniMax Voice API and FFmpeg.
- Supports advanced workflows: emotion detection, multi-voice/multi-emotion synthesis, and segment-based audio generation.
- Provides robust CLI tools and scripts for sync/async TTS, audio processing, and voice management.
- Includes detailed workflow guides, voice selection principles (gender matching), and troubleshooting documentation.
- Emphasizes user confirmation and environment verification for reliable operation.

v1.0.0

A skill for complex text-to-speech production, powered by MiniMax Voice API and FFmpeg. Supports multi-voice synthesis for audiobooks, podcasts, and similar content. Also provides voice cloning (from 10 s to 5 min of audio), voice design (from a text prompt), and audio post-processing (merge, convert, normalize, trim).

Metadata

Slug: mm-voice-maker

Version: 1.0.1

License: not specified

Total installs: 0

Current installs: 0

Versions released: 2

FAQ

What is mmVoiceMaker?

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat... It is an AI agent skill plugin for Claude Code / OpenClaw, with 812 total downloads to date.

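How do I install mmVoiceMaker?

Run the command "/install mm-voice-maker" in the OpenClaw or Claude Code chat box to install it in one step. The install itself needs no extra configuration, but as the capability assessment notes, running the skill requires MINIMAX_VOICE_API_KEY and FFmpeg.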

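Is mmVoiceMaker free?

Yes. mmVoiceMaker is completely free (free and open source) and can be downloaded, installed, and used freely.

Which platforms does mmVoiceMaker support?

mmVoiceMaker is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed mmVoiceMaker?

It is developed and maintained by Haolan He (@blue-coconut); the current version is v1.0.1.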