← Back to Skills Marketplace

mmVoiceMaker

Name: mmVoiceMaker
Author: blue-coconut

by Haolan He · GitHub ↗ · v1.0.1

cross-platform ⚠ suspicious

812

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install mm-voice-maker

Description

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat...

Usage Guidance

Key things to consider before installing/use: - Metadata mismatch: the registry claims 'no required env vars' but the skill and its check script require MINIMAX_VOICE_API_KEY (and optionally MINIMAX_API_BASE). Do not proceed without confirming where that API key comes from and what permissions it has. - Network calls: the code will call https://api.minimaxi.com (or whatever MINIMAX_API_BASE you set). Verify the API provider is legitimate and that sending audio and transcripts to it matches your privacy policy. - Run checks in a safe workspace: follow the instructions to run python check_environment.py and run the CLI from an isolated folder or container so the tool's temp files (./audio/tmp/) cannot overwrite important data. - Review included code if you can: because the package includes Python scripts, inspect scripts/ for any unexpected endpoints, logging of secrets, or upload routines before providing your API key. - If unsure, run in an isolated environment (VM/container) and/or provide a least-privilege API key (if provider supports scoping) or a test account. Resolve the metadata inconsistency with the skill author (who/what is the MiniMax service and why were required env vars omitted) before trusting it with sensitive inputs.

Capability Analysis

Type: OpenClaw Skill Name: mm-voice-maker Version: 1.0.1 The mm-voice-maker skill bundle is a legitimate and well-documented integration for the MiniMax Voice API and FFmpeg audio processing. It provides a comprehensive CLI (mmvoice.py) and a suite of Python scripts for high-quality text-to-speech, voice cloning, and audio editing. The code uses standard libraries and follows safe practices for executing external commands via subprocess.run, avoiding shell injection risks. The SKILL.md instructions provide clear, safety-conscious guidelines for the AI agent, such as requiring user confirmation before deleting temporary files and emphasizing gender-matching for voice selection. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found.

Capability Assessment

⚠ Purpose & Capability

The name/description (MiniMax TTS + FFmpeg) match the included code and docs: the code implements TTS, cloning, design, and FFmpeg-based audio processing. However, the registry metadata claims no required environment variables or credentials while both SKILL.md and the code clearly require MINIMAX_VOICE_API_KEY (and optionally MINIMAX_API_BASE). That metadata omission is an incoherence: someone implementing this skill legitimately needs the API key and FFmpeg, so the declared metadata is incorrect or incomplete.

ℹ Instruction Scope

Runtime instructions tell the agent to run check_environment.py, create and validate segments.json, save intermediate and final audio files under the agent's current working directory, and to contact the MiniMax API. The instructions therefore require filesystem write access in the agent's cwd and network access to api.minimaxi.com. Nothing in the instructions directs broad or unrelated data collection, but the agent will create potentially many temp files (./audio/tmp/) and is told to persist temp files until the user confirms — be careful to run in a safe directory and review produced files before deleting.

✓ Install Mechanism

There is no external install step or remote download in the skill bundle — the Python source files are included in the package. That reduces supply-chain risk relative to arbitrary remote installs. The code expects typical Python dependencies (requests, websockets) and FFmpeg, but there is no automatic installer; the user/agent must install those separately.

⚠ Credentials

Although the registry lists no required env vars, both SKILL.md and check_environment.py require MINIMAX_VOICE_API_KEY (and support MINIMAX_API_BASE). The skill will read that environment variable and use it to authenticate to the MiniMax API. This mismatch between declared and actual required credentials is a material inconsistency and should be resolved before use. Aside from the API key and FFmpeg, no unrelated credentials are requested.

ℹ Persistence & Privilege

The skill is not marked always:true and does not request elevated platform privileges. It does write temporary and output audio files into the agent's working directory and suggests manual cleanup (rm -rf ./audio/tmp/). That file-writing behavior is expected for audio production but means you should avoid running it from sensitive system directories and confirm file locations before running destructive cleanup commands.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install mm-voice-maker
After installation, invoke the skill by name or use /mm-voice-maker
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

MiniMax Voice Maker 1.0.1 - Introduces comprehensive voice synthesis, voice cloning, voice design, and audio post-processing capabilities, powered by MiniMax Voice API and FFmpeg. - Supports advanced workflows: emotion detection, multi-voice/multi-emotion synthesis, and segment-based audio generation. - Provides robust CLI tools and scripts for sync/async TTS, audio processing, and voice management. - Includes detailed workflow guides, voice selection principles (gender matching), and troubleshooting documentation. - Emphasizes user confirmation and environment verification for reliable operation.

v1.0.0

Complex text-to-speech production skill powered by MiniMax Voice API and FFmpeg. | Support multi-voice synthesis, can create audiobooks, podcasts, etc. Also provides voice cloning (10s–5min audio), voice design (text prompt), audio post-processing (merge, convert, normalize, trim) capabilities. 基于 MiniMax Voice API 和 FFmpeg 的复杂语音制作技能。 | 支持多角色语音合成，可制作有声书、播客等。还提供了声音克隆（10秒–5分钟音频）、声音设计（文字描述生成）、音频后处理（合并、转换、归一化、裁剪）能力。

Metadata

Slug mm-voice-maker

Version 1.0.1

License —

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is mmVoiceMaker?

Enables voice synthesis, voice cloning, voice design, and audio post-processing using MiniMax Voice API and FFmpeg. Use when converting text to speech, creat... It is an AI Agent Skill for Claude Code / OpenClaw, with 812 downloads so far.

How do I install mmVoiceMaker?

Run "/install mm-voice-maker" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is mmVoiceMaker free?

Yes, mmVoiceMaker is completely free (open-source). You can download, install and use it at no cost.

Which platforms does mmVoiceMaker support?

mmVoiceMaker is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created mmVoiceMaker?

It is built and maintained by Haolan He (@blue-coconut); the current version is v1.0.1.

More Skills