Minimax-Multimodal-Toolkit
by MiniMax-AI · GitHub · v1.0.2 · MIT-0
5028 Downloads · 19 Stars · 27 Active Installs · 3 Versions
Install in OpenClaw
/install minimax-multimodal
Description
Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax mo...
Usage Guidance
Before installing, verify the mmx-cli npm package and consider pinning a trusted version. Prefer environment variables or a secure secret manager over putting API keys directly on command lines, protect ~/.mmx/credentials.json, rotate the key if exposed, avoid --yes for costly or account-changing actions unless explicitly approved, and only send files or prompts you are comfortable sharing with MiniMax.
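One way to act on the "protect ~/.mmx/credentials.json" advice is to check that the file is readable only by its owner. The following is a minimal sketch, assuming a POSIX system; the helper names `is_private` and `lock_down` are hypothetical and are not part of mmx-cli.

```python
import os
import stat
from pathlib import Path

# Path taken from the guidance above; adjust if your setup stores it elsewhere.
CREDENTIALS_PATH = Path.home() / ".mmx" / "credentials.json"


def is_private(path: Path) -> bool:
    """Return True if no group/other permission bits are set on the file."""
    mode = stat.S_IMODE(path.stat().st_mode)
    return mode & (stat.S_IRWXG | stat.S_IRWXO) == 0


def lock_down(path: Path) -> None:
    """Restrict the file to owner read/write only (mode 0600)."""
    os.chmod(path, stat.S_IRUSR | stat.S_IWUSR)


if __name__ == "__main__":
    # Only touch the file if it exists and is more permissive than 0600.
    if CREDENTIALS_PATH.exists() and not is_private(CREDENTIALS_PATH):
        lock_down(CREDENTIALS_PATH)
        print(f"Tightened permissions on {CREDENTIALS_PATH}")
```

Permission bits behave differently on Windows, so this check is only meaningful on Linux and macOS.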
Capability Assessment
Purpose & Capability
The documented capabilities match the stated purpose: text, image, video, speech, music, vision, search, quota, and configuration through the MiniMax CLI.
Instruction Scope
The skill recommends agent-friendly flags including --yes and --quiet; this is disclosed and useful for automation, but agents should only suppress confirmations for user-approved actions.
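The rule that confirmations are suppressed only for user-approved actions can be enforced where the command line is assembled. The sketch below is illustrative only: `build_mmx_argv` is a hypothetical helper, `<subcommand>` is a placeholder rather than a real mmx command, and appending the flags at the end of the argument list is an assumption about how the CLI parses them.

```python
from typing import List, Sequence


def build_mmx_argv(args: Sequence[str], approved: bool, quiet: bool = True) -> List[str]:
    """Build an argv for mmx, adding --yes only when the user approved the action."""
    # Drop any --yes/--quiet the caller passed so approval is decided in one place.
    cleaned = [a for a in args if a not in ("--yes", "--quiet")]
    argv = ["mmx", *cleaned]
    if quiet:
        argv.append("--quiet")
    if approved:
        argv.append("--yes")
    return argv


if __name__ == "__main__":
    # "<subcommand>" is a placeholder, not a documented mmx command.
    print(build_mmx_argv(["<subcommand>"], approved=False))
    print(build_mmx_argv(["<subcommand>"], approved=True))
```

Keeping the approval decision in a single function means an agent cannot pass `--yes` through by accident when the user has not confirmed a costly or account-changing action.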
Install Mechanism
The skill is instruction-only and asks users to install the external npm package mmx-cli globally without a pinned version, so package provenance should be checked before use.
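Pinning can be made explicit by refusing to build an install command without an exact version. This is a rough sketch: `pinned_install_cmd` is a hypothetical helper, and the version string shown is a placeholder, not a real mmx-cli release. The only external syntax relied on is npm's standard `package@version` form.

```python
import re
from typing import List

# Accept only exact MAJOR.MINOR.PATCH versions; ranges and tags such as "latest" are rejected.
EXACT_VERSION = re.compile(r"\d+\.\d+\.\d+")


def pinned_install_cmd(version: str, package: str = "mmx-cli") -> List[str]:
    """Return an npm global-install argv pinned to one exact version."""
    if not EXACT_VERSION.fullmatch(version):
        raise ValueError(f"not an exact version: {version!r}")
    return ["npm", "install", "-g", f"{package}@{version}"]


if __name__ == "__main__":
    # "0.0.0" is a placeholder; substitute a version you have verified yourself.
    print(" ".join(pinned_install_cmd("0.0.0")))
```

Before pinning, the package's publisher and published versions can be inspected with `npm view mmx-cli`.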
Credentials
Commands send user-selected prompts, local files, URLs, generated media requests, and optional callback URLs to MiniMax or configured endpoints, which is proportionate for a multimodal provider CLI.
Persistence & Privilege
The skill clearly discloses MiniMax API key use and local persistence in ~/.mmx/credentials.json; this grants account authority within the key's permissions but is expected for the integration.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install minimax-multimodal
- After installation, invoke the skill by name or use /minimax-multimodal
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.2
**Switch to `mmx-cli`: major simplification and platform change**
- Replaces all separate bash scripts and API reference docs with a unified command-line interface (`mmx`).
- Removes 15 legacy shell script and markdown reference files.
- Updates skill name and description to reflect the new `mmx-cli` interface.
- New workflow: install `mmx-cli` via npm and use a single CLI for text, image, video, speech, and music generation.
- All instructions, options, and usage details are consolidated into the new SKILL.md focused on `mmx` commands and agent-friendly flags.
v1.0.1
**MiniMax multimodal toolkit now uses bash scripts, adds image generation, and drops Python dependencies:**
- Switched all major scripts from Python to bash, requiring only `ffmpeg`, `jq`, and `curl` (no pip/Python needed)
- Added support for image generation (text-to-image, image-to-image) via new image generation scripts
- All scripts and workflows updated to use `.sh` bash scripts instead of `.py` Python scripts
- Requires new environment variable `MINIMAX_API_HOST` (in addition to API key) with region selection instructions
- Updated and simplified prerequisites and usage instructions; Python environment setup is no longer needed
v1.0.0
MiniMax multimodal skill initial release — generate speech, music, video, and process media via MiniMax AI.
- Supports TTS (text-to-speech), voice cloning, and custom voice design.
- Music and video creation: text-to-video, image-to-video, templates, multi-scene, and more.
- FFmpeg-powered media tools: convert, concatenate, trim, extract audio/video.
- Writes all outputs to the agent's minimax-output/ directory and requires explicit output paths.
- Includes comprehensive rules for TTS voice segmentation (single/multi-voice) and API key setup guidance.
Frequently Asked Questions
What is Minimax-Multimodal-Toolkit?
Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax mo... It is an AI Agent Skill for Claude Code / OpenClaw, with 5028 downloads so far.
How do I install Minimax-Multimodal-Toolkit?
Run "/install minimax-multimodal" in the OpenClaw or Claude Code chat to install the skill in one step. The skill is instruction-only, so the mmx-cli npm package and a MiniMax API key still need to be set up separately.
Is Minimax-Multimodal-Toolkit free?
Yes, Minimax-Multimodal-Toolkit is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Minimax-Multimodal-Toolkit support?
Minimax-Multimodal-Toolkit is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Minimax-Multimodal-Toolkit?
It is built and maintained by MiniMax-AI (@minimax-ai-dev); the current version is v1.0.2.