← Back to Skills Marketplace

Minimax-Multimodal-Toolkit

Name: Minimax-Multimodal-Toolkit
Author: minimax-ai-dev

by MiniMax-AI · GitHub ↗ · v1.0.2 · MIT-0

cross-platform ✓ Security Clean

5028

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install minimax-multimodal

Description

Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax mo...

Usage Guidance

Before installing, verify the mmx-cli npm package and consider pinning a trusted version. Prefer environment variables or a secure secret manager over putting API keys directly on command lines, protect ~/.mmx/credentials.json, rotate the key if exposed, avoid --yes for costly or account-changing actions unless explicitly approved, and only send files or prompts you are comfortable sharing with MiniMax.

Capability Assessment

ℹ Purpose & Capability

The documented capabilities match the stated purpose: text, image, video, speech, music, vision, search, quota, and configuration through the MiniMax CLI.

ℹ Instruction Scope

The skill recommends agent-friendly flags including --yes and --quiet; this is disclosed and useful for automation, but agents should only suppress confirmations for user-approved actions.

ℹ Install Mechanism

The skill is instruction-only and asks users to install the external npm package mmx-cli globally without a pinned version, so package provenance should be checked before use.

ℹ Credentials

Commands send user-selected prompts, local files, URLs, generated media requests, and optional callback URLs to MiniMax or configured endpoints, which is proportionate for a multimodal provider CLI.

ℹ Persistence & Privilege

The skill clearly discloses MiniMax API key use and local persistence in ~/.mmx/credentials.json; this grants account authority within the key's permissions but is expected for the integration.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install minimax-multimodal
After installation, invoke the skill by name or use /minimax-multimodal
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.2

**Switch to `mmx-cli`: major simplification and platform change** - Replaces all separate bash scripts and API reference docs with a unified command-line interface (`mmx`). - Removes 15 legacy shell script and markdown reference files. - Updates skill name and description to reflect the new `mmx-cli` interface. - New workflow: install `mmx-cli` via npm and use a single CLI for text, image, video, speech, and music generation. - All instructions, options, and usage details are consolidated into the new SKILL.md focused on `mmx` commands and agent-friendly flags.

v1.0.1

**MiniMax multimodal toolkit now uses bash scripts, adds image generation, and drops Python dependencies:** - Switched all major scripts from Python to bash, requiring only `ffmpeg`, `jq`, and `curl` (no pip/Python needed) - Added support for image generation (text-to-image, image-to-image) via new image generation scripts - All scripts and workflows updated to use `.sh` bash scripts instead of `.py` Python scripts - Requires new environment variable `MINIMAX_API_HOST` (in addition to API key) with region selection instructions - Updated and simplified prerequisites and usage instructions; Python environment setup is no longer needed

v1.0.0

MiniMax multimodal skill initial release — generate speech, music, video, and process media via MiniMax AI. - Supports TTS (text-to-speech), voice cloning, and custom voice design. - Music and video creation: text-to-video, image-to-video, templates, multi-scene, and more. - FFmpeg-powered media tools: convert, concatenate, trim, extract audio/video. - Enforces all outputs to agent’s minimax-output/ directory, with explicit output paths required. - Includes comprehensive rules for TTS voice segmentation (single/multi-voice) and API key setup guidance.

Metadata

Slug minimax-multimodal

Version 1.0.2

License MIT-0

All-time Installs 180

Active Installs 27

Total Versions 3

Frequently Asked Questions

What is Minimax-Multimodal-Toolkit?

Use mmx to generate text, images, video, speech, and music via the MiniMax AI platform. Use when the user wants to create media content, chat with MiniMax mo... It is an AI Agent Skill for Claude Code / OpenClaw, with 5028 downloads so far.

How do I install Minimax-Multimodal-Toolkit?

Run "/install minimax-multimodal" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Minimax-Multimodal-Toolkit free?

Yes, Minimax-Multimodal-Toolkit is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Minimax-Multimodal-Toolkit support?

Minimax-Multimodal-Toolkit is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Minimax-Multimodal-Toolkit?

It is built and maintained by MiniMax-AI (@minimax-ai-dev); the current version is v1.0.2.

More Skills