
MiniMax Multimodal Toolkit

Name: MiniMax Multimodal Toolkit
Author: yhlorra

By zylorra · GitHub · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Total downloads: 1344 · Current installs: 2 · Versions: 1

Install in OpenClaw

/install yh-minimax-multimodal-toolkit

Description

Generate and process speech, music, video, and images using MiniMax AI with voice cloning, custom voices, multi-scene video, and FFmpeg-based media tools.

Safe usage recommendations

This skill contains shell scripts that legitimately call MiniMax APIs and use FFmpeg. Before installing or running it:

- Expect to provide MINIMAX_API_KEY and MINIMAX_API_HOST. SKILL.md requires both even though the registry metadata does not list them, and that omission is a red flag. Ask the publisher to correct the metadata if you need vetted declarations.
- The scripts load .env files from the skill project root and from your current working directory, and export each key=value pair they find. To avoid unintentionally exposing unrelated secrets, run the scripts from an isolated directory that does not contain a .env with other credentials, or inspect and remove any .env first.
- check_environment.sh can print part of the API key when the key format is invalid. Avoid running these scripts where stdout/stderr is logged to systems you do not control, and avoid pasting your key into public chat.
- If you proceed, use a scoped/limited MiniMax API key, test in an isolated environment (container or ephemeral VM), and rotate the key afterward.

Review the scripts (they are pure bash) if you have security concerns; they only contact the documented MiniMax hosts (https://api.minimaxi.com or https://api.minimax.io) and use curl/ffmpeg/jq/xxd. Given the mismatch between metadata and runtime requirements and the .env-loading behavior, treat this skill cautiously: have the metadata corrected, or sandbox the skill, before use.
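As a rough illustration of the "isolated directory" advice, the sketch below checks that a working directory contains no .env before anything is run from it. The helper name `dir_is_env_free` is hypothetical and is not part of the skill; it only tests for the file's presence and does not inspect its contents.

```shell
# Hypothetical helper, not part of the skill.
# Succeeds only if DIR (default: current directory) contains no .env file.
dir_is_env_free() {
  local dir="${1:-.}"
  [ ! -e "$dir/.env" ]
}

# Create an empty scratch directory to run the skill's scripts from.
workdir="$(mktemp -d)"
if dir_is_env_free "$workdir"; then
  echo "ok: $workdir has no .env"
else
  echo "refusing to run: $workdir contains a .env" >&2
fi
```

Running from a fresh `mktemp -d` directory only addresses the working-directory .env; per the assessment above, the scripts also read a .env in the skill project root, which should be inspected separately.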

Functional analysis

Type: OpenClaw Skill
Name: yh-minimax-multimodal-toolkit
Version: 1.0.0

The toolkit is a legitimate and well-structured set of Bash scripts for interacting with MiniMax multimodal APIs (TTS, Music, Video, and Image). It utilizes standard system utilities like curl, ffmpeg, jq, and xxd to handle API requests and media processing. The instructions in SKILL.md are focused on task execution and include safety-conscious directions, such as ensuring output is restricted to a specific directory and advising the agent to seek user confirmation when configuring environment variables. No evidence of data exfiltration, malicious persistence, or deceptive prompt injection was found.

Capability assessment

⚠ Purpose & Capability

The skill's name and description (MiniMax multimodal generation) match the included scripts (TTS, music, image, video, FFmpeg tools). However, the registry metadata lists no required environment variables or primary credential, while SKILL.md and the scripts clearly require MINIMAX_API_KEY and MINIMAX_API_HOST. Users should be aware of this inconsistency.

⚠ Instruction Scope

SKILL.md instructs the agent/user to set MINIMAX_API_KEY and MINIMAX_API_HOST and to run the provided bash scripts. The scripts themselves load a .env from two locations (the skill project root and the agent's current working directory) and export every key=value pair they find that is not already set. Loading and exporting arbitrary keys from the .env in the agent's working directory may pull unrelated secrets into the script environment. In addition, check_environment.sh prints part of the API key when its format is invalid, which could leak key fragments into logs.
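One way to avoid leaking key fragments into logs is to report only whether a key is set and how long it is. The following is a minimal sketch of that idea; `describe_key` is a hypothetical helper and does not reflect how check_environment.sh is actually written.

```shell
# Hypothetical helper, not part of the skill: describe a key without
# echoing any of its characters.
describe_key() {
  local key="$1"
  if [ -z "$key" ]; then
    echo "key: <unset>"
  else
    # Only the length is printed, never the key material itself.
    echo "key: <redacted, ${#key} chars>"
  fi
}

# Example: describe_key "$MINIMAX_API_KEY"
```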

✓ Install Mechanism

There is no installer (instruction-only install spec); the tool consists purely of shell scripts that use standard system binaries (curl, ffmpeg, jq, xxd). The required tools are proportionate to media generation/processing tasks, and no remote arbitrary-code download/install URLs are present.
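Because nothing is installed automatically, the listed binaries have to be present on the host already. A small pre-flight check along these lines can confirm that; `missing_tools` is a hypothetical helper and is not shipped with the skill.

```shell
# Hypothetical pre-flight check, not part of the skill.
# Prints the name of each given tool that is not on PATH and returns
# non-zero if at least one is missing.
missing_tools() {
  local missing=0 tool
  for tool in "$@"; do
    if ! command -v "$tool" >/dev/null 2>&1; then
      echo "$tool"
      missing=1
    fi
  done
  return "$missing"
}

# Example: missing_tools curl ffmpeg jq xxd || echo "install the tools listed above"
```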

⚠ Credentials

The runtime requires MINIMAX_API_KEY and MINIMAX_API_HOST (documented in SKILL.md and used in scripts) but the skill registry metadata declares no required env vars or primary credential — a direct mismatch. Additionally, the load_env behavior will import and export any variables found in $(pwd)/.env (agent cwd), which can cause unrelated secrets to be exposed to the script environment. The scripts do transmit MINIMAX_API_KEY to the documented MiniMax endpoints (expected), but exporting other env vars without restriction is disproportionate.
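For contrast with the unrestricted export described above, here is a sketch of an allow-listed loader that exports only the two variables the skill documents. It is not the skill's load_env; `load_env_allowlisted` is a hypothetical name, and the parsing is deliberately simple (plain KEY=VALUE lines, no quote handling).

```shell
# Hypothetical alternative loader, not the skill's load_env.
# Reads KEY=VALUE lines from FILE and exports only MINIMAX_API_KEY and
# MINIMAX_API_HOST, leaving any value that is already set untouched.
load_env_allowlisted() {
  local file="$1" line key value
  [ -f "$file" ] || return 0
  while IFS= read -r line || [ -n "$line" ]; do
    case "$line" in
      ''|'#'*) continue ;;   # skip blank lines and comments
      *=*) ;;                # looks like KEY=VALUE, keep going
      *) continue ;;
    esac
    key="${line%%=*}"
    value="${line#*=}"
    case "$key" in
      MINIMAX_API_KEY|MINIMAX_API_HOST)
        # eval is safe here because key is one of the two literal names.
        if eval "[ -z \"\${$key+x}\" ]"; then
          export "$key=$value"
        fi
        ;;
    esac
  done < "$file"
}
```

Any other key in the file, such as an unrelated cloud credential, is ignored rather than exported into the script environment.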

✓ Persistence & Privilege

The skill does not request permanent/always-on inclusion and does not modify other skills or system-wide settings. It runs as invoked. No elevated platform privileges are requested in the bundle.

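How to use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install yh-minimax-multimodal-toolkit
Once installed, invoke the Skill by name or trigger it with /yh-minimax-multimodal-toolkit
Provide the inputs described in the Skill's parameter documentation to receive structured output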

Version history

v1.0.0

Initial publish

Metadata

Slug yh-minimax-multimodal-toolkit

Version 1.0.0

License MIT-0

Total installs 2

Current installs 2

Versions 1

FAQ

What is MiniMax Multimodal Toolkit?

Generate and process speech, music, video, and images using MiniMax AI with voice cloning, custom voices, multi-scene video, and FFmpeg-based media tools. It is an AI agent Skill plugin for Claude Code / OpenClaw, with 1344 total downloads to date.

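How do I install MiniMax Multimodal Toolkit?

Run the command "/install yh-minimax-multimodal-toolkit" in the OpenClaw or Claude Code chat box to install it in one step. Before the scripts can run, MINIMAX_API_KEY and MINIMAX_API_HOST must be set, as SKILL.md requires.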

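Is MiniMax Multimodal Toolkit free?

Yes. MiniMax Multimodal Toolkit is free and released under the MIT-0 license, so it can be downloaded, installed, and used freely. Running it does require your own MiniMax API key (MINIMAX_API_KEY).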

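Which platforms does MiniMax Multimodal Toolkit support?

MiniMax Multimodal Toolkit is cross-platform: it can be used in any environment where OpenClaw / Claude Code is deployed, provided bash, curl, ffmpeg, jq, and xxd are available.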

Who developed MiniMax Multimodal Toolkit?

It is developed and maintained by zylorra (@yhlorra); the current version is v1.0.0.