← 返回 Skills 市场
78
总下载
0
收藏
0
当前安装
2
版本数
在 OpenClaw 中安装
/install mimo-tts-asr
功能描述
Xiaomi MiMo V2.5 TTS + ASR 全能语音技能。支持高质量中英文语音合成(TTS)和语音识别(ASR)。 TTS: 三款模型(精品音色 / VoiceDesign 音色设计 / VoiceClone 音色克隆)、方言支持、情感控制、多格式输出。 ASR: 音频转文字、多语言识别、方言、Code...
安全使用建议
This skill appears to implement the advertised MiMo TTS and ASR features, but there are a few things to check before installing:
- Provenance: The SKILL.md links to Xiaomi/MiMo domains, but the package author/owner in metadata is not clearly an official Xiaomi account. Verify the skill's source (official repo or vendor) before trusting it.
- API keys: The scripts require MIMO_API_KEY and/or MIMO_ASR_KEY, but the registry metadata lists no required env vars. Expect to provide an API key if you want cloud calls to work. Do not supply a high-privilege or unrelated credential; create a dedicated key with minimal scopes if possible.
- Privacy: Using the cloud API uploads audio (and any reference audio used for voice-clone) to api.xiaomimimo.com. If audio contains sensitive or personal data, consider running the open-source local ASR model instead or avoid uploading sensitive recordings.
- Voice cloning legality/consent: Voice-clone will upload reference audio (encoded in the request). Ensure you have consent to clone any person's voice.
- Practical step: Inspect the included scripts (they are small and readable) and test the skill in a sandboxed environment with a throwaway API key or using local open-source models before enabling it for real data.
If you need higher assurance, ask the publisher for a homepage or official vendor verification and request that required env vars be declared in the registry metadata.
功能分析
Type: OpenClaw Skill
Name: mimo-tts-asr
Version: 2.5.4
The skill bundle provides legitimate Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) functionality via the Xiaomi MiMo API. The Python scripts (asr.py and tts.py) are well-structured API clients that use standard libraries (urllib) for network communication and include robust features like automatic retries, error handling, and argument validation. The documentation (SKILL.md) is comprehensive, providing clear usage examples and official resource links that align perfectly with the code's logic, with no evidence of malicious intent, obfuscation, or prompt injection.
能力标签
能力评估
Purpose & Capability
The name/description match the included scripts and behavior: TTS and ASR functionality is implemented, with support for voice-design and voice-clone. However the registry lists no required credentials while both SKILL.md and the scripts clearly require API keys (MIMO_API_KEY / MIMO_ASR_KEY). Also the package author/owner in metadata (aaroncxxx / kn75hrty...) does not match an official Xiaomi homepage; the SKILL.md links point to Xiaomi domains, but the skill's provenance is not verified.
Instruction Scope
Runtime instructions and the included scripts are narrowly scoped to reading audio/text and calling MiMo APIs (or offering local open-source model use). They do not attempt to read unrelated system files or environment variables beyond the service keys. Important: both ASR and TTS implementations upload audio (including reference audio for voice-clone encoded in base64) to external endpoints (api.xiaomimimo.com/platform.xiaomimimo.com). Users should expect audio and reference clips to be transmitted to that service.
Install Mechanism
This is an instruction-only skill with two small Python scripts included and no install spec or external downloads. No archive downloads, package installs, or post-install scripts are present — low filesystem/install risk.
Credentials
The skill requires API keys (MIMO_API_KEY and/or MIMO_ASR_KEY) to call the vendor's APIs, which is reasonable for a cloud TTS/ASR integration. The concern is that the registry metadata did not declare any required env vars or a primary credential, creating a mismatch between claims and actual needs. Because audio (and reference audio for cloning) will be uploaded, supplying keys grants the skill/network access tied to your account — confirm scopes, billing, and data-retention policies before providing keys.
Persistence & Privilege
The skill does not request always:true and does not attempt to modify other skills or system-wide settings. It relies on agent invocation as normal. No elevated persistence or privileged system access is requested.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install mimo-tts-asr - 安装完成后,直接呼叫该 Skill 的名称或使用
/mimo-tts-asr触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v2.5.4
## v2.5.4
- 新增 VoiceDesign(音色设计)和 VoiceClone(音色克隆)TTS 模型支持
- TTS 支持三种模型选择(精品音色/VoiceDesign/VoiceClone),扩展参数说明及示例
- ASR 支持本地部署,补充开源项目文档与演示链接
- 文档增加官方 API/发布/体验/开源资源汇总,内容优化,与官方发布说明对齐
v2.5.2
Xiaomi MiMo V2.5.2 TTS + ASR: High-quality text-to-speech and speech recognition in one skill.
- Integrated TTS (text-to-speech) and ASR (automatic speech recognition) for seamless voice workflows
- TTS: 7 unique voices, dialect and emotion support, singing mode, inline audio tags, export as WAV/MP3/OGG
- ASR: Audio-to-text with auto/zh/en/ja/ko multi-language recognition, outputs in text, JSON (with timestamps), or SRT (subtitles)
- Flexible CLI usage and API key management, supports automatic retries and rate limiting
- Streamlined workflow to recognize, edit, and re-synthesize audio content
元数据
常见问题
Mimo Tts Asr 是什么?
Xiaomi MiMo V2.5 TTS + ASR 全能语音技能。支持高质量中英文语音合成(TTS)和语音识别(ASR)。 TTS: 三款模型(精品音色 / VoiceDesign 音色设计 / VoiceClone 音色克隆)、方言支持、情感控制、多格式输出。 ASR: 音频转文字、多语言识别、方言、Code... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 78 次。
如何安装 Mimo Tts Asr?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install mimo-tts-asr」即可一键安装,无需额外配置。
Mimo Tts Asr 是免费的吗?
是的,Mimo Tts Asr 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Mimo Tts Asr 支持哪些平台?
Mimo Tts Asr 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Mimo Tts Asr?
由 aaroncxxx(@aaroncxxx)开发并维护,当前版本 v2.5.4。
推荐 Skills