← Back to Skills Marketplace
78
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install mimo-tts-asr
Description
Xiaomi MiMo V2.5 TTS + ASR 全能语音技能。支持高质量中英文语音合成(TTS)和语音识别(ASR)。 TTS: 三款模型(精品音色 / VoiceDesign 音色设计 / VoiceClone 音色克隆)、方言支持、情感控制、多格式输出。 ASR: 音频转文字、多语言识别、方言、Code...
Usage Guidance
This skill appears to implement the advertised MiMo TTS and ASR features, but there are a few things to check before installing:
- Provenance: The SKILL.md links to Xiaomi/MiMo domains, but the package author/owner in metadata is not clearly an official Xiaomi account. Verify the skill's source (official repo or vendor) before trusting it.
- API keys: The scripts require MIMO_API_KEY and/or MIMO_ASR_KEY, but the registry metadata lists no required env vars. Expect to provide an API key if you want cloud calls to work. Do not supply a high-privilege or unrelated credential; create a dedicated key with minimal scopes if possible.
- Privacy: Using the cloud API uploads audio (and any reference audio used for voice-clone) to api.xiaomimimo.com. If audio contains sensitive or personal data, consider running the open-source local ASR model instead or avoid uploading sensitive recordings.
- Voice cloning legality/consent: Voice-clone will upload reference audio (encoded in the request). Ensure you have consent to clone any person's voice.
- Practical step: Inspect the included scripts (they are small and readable) and test the skill in a sandboxed environment with a throwaway API key or using local open-source models before enabling it for real data.
If you need higher assurance, ask the publisher for a homepage or official vendor verification and request that required env vars be declared in the registry metadata.
Capability Analysis
Type: OpenClaw Skill
Name: mimo-tts-asr
Version: 2.5.4
The skill bundle provides legitimate Text-to-Speech (TTS) and Automatic Speech Recognition (ASR) functionality via the Xiaomi MiMo API. The Python scripts (asr.py and tts.py) are well-structured API clients that use standard libraries (urllib) for network communication and include robust features like automatic retries, error handling, and argument validation. The documentation (SKILL.md) is comprehensive, providing clear usage examples and official resource links that align perfectly with the code's logic, with no evidence of malicious intent, obfuscation, or prompt injection.
Capability Tags
Capability Assessment
Purpose & Capability
The name/description match the included scripts and behavior: TTS and ASR functionality is implemented, with support for voice-design and voice-clone. However the registry lists no required credentials while both SKILL.md and the scripts clearly require API keys (MIMO_API_KEY / MIMO_ASR_KEY). Also the package author/owner in metadata (aaroncxxx / kn75hrty...) does not match an official Xiaomi homepage; the SKILL.md links point to Xiaomi domains, but the skill's provenance is not verified.
Instruction Scope
Runtime instructions and the included scripts are narrowly scoped to reading audio/text and calling MiMo APIs (or offering local open-source model use). They do not attempt to read unrelated system files or environment variables beyond the service keys. Important: both ASR and TTS implementations upload audio (including reference audio for voice-clone encoded in base64) to external endpoints (api.xiaomimimo.com/platform.xiaomimimo.com). Users should expect audio and reference clips to be transmitted to that service.
Install Mechanism
This is an instruction-only skill with two small Python scripts included and no install spec or external downloads. No archive downloads, package installs, or post-install scripts are present — low filesystem/install risk.
Credentials
The skill requires API keys (MIMO_API_KEY and/or MIMO_ASR_KEY) to call the vendor's APIs, which is reasonable for a cloud TTS/ASR integration. The concern is that the registry metadata did not declare any required env vars or a primary credential, creating a mismatch between claims and actual needs. Because audio (and reference audio for cloning) will be uploaded, supplying keys grants the skill/network access tied to your account — confirm scopes, billing, and data-retention policies before providing keys.
Persistence & Privilege
The skill does not request always:true and does not attempt to modify other skills or system-wide settings. It relies on agent invocation as normal. No elevated persistence or privileged system access is requested.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install mimo-tts-asr - After installation, invoke the skill by name or use
/mimo-tts-asr - Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.5.4
## v2.5.4
- 新增 VoiceDesign(音色设计)和 VoiceClone(音色克隆)TTS 模型支持
- TTS 支持三种模型选择(精品音色/VoiceDesign/VoiceClone),扩展参数说明及示例
- ASR 支持本地部署,补充开源项目文档与演示链接
- 文档增加官方 API/发布/体验/开源资源汇总,内容优化,与官方发布说明对齐
v2.5.2
Xiaomi MiMo V2.5.2 TTS + ASR: High-quality text-to-speech and speech recognition in one skill.
- Integrated TTS (text-to-speech) and ASR (automatic speech recognition) for seamless voice workflows
- TTS: 7 unique voices, dialect and emotion support, singing mode, inline audio tags, export as WAV/MP3/OGG
- ASR: Audio-to-text with auto/zh/en/ja/ko multi-language recognition, outputs in text, JSON (with timestamps), or SRT (subtitles)
- Flexible CLI usage and API key management, supports automatic retries and rate limiting
- Streamlined workflow to recognize, edit, and re-synthesize audio content
Metadata
Frequently Asked Questions
What is Mimo Tts Asr?
Xiaomi MiMo V2.5 TTS + ASR 全能语音技能。支持高质量中英文语音合成(TTS)和语音识别(ASR)。 TTS: 三款模型(精品音色 / VoiceDesign 音色设计 / VoiceClone 音色克隆)、方言支持、情感控制、多格式输出。 ASR: 音频转文字、多语言识别、方言、Code... It is an AI Agent Skill for Claude Code / OpenClaw, with 78 downloads so far.
How do I install Mimo Tts Asr?
Run "/install mimo-tts-asr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Mimo Tts Asr free?
Yes, Mimo Tts Asr is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Mimo Tts Asr support?
Mimo Tts Asr is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Mimo Tts Asr?
It is built and maintained by aaroncxxx (@aaroncxxx); the current version is v2.5.4.
More Skills