mimo-asr

Name: mimo-asr
Author: xcchenx345

By xcchenx345 · GitHub · v1.0.0 · MIT-0

cross-platform ⚠ pending

Install in OpenClaw

/install mimo-asr

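Description

A speech recognition skill for Xiaomi MiMo-V2.5-ASR. It calls the model through a Gradio API and transcribes audio to text, with automatic Chinese/English detection or an explicitly specified language. No local model and no API key are required. Use it when a user asks for speech-to-text, audio transcription, or extraction of spoken dialogue.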

Usage (SKILL.md)

MiMo-V2.5-ASR Speech Recognition (Chinese/English)

Transcribe audio via Xiaomi's free public Gradio API. No local model and no API key are needed.

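Why This One?

Compared with installing a 4 GB local model plus the mimo-audio package, this version offers:

✅ Near-zero dependencies (only requests)
✅ Zero configuration (nothing to download)
✅ Free, unlimited calls
✅ Chinese, English, and automatic language detection
❌ Requires a network connection (audio is sent to the cloud for processing)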

Quick Start

# Install the only dependency
pip install requests

# Auto language detection
python scripts/mimo_asr_api.py recording.wav

# Specify Chinese
python scripts/mimo_asr_api.py speech.mp3 --language zh

# Specify English
python scripts/mimo_asr_api.py lecture.wav --language en

# Write the transcript to a file
python scripts/mimo_asr_api.py meeting.wav --output transcript.txt

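Parameters

Parameter	Description	Default
`audio`	Path to the audio file (wav/mp3/m4a, etc.)	Required
`--language`	Language: `auto` (auto-detect), `zh` (Chinese), `en` (English)	`auto`
`--output`	Path of the output text file (optional)	Print to the terminal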
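
The table above maps naturally onto Python's standard argparse module. The following is a minimal sketch of how such a command line could be declared. It is not taken from scripts/mimo_asr_api.py, and the function name build_parser is hypothetical.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Declare the three documented parameters (hypothetical helper)."""
    parser = argparse.ArgumentParser(
        description="Transcribe an audio file with MiMo-V2.5-ASR."
    )
    # Required positional argument: the audio file to transcribe.
    parser.add_argument("audio", help="Path to the audio file (wav/mp3/m4a, etc.)")
    # Optional language selector; defaults to automatic detection.
    parser.add_argument(
        "--language",
        choices=["auto", "zh", "en"],
        default="auto",
        help="auto (detect), zh (Chinese), or en (English)",
    )
    # Optional output file; None means the transcript goes to the terminal.
    parser.add_argument("--output", default=None, help="Output text file path")
    return parser


# Parse an explicit argument list instead of sys.argv for demonstration.
args = build_parser().parse_args(["speech.mp3", "--language", "zh"])
print(args.audio, args.language, args.output)  # prints: speech.mp3 zh None
```

Restricting --language with choices makes argparse reject unsupported values before any audio is uploaded.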

Examples

# 🪟 Windows
python scripts/mimo_asr_api.py C:\Users\abc15\Desktop\录音.wav --language zh --output output.txt

# 🍎 macOS / 🐧 Linux
python scripts/mimo_asr_api.py ~/Desktop/meeting.mp3 --language auto

How It Works

The script calls Xiaomi's official Hugging Face Space via the Gradio API:

https://xiaomimimo-mimo-v2-5-asr.hf.space

Steps:

Upload the audio → /gradio_api/upload
Start inference → /gradio_api/call/infer (passes the audio URL and the language tag)
Poll for the result → retrieve the transcript
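
The three steps can be sketched in Python roughly as follows. This is an illustration rather than the contents of scripts/mimo_asr_api.py. It assumes the Space follows Gradio's general HTTP conventions: a multipart upload that returns a list of server-side paths, a POST that returns an event_id, and a server-sent event stream that ends with a complete event. The request body shape, the language values the Space accepts, and the helper names build_infer_payload, parse_sse_result, and transcribe are all assumptions.

```python
import json

BASE_URL = "https://xiaomimimo-mimo-v2-5-asr.hf.space"  # from the text above


def build_infer_payload(uploaded_path: str, language: str = "auto") -> dict:
    """Assumed request body for /gradio_api/call/infer (not confirmed by the source)."""
    file_data = {"path": uploaded_path, "meta": {"_type": "gradio.FileData"}}
    return {"data": [file_data, language]}


def parse_sse_result(stream_text: str):
    """Return the JSON payload of the first 'complete' event, or None if there is none."""
    event = None
    for line in stream_text.splitlines():
        if line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            payload = line[len("data:"):].strip()
            if event == "complete":
                return json.loads(payload)
            if event == "error":
                raise RuntimeError(f"inference failed: {payload}")
    return None


def transcribe(audio_path: str, language: str = "auto", timeout: float = 300.0):
    """Upload, start inference, then read the result stream (requires network access)."""
    import requests  # third-party; imported here so the helpers above work without it

    # Step 1: upload the audio file; the response is assumed to be a list of paths.
    with open(audio_path, "rb") as fh:
        upload = requests.post(
            f"{BASE_URL}/gradio_api/upload", files={"files": fh}, timeout=timeout
        )
    upload.raise_for_status()
    uploaded_path = upload.json()[0]

    # Step 2: start inference and obtain the event ID.
    call = requests.post(
        f"{BASE_URL}/gradio_api/call/infer",
        json=build_infer_payload(uploaded_path, language),
        timeout=timeout,
    )
    call.raise_for_status()
    event_id = call.json()["event_id"]

    # Step 3: read the event stream until the server closes it, then parse it.
    result = requests.get(
        f"{BASE_URL}/gradio_api/call/infer/{event_id}", timeout=timeout
    )
    result.raise_for_status()
    return parse_sse_result(result.text)
```

Calling transcribe("recording.wav", "zh") would return whatever output list the Space emits in its complete event. Which element of that list holds the transcript is not stated in the source, so inspect the response before relying on an index.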

Supported Formats

WAV, MP3, M4A, FLAC, OGG, and other common audio formats.
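
Because audio is uploaded before it is processed, a quick local check of the file extension can catch obvious mistakes early. The sketch below covers only the formats named above. The set KNOWN_SUFFIXES and the function looks_like_supported_audio are hypothetical names, and since the list is open-ended ("other common audio formats"), a False result is better treated as a warning than as a hard rejection.

```python
from pathlib import Path

# Extensions explicitly named in the list above; the real service may accept more.
KNOWN_SUFFIXES = {".wav", ".mp3", ".m4a", ".flac", ".ogg"}


def looks_like_supported_audio(path: str) -> bool:
    """Return True when the file extension matches a named format (case-insensitive)."""
    return Path(path).suffix.lower() in KNOWN_SUFFIXES
```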

Notes

Large files take longer to process
Internet connection required
No MIMO_API_KEY needed
Free to use

Capability Tags

requires-sensitive-credentials

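How to Use

Make sure OpenClaw is installed (locally or via Docker)
Enter the install command in the chat box: /install mimo-asr
After installation, invoke the Skill by name or trigger it with /mimo-asr
Provide the inputs described in the Skill's parameters to get structured output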

Version History

v1.0.0

MiMo-V2.5-ASR Skill v1.0.0
- Initial release of the MiMo-ASR skill for audio-to-text transcription.
- Uses Xiaomi's public Gradio API for speech recognition; no local model or API key required.
- Supports Chinese, English, and auto language detection.
- Compatible with major audio formats (wav, mp3, m4a, flac, ogg, etc.).
- Simple setup: only requires the `requests` library and an internet connection.
- Free unlimited usage.

Metadata

Slug mimo-asr

Version 1.0.0

License MIT-0

Total installs 0

Current installs 0

Versions 1

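FAQ

What is mimo-asr?

mimo-asr is a speech recognition skill for Xiaomi MiMo-V2.5-ASR. It calls the model through a Gradio API and transcribes audio to text, with automatic Chinese/English detection or an explicitly specified language. No local model and no API key are required. Use it when a user asks for speech-to-text, audio transcription, or extraction of spoken dialogue. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 50 times so far.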

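How do I install mimo-asr?

Run the command "/install mimo-asr" in the OpenClaw or Claude Code chat box to install it in one step. No extra configuration is needed.

Is mimo-asr free?

Yes. mimo-asr is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used without restriction.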

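Which platforms does mimo-asr support?

mimo-asr is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops mimo-asr?

It is developed and maintained by xcchenx345 (@xcchenx345). The current version is v1.0.0.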