Audio Command Handler
/install audio-command-handler
Process audio messages and execute them as commands.
Workflow
Scenario 1: Audio Only (No Text)
User sends an audio file without any text instruction:
- Transcribe the audio using the ifly-speed-transcription skill
- Use the transcription as the command: execute it as if the user had typed it
- Return the result directly: no file upload is needed, regardless of length
Scenario 2: Audio + Text Command
User sends an audio file WITH a text instruction:
- Transcribe the audio using the ifly-speed-transcription skill
- Execute the text command with the transcription as context/input
- Check result length:
  - If ≤ 58 characters: return the result directly
  - If > 58 characters: save to a file, upload via the uploader skill, and return the URL
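The length check in Scenario 2 can be sketched as follows. This is a minimal illustration, not the skill's actual code: the function name `route_result`, the output file name `result.txt`, and the returned tuple shape are assumptions made for the example. Length is measured with `len()`, which counts Unicode code points, so a Chinese character counts as one.

```python
from pathlib import Path

RESULT_THRESHOLD = 58  # characters, the threshold stated in the workflow


def route_result(result: str, out_dir: Path, threshold: int = RESULT_THRESHOLD):
    """Decide how a Scenario 2 result is delivered.

    Returns ("direct", text) when the result fits within the threshold,
    otherwise writes it to a file and returns ("file", path) so the caller
    can hand that path to the uploader skill.
    """
    if len(result) <= threshold:
        return ("direct", result)
    out_dir.mkdir(parents=True, exist_ok=True)
    out_path = out_dir / "result.txt"  # hypothetical file name
    out_path.write_text(result, encoding="utf-8")
    return ("file", str(out_path))
```

A result of exactly 58 characters is still returned directly; only 59 or more triggers the file path.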
Quick Reference
Transcription
python3 ~/.openclaw/workspace/skills/ifly-speed-transcription/scripts/transcribe.py /path/to/audio.mp3
Upload
python3 ~/.openclaw/workspace/skills/uploader/scripts/upload_media.py /path/to/file.txt
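If the two scripts need to be called from Python rather than from a shell, a thin wrapper along these lines may help. The script paths come from the commands above; the helper names `build_command` and `run_script` are hypothetical, and the sketch assumes each script prints its result (the transcription or the uploaded URL) to standard output, which the source does not document.

```python
import subprocess
from pathlib import Path

SKILLS_DIR = Path.home() / ".openclaw" / "workspace" / "skills"
TRANSCRIBE = SKILLS_DIR / "ifly-speed-transcription" / "scripts" / "transcribe.py"
UPLOAD = SKILLS_DIR / "uploader" / "scripts" / "upload_media.py"


def build_command(script: Path, target: str) -> list[str]:
    """Build the argv list equivalent to `python3 <script> <target>`."""
    return ["python3", str(script), target]


def run_script(script: Path, target: str) -> str:
    """Run a skill script and return its stdout.

    Assumption: the script writes its result to stdout. check=True raises
    CalledProcessError if the script exits with a non-zero status.
    """
    done = subprocess.run(
        build_command(script, target),
        capture_output=True,
        text=True,
        check=True,
    )
    return done.stdout.strip()
```

For example, `run_script(TRANSCRIBE, "/path/to/audio.mp3")` would invoke the transcription command shown above.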
Execution Flow
┌─────────────────┐
│  Audio Message  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│   Transcribe    │
│  (ifly-speed-   │
│ transcription)  │
└────────┬────────┘
         │
         ▼
┌─────────────────┐     NO      ┌──────────────┐
│  Has Text Cmd?  │────────────►│ Use Transcrip│
└────────┬────────┘             │  as Command  │
         │ YES                  └──────┬───────┘
         ▼                             │
┌─────────────────┐                    │
│  Execute Text   │                    │
│ Cmd with Trans  │                    │
│     Context     │                    │
└────────┬────────┘                    │
         │                             │
         │                             ▼
         │                      ┌──────────────┐
         │                      │ Return Direct│
         │                      │   to User    │
         │                      │ (no upload)  │
         │                      └──────────────┘
         │
         ▼
┌─────────────────┐
│ Result > 58 ch? │
└────────┬────────┘
         │
         ├─────────────────────────────┐
         │ YES                         │ NO
         ▼                             ▼
┌─────────────────┐             ┌──────────────┐
│  Save to File   │             │ Return Direct│
│   Upload via    │             │   to User    │
│ uploader skill  │             └──────────────┘
└────────┬────────┘
         │
         ▼
┌─────────────────┐
│  Return URL to  │
│      User       │
└─────────────────┘
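The flow in the diagram can be expressed as a single dispatch function. This is a sketch under stated assumptions: `handle_audio_message` and its three callable parameters (`transcribe`, `execute`, `upload`) are hypothetical names standing in for the transcription skill, the agent's command execution, and the save-and-upload step, none of which are specified in the source.

```python
from typing import Callable, Optional


def handle_audio_message(
    audio_path: str,
    text_cmd: Optional[str],
    transcribe: Callable[[str], str],
    execute: Callable[[str, Optional[str]], str],
    upload: Callable[[str], str],
    threshold: int = 58,
) -> str:
    """Route an audio message through the two scenarios.

    transcribe(path) -> transcript text
    execute(command, context) -> result text
    upload(result_text) -> URL (assumed to save the text to a file first)
    """
    transcript = transcribe(audio_path)

    if not text_cmd:
        # Scenario 1: the transcript itself is the command, and the
        # result is returned directly regardless of its length.
        return execute(transcript, None)

    # Scenario 2: run the text command with the transcript as context.
    result = execute(text_cmd, transcript)
    if len(result) <= threshold:
        return result
    return upload(result)
```

Passing the three steps in as callables keeps the branching logic testable without a real audio file or network access.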
Example Scenarios
Example 1: Audio Only
User sends: 🎤 audio file (speech: "帮我查一下明天上海的天气", in English "Check tomorrow's weather in Shanghai for me")
Flow:
- Transcribe → "帮我查一下明天上海的天气"
- Execute as command → check Shanghai weather for tomorrow
- Return weather info directly (no upload, regardless of length)
Example 2: Audio + Command (Short Result)
User sends: 🎤 audio file + text "帮我总结这段录音" ("Summarize this recording for me")
Flow:
- Transcribe audio → get text content
- Execute "帮我总结这段录音" with transcription as context
- If summary ≤ 58 chars → return directly
Example 3: Audio + Command (Long Result)
User sends: 🎤 audio file + text "帮我根据这段录音写一篇文章" ("Write an article based on this recording for me")
Flow:
- Transcribe audio → get text content
- Execute command with transcription as context
- Result > 58 chars → save to file, upload
- Return: "已生成内容,下载链接:https://..." ("Content generated, download link: https://...")
Notes
- Audio formats: WAV, PCM, MP3 (16kHz, 16-bit, mono recommended)
- Max duration: 5 hours
- Language support: Chinese, English, 202+ Chinese dialects
- Result threshold: 58 characters (configurable per implementation)
- File location: saved to ~/.openclaw/workspace/ before upload
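The recommended audio parameters in the notes above can be checked before transcription. The sketch below covers WAV files only, using Python's standard `wave` module; raw PCM and MP3 would need other tooling. The function name `check_wav` and the warning messages are illustrative assumptions, not part of the skill.

```python
import wave

MAX_SECONDS = 5 * 60 * 60  # the 5-hour duration limit from the notes


def check_wav(path: str) -> list[str]:
    """Return a list of warnings; an empty list means the file matches
    the recommended 16 kHz, 16-bit, mono format and the duration limit."""
    warnings = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        if rate != 16000:
            warnings.append(f"sample rate is {rate} Hz, 16000 Hz recommended")
        if wav.getsampwidth() != 2:  # sample width in bytes; 2 bytes = 16-bit
            warnings.append(f"sample width is {wav.getsampwidth() * 8}-bit, 16-bit recommended")
        if wav.getnchannels() != 1:
            warnings.append(f"{wav.getnchannels()} channels, mono recommended")
        if wav.getnframes() / rate > MAX_SECONDS:
            warnings.append("duration exceeds 5 hours")
    return warnings
```

Because the notes describe the format as recommended rather than required, the function reports warnings instead of raising an error.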
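Installation
- Make sure OpenClaw is installed (local or Docker deployment)
- Enter the install command in the chat dialog: /install audio-command-handler
- Once installation completes, invoke the skill by name or trigger it with /audio-command-handler
- Provide the inputs described in the skill's parameter documentation to get structured output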
What is Audio Command Handler?
Handle audio messages as commands. When user sends an audio file (WAV/PCM/MP3), transcribe it using iFlytek Speed Transcription and either (1) execute the tr... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 49 downloads to date.
How do I install Audio Command Handler?
Run the command "/install audio-command-handler" in the OpenClaw or Claude Code chat dialog to install it in one step; no additional configuration is needed.
Is Audio Command Handler free?
Yes. Audio Command Handler is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does Audio Command Handler support?
Audio Command Handler is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed Audio Command Handler?
It is developed and maintained by smallKeyboy (@smallkeyboy). The current version is v1.0.0.