/install coze-asr
Coze Automatic Speech Recognition (ASR)
Transcribe audio files to text using Coze API.
Setup
1. Get your API Key: Get a key from Coze Platform
2. Set it in your environment:
export COZE_API_KEY="your-key-here"
Supported Audio Formats
- MP3 - Recommended
- WAV - Supported
- OGG - Supported (包括 opus 编码)
Note: Coze API 原生支持 mp3、wav、ogg 格式,无需转换。
Usage
Basic Transcription
Transcribe an audio file:
bash scripts/speech_to_text.sh recording.mp3
Full Options
bash scripts/speech_to_text.sh \x3Caudio_file> [language]
Parameters:
audio_file(required): Path to audio filelanguage(optional): Language code (default: zh)
Output Format
The script outputs JSON with transcribed text.
Example output:
{
"text": "你好,这是转录的文本内容"
}
Troubleshooting
File Size Issues:
- Check Coze API documentation for file size limits
- Reduce sample rate or bit depth if needed
Poor Accuracy:
- Improve audio quality
- Ensure clear speech and minimal noise
- Use appropriate language code
Format Issues:
- Ensure file is not corrupted
- Verify audio can be played by standard players
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install coze-asr - After installation, invoke the skill by name or use
/coze-asr - Provide required inputs per the skill's parameter spec and get structured output
What is Coze Asr?
Automatic Speech Recognition (ASR) using Coze API. Use when you need to transcribe audio files to text. Supports Chinese audio transcription via Coze's speec... It is an AI Agent Skill for Claude Code / OpenClaw, with 216 downloads so far.
How do I install Coze Asr?
Run "/install coze-asr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Coze Asr free?
Yes, Coze Asr is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Coze Asr support?
Coze Asr is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Coze Asr?
It is built and maintained by xiaofei (@franklu0819-lang); the current version is v1.0.0.