/install asr
Speech is Cheap (SIC) Skill
Fast, accurate, and incredibly inexpensive automatic speech-to-text transcription service.
🚀 Why use this skill?
- Disruptive Pricing: $0.06 - $0.12 per hour (2-15x cheaper than Deepgram or OpenAI).
- Extreme Speed: 100 minutes of audio transcribes in ~1 minute.
- Multilingual: Supports 100 languages with auto-detection.
- Agent-Ready: Designed for high-volume, automated pipelines.
🛠 Setup
1. Get an API Key
Sign up at speechischeap.com. Use code CH5 for $5 off.
2. Configure Authentication
This skill looks for your API key in the SIC_API_KEY environment variable.
Add this to your .env or agent config:
SIC_API_KEY=your_key_here
📖 Usage
🤖 TL;DR for Agents
When this skill is installed, you can transcribe any URL from an OpenClaw session and get the JSON results immediately by running:
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"
Transcribe a URL
# Basic transcription
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3"
# Advanced transcription with options
./skills/asr/scripts/asr.sh transcribe --url "https://example.com/audio.mp3" \
--speakers --words --labels \
--language "en" \
--format "srt" \
--private
Transcribe a Local File
Perfect for processing audio already on your disk. This handles the upload automatically.
# Upload and transcribe local media
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav"
# Upload with webhook callback
./skills/asr/scripts/asr.sh transcribe --file "./local-audio.wav" --webhook "https://mysite.com/callback"
# Note: For local files, the skill handles the multi-part upload to
# https://upload.speechischeap.com before starting the transcription.
Supported Options
--speakers: Enable speaker diarization--words: Enable word-level timestamps--labels: Enable audio labeling (music, noise, etc.)--stream: Enable streaming output--private: Do not store audio/transcript (privacy mode)--language \x3Ccode>: ISO language code (e.g., 'en', 'es')--confidence \x3Cfloat>: Minimum confidence threshold (default 0.5)--format \x3Cfmt>: Output format (json, srt, vtt, webvtt)--webhook \x3Curl>: URL to receive job completion payload--segment-duration \x3Cn>: Segment duration in seconds (default 30)
Check Job Status
./skills/asr/scripts/asr.sh status "job-id-here"
🤖 For Agents
The asr.sh command-line tool returns JSON by default when successful, making it easy to pipe into other tools or parse directly.
If the SIC_API_KEY is missing, the tool will provide a clear error message and a direct link to the signup page.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install asr - 安装完成后,直接呼叫该 Skill 的名称或使用
/asr触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Speech is Cheap Transcribe 是什么?
Fast, affordable automatic speech-to-text transcription supporting 100 languages, speaker diarization, word timestamps, and customizable output formats. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 2734 次。
如何安装 Speech is Cheap Transcribe?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install asr」即可一键安装,无需额外配置。
Speech is Cheap Transcribe 是免费的吗?
是的,Speech is Cheap Transcribe 完全免费(开源免费),可自由下载、安装和使用。
Speech is Cheap Transcribe 支持哪些平台?
Speech is Cheap Transcribe 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Speech is Cheap Transcribe?
由 ilyakam(@ilyakam)开发并维护,当前版本 v1.2.0。