/install elevenlabs-transcribe
ElevenLabs Speech-to-Text
Official ElevenLabs skill for speech-to-text transcription.
Convert audio to text with state-of-the-art accuracy. Supports 90+ languages, speaker diarization, and realtime streaming.
Prerequisites
- ffmpeg installed (
brew install ffmpegon macOS) - ELEVENLABS_API_KEY environment variable set
- Python 3.8+ (dependencies auto-install on first run)
Usage
{baseDir}/scripts/transcribe.sh \x3Caudio_file> [options]
{baseDir}/scripts/transcribe.sh --url \x3Cstream_url> [options]
{baseDir}/scripts/transcribe.sh --mic [options]
Examples
Batch Transcription
Transcribe a local audio file:
{baseDir}/scripts/transcribe.sh recording.mp3
With speaker identification:
{baseDir}/scripts/transcribe.sh meeting.mp3 --diarize
Get full JSON response with timestamps:
{baseDir}/scripts/transcribe.sh interview.wav --diarize --json
Realtime Streaming
Stream from a URL (e.g., live radio, podcast):
{baseDir}/scripts/transcribe.sh --url https://npr-ice.streamguys1.com/live.mp3
Transcribe from microphone:
{baseDir}/scripts/transcribe.sh --mic
Stream a local file in realtime (useful for testing):
{baseDir}/scripts/transcribe.sh audio.mp3 --realtime
Quiet Mode for Agents
Suppress status messages on stderr:
{baseDir}/scripts/transcribe.sh --mic --quiet
Options
| Option | Description |
|---|---|
--diarize |
Identify different speakers in the audio |
--lang CODE |
ISO language hint (e.g., en, pt, es, fr) |
--json |
Output full JSON with timestamps and metadata |
--events |
Tag audio events (laughter, music, applause) |
--realtime |
Stream local file instead of batch processing |
--partials |
Show interim transcripts during realtime mode |
-q, --quiet |
Suppress status messages (recommended for agents) |
Output Format
Text Mode (default)
Plain text transcription:
The quick brown fox jumps over the lazy dog.
JSON Mode (--json)
{
"text": "The quick brown fox jumps over the lazy dog.",
"language_code": "eng",
"language_probability": 0.98,
"words": [
{"text": "The", "start": 0.0, "end": 0.15, "type": "word", "speaker_id": "speaker_0"}
]
}
Realtime Mode
Final transcripts print as they're committed. With --partials:
[partial] The quick
[partial] The quick brown fox
The quick brown fox jumps over the lazy dog.
Supported Formats
Audio: MP3, WAV, M4A, FLAC, OGG, WebM, AAC, AIFF, Opus Video: MP4, AVI, MKV, MOV, WMV, FLV, WebM, MPEG, 3GPP
Limits: Up to 3GB file size, 10 hours duration
Error Handling
The script exits with non-zero status on errors:
- Missing API key: Set
ELEVENLABS_API_KEYenvironment variable - File not found: Check the file path exists
- Missing ffmpeg: Install with your package manager
- API errors: Check API key validity and rate limits
When to Use Each Mode
| Scenario | Command |
|---|---|
| Transcribe a recording | ./transcribe.sh file.mp3 |
| Meeting with multiple speakers | ./transcribe.sh meeting.mp3 --diarize |
| Live radio/podcast stream | ./transcribe.sh --url \x3Curl> |
| Voice input from user | ./transcribe.sh --mic --quiet |
| Need word timestamps | ./transcribe.sh file.mp3 --json |
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install elevenlabs-transcribe - 安装完成后,直接呼叫该 Skill 的名称或使用
/elevenlabs-transcribe触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Elevenlabs Transcribe 是什么?
Transcribe audio to text using ElevenLabs Scribe. Supports batch transcription, realtime streaming from URLs, microphone input, and local files. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 2551 次。
如何安装 Elevenlabs Transcribe?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install elevenlabs-transcribe」即可一键安装,无需额外配置。
Elevenlabs Transcribe 是免费的吗?
是的,Elevenlabs Transcribe 完全免费(开源免费),可自由下载、安装和使用。
Elevenlabs Transcribe 支持哪些平台?
Elevenlabs Transcribe 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Elevenlabs Transcribe?
由 PaulAsjes(@paulasjes)开发并维护,当前版本 v1.0.1。