/install xiaog-elevenlabs-tts
ElevenLabs TTS
Use this skill for ELE / ElevenLabs voice generation, low-latency streaming TTS, voice IDs, voice listing, output formats, or TTS benchmark work.
Non-negotiables
- Keep this skill independent from Xiaomi MiMo TTS. Do not mix API keys, endpoint shapes, voice IDs, or scripts.
- Treat ELEVENLABS_API_KEY as a secret. Never echo it, write it into skill files, commit it, or expose it in logs/screenshots.
- Prefer environment variable injection for tests: ELEVENLABS_API_KEY=... python3 ...
- If the key is missing, verify scripts and docs locally, then report that live API synthesis is blocked by missing credentials.
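The secret-handling rules above can be sketched as a small presence check. This is a minimal illustration, not the skill's own script: the helper name `api_key_status` is hypothetical, and it reports only whether the variable is set, never its value.

```python
import os


def api_key_status(env=None):
    """Report whether ELEVENLABS_API_KEY is set, without revealing its value.

    `env` defaults to the process environment; passing a dict makes the
    check easy to exercise without touching real credentials.
    """
    env = os.environ if env is None else env
    key = env.get("ELEVENLABS_API_KEY", "").strip()
    if not key:
        # Mirror the documented fallback: local checks only, no live call.
        return "missing: live API synthesis is blocked by missing credentials"
    # Only the fact of presence is reported; the key never reaches the output.
    return "present"
```

Calling `print(api_key_status())` before any live request gives a safe line for logs or reports.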
Standard workflow
- Verify API key availability without printing it.
- List voices with scripts/list_voices.py; choose a requested voice by ID/name or use the first available voice for smoke tests.
- Generate a short HTTP streaming sample with scripts/tts_stream_http.py.
- Capture evidence: output path, bytes, chunks, ttfb_seconds, elapsed time, model, output format.
- If audio validation tools exist (ffprobe, afinfo, or file), inspect the generated file before claiming success.
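The evidence fields in the workflow above (bytes, chunks, ttfb_seconds, elapsed time) could be collected along these lines. This is a sketch under assumptions: `measure_stream` is a hypothetical helper, not the contents of scripts/tts_stream_http.py, and it starts its clock when iteration begins, so a real benchmark would start timing before sending the request.

```python
import time


def measure_stream(chunks, clock=time.monotonic):
    """Consume an iterable of byte chunks and collect benchmark evidence.

    Returns (evidence, audio_bytes). ttfb_seconds stays None when no
    non-empty chunk ever arrives.
    """
    start = clock()
    ttfb = None
    total_bytes = 0
    chunk_count = 0
    parts = []
    for chunk in chunks:
        if not chunk:
            continue  # ignore empty chunks so they do not skew the counts
        if ttfb is None:
            ttfb = clock() - start  # time until the first audio bytes
        chunk_count += 1
        total_bytes += len(chunk)
        parts.append(chunk)
    evidence = {
        "bytes": total_bytes,
        "chunks": chunk_count,
        "ttfb_seconds": ttfb,
        "elapsed_seconds": clock() - start,
    }
    return evidence, b"".join(parts)
```

Any iterable of bytes works as input, for example the chunk iterator of a streaming HTTP response; the returned audio bytes can then be written to the output path and inspected with ffprobe, afinfo, or file.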
Recommended defaults
- Model: eleven_flash_v2_5 for lowest latency; eleven_turbo_v2_5 for higher quality with low latency; eleven_multilingual_v2 for broad multilingual quality.
- Endpoint: POST /v1/text-to-speech/{voice_id}/stream when the full text is available.
- Output: mp3_44100_128 for shareable files; pcm_16000/pcm_24000 for realtime playback pipelines.
- Latency: optimize_streaming_latency=2 or 3; avoid 4 unless the user accepts possible number/date mispronunciation.
- Voice settings: keep use_speaker_boost=false for low latency.
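To show how the defaults above fit together, here is a sketch that builds (but does not send) a streaming request. Several details are assumptions not stated in this document: the host `https://api.elevenlabs.io`, the `xi-api-key` header, the placement of `output_format` and `optimize_streaming_latency` as query parameters, and the JSON body field names. `build_stream_request` is a hypothetical helper; check references/elevenlabs-api.md for the actual contract.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://api.elevenlabs.io"  # assumed API host


def build_stream_request(api_key, voice_id, text,
                         model_id="eleven_flash_v2_5",
                         output_format="mp3_44100_128",
                         optimize_streaming_latency=2):
    """Build a POST request for /v1/text-to-speech/{voice_id}/stream."""
    query = urllib.parse.urlencode({
        "output_format": output_format,
        "optimize_streaming_latency": optimize_streaming_latency,
    })
    safe_voice_id = urllib.parse.quote(voice_id, safe="")
    url = f"{BASE_URL}/v1/text-to-speech/{safe_voice_id}/stream?{query}"
    body = {
        "text": text,
        "model_id": model_id,
        # Assumed field name; the API may expect further voice settings.
        "voice_settings": {"use_speaker_boost": False},
    }
    return urllib.request.Request(
        url,
        data=json.dumps(body).encode("utf-8"),
        headers={"xi-api-key": api_key, "Content-Type": "application/json"},
        method="POST",
    )
```

Keeping request construction separate from sending makes the defaults easy to verify locally when the key is missing. Passing the result to `urllib.request.urlopen` would perform the live call.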
Quick commands
List voices:
ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/list_voices.py
HTTP streaming TTS benchmark:
ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/tts_stream_http.py \
--text '野哥,ElevenLabs 流式语音测试成功。' \
--voice-name Rachel \
--model eleven_flash_v2_5 \
--output-format mp3_44100_128 \
--optimize-streaming-latency 2 \
--out /tmp/elevenlabs-skill-test.mp3
If --voice-name is omitted, the script uses the first voice returned by /v1/voices.
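The selection rule described above (requested voice by ID or name, otherwise the first voice returned) might look like the following. This is a sketch only: `select_voice` is a hypothetical helper, the `voice_id` and `name` keys are an assumed shape for entries from /v1/voices, and case-insensitive name matching is an illustrative choice rather than documented script behavior.

```python
def select_voice(voices, voice_name=None, voice_id=None):
    """Pick a voice dict by ID, then by name, else fall back to the first."""
    if not voices:
        raise ValueError("no voices available")
    if voice_id:
        for voice in voices:
            if voice.get("voice_id") == voice_id:
                return voice
        raise LookupError(f"voice ID not found: {voice_id}")
    if voice_name:
        wanted = voice_name.casefold()
        for voice in voices:
            if voice.get("name", "").casefold() == wanted:
                return voice
        raise LookupError(f"voice name not found: {voice_name}")
    # Smoke-test default: the first voice in the listing.
    return voices[0]
```

Raising on an unknown name, instead of silently falling back to the first voice, keeps benchmark evidence tied to the voice that was actually requested.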
When to read more
- API contract, request/response details, WebSocket notes, and troubleshooting: references/elevenlabs-api.md.
- Deterministic scripts:
  - scripts/list_voices.py
  - scripts/tts_stream_http.py
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install xiaog-elevenlabs-tts
- After installation, invoke the skill by name or use /xiaog-elevenlabs-tts
- Provide required inputs per the skill's parameter spec and get structured output
What is ElevenLabs TTS?
Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing... It is an AI Agent Skill for Claude Code / OpenClaw, with 27 downloads so far.
How do I install ElevenLabs TTS?
Run "/install xiaog-elevenlabs-tts" in the OpenClaw or Claude Code chat to install it in one step; no extra setup is required.
Is ElevenLabs TTS free?
Yes, the ElevenLabs TTS skill is free and licensed under MIT-0. You can download, install, and use it at no cost.
Which platforms does ElevenLabs TTS support?
ElevenLabs TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created ElevenLabs TTS?
It is built and maintained by JerryXn (@jerryxn); the current version is v1.0.0.