Elevenlabs Tts

by JerryXn · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 27 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install xiaog-elevenlabs-tts
Description
Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing...
README (SKILL.md)

ElevenLabs TTS

Use this skill for ElevenLabs (ELE) work: voice generation, low-latency streaming TTS, voice IDs, voice listing, output formats, and TTS benchmarking.

Non-negotiables

  • Keep this skill independent from Xiaomi MiMo TTS. Do not mix API keys, endpoint shapes, voice IDs, or scripts.
  • Treat ELEVENLABS_API_KEY as a secret. Never echo it, write it into skill files, commit it, or expose it in logs/screenshots.
  • Prefer injecting the key through the environment for tests, in the form ELEVENLABS_API_KEY=... python3 ..., so it never appears in a file.
  • If the key is missing, verify scripts and docs locally, then report that live API synthesis is blocked by missing credentials.
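
The key-handling rules above can be sketched as a small check that reports whether the key is set without ever returning or printing its value. The function name api_key_status and the wording of its messages are hypothetical; they are not taken from the skill's scripts.

```python
import os


def api_key_status(env=None, var="ELEVENLABS_API_KEY"):
    """Report whether the key is set, without exposing the secret itself."""
    # Hypothetical helper: accepts a mapping so it can be tested without
    # touching the real process environment.
    env = os.environ if env is None else env
    value = env.get(var, "").strip()
    if not value:
        return f"{var}: missing (live API synthesis is blocked)"
    # Only presence is reported; the value never leaves this function.
    return f"{var}: present"


print(api_key_status({"ELEVENLABS_API_KEY": "example-not-a-real-key"}))
print(api_key_status({}))
```

Called with no arguments, the helper reads the real environment, which suits step 1 of the workflow.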

Standard workflow

  1. Verify API key availability without printing it.
  2. List voices with scripts/list_voices.py; choose a requested voice by ID/name or use the first available voice for smoke tests.
  3. Generate a short HTTP streaming sample with scripts/tts_stream_http.py.
  4. Capture evidence: output path, bytes, chunks, ttfb_seconds, elapsed time, model, output format.
  5. If audio validation tools exist (ffprobe, afinfo, or file), inspect the generated file before claiming success.
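
As an illustration of step 4, the sketch below gathers the evidence fields while writing streamed chunks to disk. It is a minimal example under stated assumptions, not the logic of scripts/tts_stream_http.py: measure_stream is a hypothetical name, and the chunk iterator stands in for whatever the HTTP client yields.

```python
import os
import tempfile
import time
from typing import Iterable


def measure_stream(chunks: Iterable[bytes], out_path: str) -> dict:
    """Write chunks to out_path and record bytes, chunk count, and timings."""
    start = time.perf_counter()
    ttfb = None
    total_bytes = 0
    chunk_count = 0
    with open(out_path, "wb") as fh:
        for chunk in chunks:
            if not chunk:
                continue  # skip empty keep-alive chunks
            if ttfb is None:
                # Time to first byte: first non-empty chunk after the start.
                ttfb = time.perf_counter() - start
            fh.write(chunk)
            total_bytes += len(chunk)
            chunk_count += 1
    return {
        "output": out_path,
        "bytes": total_bytes,
        "chunks": chunk_count,
        "ttfb_seconds": ttfb,
        "elapsed_seconds": time.perf_counter() - start,
    }


# Demo with fake chunks in place of a live audio stream.
demo_path = os.path.join(tempfile.gettempdir(), "elevenlabs-evidence-demo.bin")
evidence = measure_stream(iter([b"abc", b"", b"defgh"]), demo_path)
print(evidence)
```

Model and output format are request parameters rather than measurements, so a real run would add them to the returned dictionary from the arguments it was invoked with.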

Recommended defaults

  • Model: eleven_flash_v2_5 for lowest latency; eleven_turbo_v2_5 for higher quality with low latency; eleven_multilingual_v2 for broad multilingual quality.
  • Endpoint: POST /v1/text-to-speech/{voice_id}/stream when the full text is available.
  • Output: mp3_44100_128 for shareable files; pcm_16000/pcm_24000 for realtime playback pipelines.
  • Latency: optimize_streaming_latency=2 or 3; avoid 4 unless the user accepts possible number/date mispronunciation.
  • Voice settings: keep use_speaker_boost=false for low latency.
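
To show how these defaults fit together, here is a hedged sketch that assembles a streaming request without sending it. The base URL, the xi-api-key header, and the JSON field names (text, model_id, voice_settings) are assumptions based on the public ElevenLabs API as commonly documented; confirm them against references/elevenlabs-api.md before relying on them. build_stream_request is a hypothetical helper name.

```python
import json
from urllib.parse import quote, urlencode

# Assumed base URL; not stated in this document.
BASE_URL = "https://api.elevenlabs.io"


def build_stream_request(
    voice_id,
    text,
    api_key,
    model_id="eleven_flash_v2_5",
    output_format="mp3_44100_128",
    optimize_streaming_latency=2,
):
    """Return (url, headers, body) for POST /v1/text-to-speech/{voice_id}/stream."""
    query = urlencode(
        {
            "output_format": output_format,
            "optimize_streaming_latency": optimize_streaming_latency,
        }
    )
    url = f"{BASE_URL}/v1/text-to-speech/{quote(voice_id, safe='')}/stream?{query}"
    # Assumed header name for the API key.
    headers = {"xi-api-key": api_key, "Content-Type": "application/json"}
    body = {
        "text": text,
        "model_id": model_id,
        # Speaker boost stays off to favour low latency.
        "voice_settings": {"use_speaker_boost": False},
    }
    return url, headers, json.dumps(body).encode("utf-8")


url, headers, body = build_stream_request("VOICE123", "hello", "dummy-key")
print(url)  # the headers are deliberately not printed: they hold the key
```

Keeping construction separate from sending makes the request shape checkable without credentials, which matches the missing-key fallback in the non-negotiables.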

Quick commands

List voices:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/list_voices.py

HTTP streaming TTS benchmark:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/tts_stream_http.py \
  --text '野哥,ElevenLabs 流式语音测试成功。' \
  --voice-name Rachel \
  --model eleven_flash_v2_5 \
  --output-format mp3_44100_128 \
  --optimize-streaming-latency 2 \
  --out /tmp/elevenlabs-skill-test.mp3

If --voice-name is omitted, the script uses the first voice returned by /v1/voices.
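
The fallback described above might look like the following sketch. It assumes each voice is a dictionary with voice_id and name keys, as in the commonly documented /v1/voices response; pick_voice, the case-insensitive name match, and the sample IDs are illustrative assumptions, not the script's actual behaviour.

```python
def pick_voice(voices, wanted=None):
    """Pick a voice by ID or name, or fall back to the first voice."""
    if not voices:
        raise ValueError("no voices returned")
    if wanted is None:
        # No voice requested: use the first one, as for smoke tests.
        return voices[0]
    target = wanted.casefold()
    for voice in voices:
        # Exact match on ID, case-insensitive match on name (an assumption).
        if voice.get("voice_id") == wanted or voice.get("name", "").casefold() == target:
            return voice
    raise LookupError(f"voice not found: {wanted!r}")


# Placeholder data; these IDs are made up for the example.
sample_voices = [
    {"voice_id": "id-001", "name": "Rachel"},
    {"voice_id": "id-002", "name": "Adam"},
]
print(pick_voice(sample_voices)["name"])
print(pick_voice(sample_voices, "adam")["voice_id"])
```

Raising on an unknown name, rather than silently falling back, keeps a benchmark from being attributed to the wrong voice.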

When to read more

  • API contract, request/response details, WebSocket notes, and troubleshooting: references/elevenlabs-api.md.
  • Deterministic scripts:
    • scripts/list_voices.py
    • scripts/tts_stream_http.py
Capability Tags
requires-sensitive-credentials
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install xiaog-elevenlabs-tts
  3. After installation, invoke the skill by name or use /xiaog-elevenlabs-tts
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Standardized independent ElevenLabs TTS skill with API reference, streaming test scripts, safe key handling, and troubleshooting for free-tier synthesis restrictions.
Metadata
Slug xiaog-elevenlabs-tts
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Elevenlabs Tts?

Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing... It is an AI Agent Skill for Claude Code / OpenClaw, with 27 downloads so far.

How do I install Elevenlabs Tts?

Run "/install xiaog-elevenlabs-tts" in the OpenClaw or Claude Code chat to install it in one step. Live synthesis additionally requires ELEVENLABS_API_KEY to be set in the environment.

Is Elevenlabs Tts free?

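Yes, the skill itself is free and licensed under MIT-0. You can download, install, and use it at no cost. It calls the ElevenLabs API with your own ELEVENLABS_API_KEY, and the version history notes free-tier synthesis restrictions.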

Which platforms does Elevenlabs Tts support?

Elevenlabs Tts is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Elevenlabs Tts?

It is built and maintained by JerryXn (@jerryxn); the current version is v1.0.0.
