Elevenlabs Tts

Name: Elevenlabs Tts
Author: jerryxn

by JerryXn · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Install in OpenClaw

/install xiaog-elevenlabs-tts

Description

Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing...

README (SKILL.md)

ElevenLabs TTS

Use this skill for ELE / ElevenLabs voice generation, low-latency streaming TTS, voice IDs, voice listing, output formats, or TTS benchmark work.

Non-negotiables

- Keep this skill independent from Xiaomi MiMo TTS. Do not mix API keys, endpoint shapes, voice IDs, or scripts.
- Treat ELEVENLABS_API_KEY as a secret. Never echo it, write it into skill files, commit it, or expose it in logs or screenshots.
- Prefer environment variable injection for tests, for example: ELEVENLABS_API_KEY=... python3 ...
- If the key is missing, verify scripts and docs locally, then report that live API synthesis is blocked by missing credentials.
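
As an illustration of checking the key without exposing it, here is a minimal sketch. The helper name `api_key_status` is hypothetical and is not part of the skill's scripts; it only reports whether the variable is set and never returns the value.

```python
import os


def api_key_status(env=None):
    """Report whether ELEVENLABS_API_KEY is set, without revealing it.

    `env` defaults to os.environ; passing a dict makes the check testable.
    """
    env = os.environ if env is None else env
    key = env.get("ELEVENLABS_API_KEY", "").strip()
    if not key:
        # Matches the rule above: report the block instead of guessing.
        return "missing: live API synthesis is blocked by missing credentials"
    # Deliberately return no part of the key, not even a prefix.
    return "present"


if __name__ == "__main__":
    print(api_key_status())
```

Printing only a status string keeps the secret out of logs and screenshots even when the check runs in a shared terminal.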

Standard workflow

1. Verify API key availability without printing it.
2. List voices with scripts/list_voices.py; choose a requested voice by ID/name or use the first available voice for smoke tests.
3. Generate a short HTTP streaming sample with scripts/tts_stream_http.py.
4. Capture evidence: output path, bytes, chunks, ttfb_seconds, elapsed time, model, output format.
5. If audio validation tools exist (ffprobe, afinfo, or file), inspect the generated file before claiming success.
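
The evidence fields in the workflow above could be collected roughly as follows. This is a sketch under assumptions, not the code in scripts/tts_stream_http.py: the function name, the `started_at` parameter, and the `elapsed_seconds` key are hypothetical, and `chunks` stands for any iterable of byte chunks, such as a streamed HTTP response body.

```python
import time


def write_stream_with_metrics(chunks, out_path, started_at=None,
                              clock=time.perf_counter):
    """Write audio chunks to out_path and return basic streaming metrics.

    Pass `started_at` (a clock() reading taken just before the request was
    sent) so that ttfb_seconds includes the network round trip.
    """
    start = clock() if started_at is None else started_at
    ttfb = None
    total_bytes = 0
    chunk_count = 0
    with open(out_path, "wb") as handle:
        for chunk in chunks:
            if not chunk:
                continue  # skip empty keep-alive chunks
            if ttfb is None:
                ttfb = clock() - start  # time to first audio byte
            handle.write(chunk)
            total_bytes += len(chunk)
            chunk_count += 1
    return {
        "output": out_path,
        "bytes": total_bytes,
        "chunks": chunk_count,
        "ttfb_seconds": ttfb,
        "elapsed_seconds": clock() - start,
    }
```

A result with zero bytes, or with `ttfb_seconds` still set to `None`, means no audio arrived and should be reported as a failure rather than a success.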

Recommended defaults

- Model: eleven_flash_v2_5 for lowest latency; eleven_turbo_v2_5 for higher quality with low latency; eleven_multilingual_v2 for broad multilingual quality.
- Endpoint: POST /v1/text-to-speech/{voice_id}/stream when the full text is available.
- Output: mp3_44100_128 for shareable files; pcm_16000/pcm_24000 for realtime playback pipelines.
- Latency: optimize_streaming_latency=2 or 3; avoid 4 unless the user accepts possible number/date mispronunciation.
- Voice settings: keep use_speaker_boost=false for low latency.
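
To show how these defaults might combine into one request, here is a hedged sketch that only builds the URL and JSON body and sends nothing. The helper name `build_stream_request` is hypothetical. The host `https://api.elevenlabs.io` and the placement of `output_format` and `optimize_streaming_latency` as query parameters are assumptions based on the public ElevenLabs API, so check references/elevenlabs-api.md before relying on them.

```python
from urllib.parse import quote, urlencode

API_BASE = "https://api.elevenlabs.io"  # assumed public API host


def build_stream_request(voice_id, text, model_id="eleven_flash_v2_5",
                         output_format="mp3_44100_128", latency=2):
    """Return (url, json_body) for POST /v1/text-to-speech/{voice_id}/stream."""
    if latency not in range(0, 5):
        raise ValueError("optimize_streaming_latency must be between 0 and 4")
    query = urlencode({
        "output_format": output_format,
        "optimize_streaming_latency": latency,
    })
    url = f"{API_BASE}/v1/text-to-speech/{quote(voice_id, safe='')}/stream?{query}"
    body = {
        "text": text,
        "model_id": model_id,
        # Assumption: a partial voice_settings object is accepted by the API.
        "voice_settings": {"use_speaker_boost": False},
    }
    return url, body
```

The API key is intentionally left out of the returned values; it would be added as a request header at send time so that it never ends up in a printed URL or a saved payload.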

Quick commands

List voices:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/list_voices.py

HTTP streaming TTS benchmark:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/tts_stream_http.py \
  --text 'Ye Ge, the ElevenLabs streaming speech test succeeded.' \
  --voice-name Rachel \
  --model eleven_flash_v2_5 \
  --output-format mp3_44100_128 \
  --optimize-streaming-latency 2 \
  --out /tmp/elevenlabs-skill-test.mp3

If --voice-name is omitted, the script uses the first voice returned by /v1/voices.
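
The fallback described above can be sketched as follows. The function name `pick_voice` is hypothetical, the case-insensitive name match is an assumption rather than confirmed script behavior, and the `voice_id` and `name` keys assume the usual shape of entries in the /v1/voices response.

```python
def pick_voice(voices, name=None):
    """Pick a voice dict by name, or fall back to the first voice.

    `voices` is a list of dicts, each assumed to carry "voice_id" and "name".
    """
    if not voices:
        raise LookupError("no voices returned")
    if name is None:
        return voices[0]  # smoke-test fallback: first available voice
    wanted = name.strip().casefold()
    for voice in voices:
        if voice.get("name", "").casefold() == wanted:
            return voice
    raise LookupError(f"voice not found: {name}")
```

Raising an error for an unknown name, instead of silently falling back, keeps a benchmark from being attributed to the wrong voice.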

When to read more

API contract, request/response details, WebSocket notes, and troubleshooting: references/elevenlabs-api.md.
Deterministic scripts:
- scripts/list_voices.py
- scripts/tts_stream_http.py

Usage Guidance

Reasonable to install if you need one-time agent handoffs. Before use, confirm you trust the external Agent Wormhole service, keep wormhole codes private, avoid placing long-lived secrets in payloads, and only approve wallet signatures or x402 payments when you intentionally want that access path.

Capability Tags

requires-sensitive-credentials

Capability Assessment

✓ Purpose & Capability

The skill is explicitly about one-time encrypted handoffs for mission briefs, artifacts, config drops, receipts, and scoped secrets, including CLI/API use and optional ECHO holder or Bankr x402 access paths.

ℹ Instruction Scope

The instructions include operational commands such as send, receive, inspect, cleanup, and npx use, but they are tied to the stated handoff workflow and are not hidden or automatic.

✓ Install Mechanism

The artifact contains markdown instructions and a small YAML agent prompt, with no executable scripts reported by SkillSpector.

ℹ Credentials

The skill intentionally handles sensitive handoff data through an external service and may require wallet signatures or paid x402 access, so users should treat payloads and wallet actions carefully.

ℹ Persistence & Privilege

The design describes temporary TTL-based handoffs, one-time claims, receipts, and cleanup commands; there is no evidence of unbounded persistence or privilege escalation in the artifact.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install xiaog-elevenlabs-tts
After installation, invoke the skill by name or use /xiaog-elevenlabs-tts
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Standardized independent ElevenLabs TTS skill with API reference, streaming test scripts, safe key handling, and troubleshooting for free-tier synthesis restrictions.

Metadata

Slug xiaog-elevenlabs-tts

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Elevenlabs Tts?

Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing... It is an AI Agent Skill for Claude Code / OpenClaw, with 27 downloads so far.

How do I install Elevenlabs Tts?

Run "/install xiaog-elevenlabs-tts" in the OpenClaw or Claude Code chat to install it in one step. Live synthesis additionally requires ELEVENLABS_API_KEY to be set in the environment.

Is Elevenlabs Tts free?

Yes, Elevenlabs Tts is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Elevenlabs Tts support?

Elevenlabs Tts is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Elevenlabs Tts?

It is built and maintained by JerryXn (@jerryxn); the current version is v1.0.0.
