← 返回 Skills 市场
jerryxn

Elevenlabs Tts

作者 JerryXn · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
27
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install xiaog-elevenlabs-tts
功能描述
Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing...
使用说明 (SKILL.md)

ElevenLabs TTS

Use this skill for ELE / ElevenLabs voice generation, low-latency streaming TTS, voice IDs, voice listing, output formats, or TTS benchmark work.

Non-negotiables

  • Keep this skill independent from Xiaomi MiMo TTS. Do not mix API keys, endpoint shapes, voice IDs, or scripts.
  • Treat ELEVENLABS_API_KEY as a secret. Never echo it, write it into skill files, commit it, or expose it in logs/screenshots.
  • Prefer environment variable injection for tests: ELEVENLABS_API_KEY=... python3 ....
  • If the key is missing, verify scripts and docs locally, then report that live API synthesis is blocked by missing credentials.

Standard workflow

  1. Verify API key availability without printing it.
  2. List voices with scripts/list_voices.py; choose a requested voice by ID/name or use the first available voice for smoke tests.
  3. Generate a short HTTP streaming sample with scripts/tts_stream_http.py.
  4. Capture evidence: output path, bytes, chunks, ttfb_seconds, elapsed time, model, output format.
  5. If audio validation tools exist (ffprobe, afinfo, or file), inspect the generated file before claiming success.

Recommended defaults

  • Model: eleven_flash_v2_5 for lowest latency; eleven_turbo_v2_5 for higher quality with low latency; eleven_multilingual_v2 for broad multilingual quality.
  • Endpoint: POST /v1/text-to-speech/{voice_id}/stream when the full text is available.
  • Output: mp3_44100_128 for shareable files; pcm_16000/pcm_24000 for realtime playback pipelines.
  • Latency: optimize_streaming_latency=2 or 3; avoid 4 unless the user accepts possible number/date mispronunciation.
  • Voice settings: keep use_speaker_boost=false for low latency.

Quick commands

List voices:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/list_voices.py

HTTP streaming TTS benchmark:

ELEVENLABS_API_KEY="$ELEVENLABS_API_KEY" python3 skills/elevenlabs-tts/scripts/tts_stream_http.py \
  --text '野哥,ElevenLabs 流式语音测试成功。' \
  --voice-name Rachel \
  --model eleven_flash_v2_5 \
  --output-format mp3_44100_128 \
  --optimize-streaming-latency 2 \
  --out /tmp/elevenlabs-skill-test.mp3

If --voice-name is omitted, the script uses the first voice returned by /v1/voices.

When to read more

  • API contract, request/response details, WebSocket notes, and troubleshooting: references/elevenlabs-api.md.
  • Deterministic scripts:
    • scripts/list_voices.py
    • scripts/tts_stream_http.py
安全使用建议
Reasonable to install if you need one-time agent handoffs. Before use, confirm you trust the external Agent Wormhole service, keep wormhole codes private, avoid placing long-lived secrets in payloads, and only approve wallet signatures or x402 payments when you intentionally want that access path.
能力标签
requires-sensitive-credentials
能力评估
Purpose & Capability
The skill is explicitly about one-time encrypted handoffs for mission briefs, artifacts, config drops, receipts, and scoped secrets, including CLI/API use and optional ECHO holder or Bankr x402 access paths.
Instruction Scope
The instructions include operational commands such as send, receive, inspect, cleanup, and npx use, but they are tied to the stated handoff workflow and are not hidden or automatic.
Install Mechanism
The artifact contains markdown instructions and a small YAML agent prompt, with no executable scripts reported by SkillSpector.
Credentials
The skill intentionally handles sensitive handoff data through an external service and may require wallet signatures or paid x402 access, so users should treat payloads and wallet actions carefully.
Persistence & Privilege
The design describes temporary TTL-based handoffs, one-time claims, receipts, and cleanup commands; there is no evidence of unbounded persistence or privilege escalation in the artifact.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install xiaog-elevenlabs-tts
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /xiaog-elevenlabs-tts 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Standardized independent ElevenLabs TTS skill with API reference, streaming test scripts, safe key handling, and troubleshooting for free-tier synthesis restrictions.
元数据
Slug xiaog-elevenlabs-tts
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Elevenlabs Tts 是什么?

Set up, test, benchmark, and use ElevenLabs text-to-speech as an independent TTS skill, including HTTP streaming, WebSocket streaming guidance, voice listing... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 27 次。

如何安装 Elevenlabs Tts?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install xiaog-elevenlabs-tts」即可一键安装,无需额外配置。

Elevenlabs Tts 是免费的吗?

是的,Elevenlabs Tts 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Elevenlabs Tts 支持哪些平台?

Elevenlabs Tts 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Elevenlabs Tts?

由 JerryXn(@jerryxn)开发并维护,当前版本 v1.0.0。

💬 留言讨论