← 返回 Skills 市场
豆包语音合成 2.0
作者
yanmomuyu-sys
· GitHub ↗
· v1.0.0
· MIT-0
111
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install seedtts2
功能描述
豆包语音合成 2.0,支持情绪控制、多音色、语音指令。34 种音色可选,含 JARVIS 同款男声。
安全使用建议
Things to check before installing/use:
- The skill's registry metadata did not list required environment variables, but both SKILL.md and the shipped Python code require VOLCANO_APP_ID and VOLCANO_ACCESS_TOKEN. Don't provide credentials until you confirm this is the intended provider and you trust the package source.
- Verify the API endpoint and token format with the official 火山引擎 (Volcano/ByteDance) docs. The code and docs use an unusual 'Bearer; {token}' header format (semicolon + space) — confirm that's required and not a typo that will prompt manual token hacks.
- Inspect the included tts_client.py locally (it is provided) before running. The code performs network calls to https://openspeech.bytedance.com and writes audio files under ~/.openclaw/tts_output and reads ~/.openclaw/openclaw.json; ensure you are comfortable with that file access.
- Playback uses subprocess.run with system binaries (afplay/aplay/Windows start). Running these is typical but ensure your environment has the expected players; subprocess usage appears to use list form (no shell injection), but you should still inspect the tail of the file (the provided tts_client.py was partially truncated in the scan) to confirm no unexpected shell=True or unescaped concatenation.
- If you want least privilege: create a dedicated minimal-volume API key on the provider side (if supported) and test in a sandboxed account before using production credentials.
Given the above inconsistencies and the need to handle credentials, proceed with caution and verify the token/header format and the skill's origin before granting credentials or installing into a production agent.
功能分析
Type: OpenClaw Skill
Name: seedtts2
Version: 1.0.0
The skill provides a functional Python client and CLI for Volcengine's SeedTTS 2.0 API. While the logic is aligned with its stated purpose, tts_client.py contains a shell injection vulnerability in the say_and_play method; on Windows, it uses subprocess.run with shell=True on the output file path, which could allow arbitrary command execution if a user provides a crafted filename. No evidence of intentional malice or unauthorized data exfiltration was found, as API credentials are only sent to the official ByteDance endpoint (openspeech.bytedance.com).
能力评估
Purpose & Capability
The name, description, SKILL.md, docs, and code all describe a TTS client calling a 火山引擎/openspeech.bytedance.com API — requiring APP_ID and Access Token is consistent with that purpose. However the skill registry metadata lists no required environment variables or primary credential, which is inconsistent with the shipped code and documentation that both require VOLCANO_APP_ID and VOLCANO_ACCESS_TOKEN.
Instruction Scope
Runtime instructions and code are limited to: reading env vars and an OpenClaw JSON config, posting JSON to the documented TTS API endpoint, parsing streamed responses, saving audio files under ~/.openclaw/tts_output, and optionally invoking local players (afplay/aplay/Windows start). The skill does not instruct collection of unrelated system data or sending data to unexpected external endpoints beyond openspeech.bytedance.com. Note: SKILL.md and docs repeatedly recommend placing credentials in ~/.openclaw/openclaw.json which the client will read.
Install Mechanism
There is no install spec (instruction-only skill), and included files are pure Python with no external arbitrary downloads. This is low risk from an installation mechanism perspective.
Credentials
The code and SKILL.md require VOLCANO_APP_ID and VOLCANO_ACCESS_TOKEN (and optionally VOLCANO_RESOURCE_ID / VOLCANO_API_URL), which are proportional to a TTS client. However the skill registry metadata declared no required env vars or primary credential — an incoherence that may mislead users. Also the docs and code use an unusual token header format 'Authorization: Bearer; {token}' (semicolon + space) — this is nonstandard and should be verified against the provider to ensure it is intentional and not a bug that might cause manual token workarounds.
Persistence & Privilege
The skill does write output to ~/.openclaw/tts_output and will read ~/.openclaw/openclaw.json; this is reasonable for a client that stores outputs and reads local configuration. The skill is not marked always:true and does not request system-wide persistent privileges beyond its own config and output directory.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install seedtts2 - 安装完成后,直接呼叫该 Skill 的名称或使用
/seedtts2触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
初始版本 - 豆包 SeedTTS 2.0,支持 34 种音色和情绪控制,优化了结束标记检测/Session 复用/超时配置
元数据
常见问题
豆包语音合成 2.0 是什么?
豆包语音合成 2.0,支持情绪控制、多音色、语音指令。34 种音色可选,含 JARVIS 同款男声。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 111 次。
如何安装 豆包语音合成 2.0?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install seedtts2」即可一键安装,无需额外配置。
豆包语音合成 2.0 是免费的吗?
是的,豆包语音合成 2.0 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
豆包语音合成 2.0 支持哪些平台?
豆包语音合成 2.0 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 豆包语音合成 2.0?
由 yanmomuyu-sys(@yanmomuyu-sys)开发并维护,当前版本 v1.0.0。
推荐 Skills