cindypapa

Volcengine Doubao Voice Podcast (火山引擎豆包语音播客)

Author: Cindypapa · GitHub · v1.1.1 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 146
Favorites: 0
Current installs: 0
Versions: 2
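Install in OpenClaw
/install volcano-engine-podcast
Description
Generates podcasts with Volcengine Doubao voice (PodcastTTS). Given topic text as input, it automatically produces two-speaker, dialogue-style podcast audio.
Security recommendations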
What to consider before installing or using this skill:
- Credentials: The code requires a Volcengine appid and access token (SKILL.md references VOLC_APPID and VOLC_ACCESS_TOKEN). The registry metadata does not list these, so assume you must provide them. Only supply keys you control and are comfortable using for TTS calls.
- Hidden config access: scripts/kamei_podcast.py will try to read ~/.openclaw/config.json for credentials. Inspect that file (if present) before running to ensure it doesn't contain unrelated secrets you don't want read by this code.
- File writes & permissions: kamei_podcast.py copies generated audio to /root/.openclaw/media/qqbot/downloads (SEND_DIR). That path requires root permissions and is not documented in SKILL.md. If you run as a non-root user the copy will fail; if you run as root the skill will write into that location. Consider changing SEND_DIR or running in a sandbox/container, and inspect the script before use.
- Hardcoded key: an app_key default (aGjiRDfUWi) is embedded in the code. Confirm whether you should use your own key rather than the default, and avoid relying on embedded or unknown keys.
- Network endpoint: the client connects to wss://openspeech.bytedance.com/api/v3/sami/podcasttts. If you require assurance about what is sent, review the code path that builds headers and payload (headers include your appid/access_key) and consider running it in an environment where you can monitor outbound connections.
- Safety steps: review the files (generate_podcast.py and kamei_podcast.py) locally before running, provide only least-privilege credentials, and run the skill in a sandbox or non-root account. If you want to use it in an automated agent, update SEND_DIR to a safe location and remove or verify any reading of ~/.openclaw/config.json. If source/origin is important, ask the publisher for provenance (homepage, contact) because 'Source: unknown' increases risk.
If you can provide the publisher/source or confirm the intended runtime environment (e.g., OpenClaw agent running as root in a container), I can re-evaluate and raise or lower the confidence accordingly.
Functional analysis
Type: OpenClaw Skill
Name: volcano-engine-podcast
Version: 1.1.1
The skill bundle is a legitimate implementation for generating AI-powered dual-speaker podcasts using the Volcengine (ByteDance) PodcastTTS API. It includes a robust implementation of a custom binary WebSocket protocol in `protocols.py` and a well-structured generator in `generate_podcast.py`. The integration script `kamei_podcast.py` correctly handles configuration by reading from the standard OpenClaw path (`~/.openclaw/config.json`) and outputs files to a designated media directory for bot delivery. No evidence of data exfiltration, malicious execution, or prompt injection was found.
Capability tags
requires-oauth-token · requires-sensitive-credentials
Capability assessment
Purpose & Capability
Name/description match the code: the Python client connects to a Volcengine/openspeech WebSocket and streams/merges audio for PodcastTTS. Requiring appid/access_token is appropriate. However, registry metadata declares no required env vars while SKILL.md and code clearly expect VOLC_APPID / VOLC_ACCESS_TOKEN (and optionally VOLC_APP_KEY). That registry omission is an incoherence that could mislead users about secret requirements.
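Because the registry metadata does not declare these variables, a caller may want to confirm they are set before invoking the skill. The following is a minimal sketch that assumes only the variable names cited above; the helper `load_volc_credentials` is hypothetical and is not part of the skill's code, whose own loading logic may differ.

```python
import os
from typing import Mapping, Optional

# Variable names taken from the review above; everything else is illustrative.
REQUIRED_VARS = ("VOLC_APPID", "VOLC_ACCESS_TOKEN")
OPTIONAL_VARS = ("VOLC_APP_KEY",)


def load_volc_credentials(env: Optional[Mapping[str, str]] = None) -> dict:
    """Return the Volcengine settings found in `env`.

    Raises RuntimeError when a required variable is missing or blank.
    Hypothetical helper: it does not reproduce the skill's actual behaviour.
    """
    env = os.environ if env is None else env
    missing = [name for name in REQUIRED_VARS if not env.get(name, "").strip()]
    if missing:
        raise RuntimeError(
            "Missing required environment variables: " + ", ".join(missing)
        )
    creds = {name: env[name].strip() for name in REQUIRED_VARS}
    # Optional values are included only when explicitly set, so no
    # embedded default key is picked up silently.
    for name in OPTIONAL_VARS:
        value = env.get(name, "").strip()
        if value:
            creds[name] = value
    return creds
```

Calling `load_volc_credentials()` with no argument reads from the process environment; passing a plain dict makes the check easy to exercise without touching real secrets.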
Instruction Scope
SKILL.md documents env vars and CLI/API usage and is generally scoped to TTS generation. But one bundled script (scripts/kamei_podcast.py) reads a config file at ~/.openclaw/config.json and copies output to a hardcoded SEND_DIR (/root/.openclaw/media/qqbot/downloads). Those file-path interactions are not documented in SKILL.md's '注意事项' and introduce side effects outside the described output_dir (reading user config and writing into a root-owned path). The code also embeds a default app_key in plain text. These behaviors expand scope beyond just calling the remote TTS service.
Install Mechanism
No install spec is present (instruction-only skill plus Python scripts). Nothing is downloaded or executed automatically by an installer, which minimizes supply-chain risk. The skill requires Python packages (websockets, optionally pydub) but those are documented in SKILL.md.
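Since nothing installs the Python packages automatically, it can help to check for them up front. The sketch below uses the standard library's `importlib.util.find_spec` to test whether the packages named above are importable; the function name `check_dependencies` and the returned dictionary layout are assumptions made for illustration.

```python
import importlib.util


def check_dependencies(required=("websockets",), optional=("pydub",)) -> dict:
    """Report which top-level packages cannot be found, without importing them."""

    def installed(name: str) -> bool:
        # find_spec returns None when a top-level package is not available.
        return importlib.util.find_spec(name) is not None

    return {
        "missing_required": [n for n in required if not installed(n)],
        "missing_optional": [n for n in optional if not installed(n)],
    }
```

A non-empty `missing_required` list would mean the WebSocket client cannot run; a missing optional package only affects whatever the skill uses it for.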
Credentials
The skill legitimately needs Volcengine credentials (appid and access token). However, the registry metadata lists no required env vars even though SKILL.md instructs use of VOLC_APPID and VOLC_ACCESS_TOKEN. The code also attempts to read credentials from ~/.openclaw/config.json (key name volc['access_key']) which could expose other local secrets if that file exists. Embedding an app_key default in code is also notable (hardcoded key). Overall the requested/used environment access is proportionate for the TTS task, but the undisclosed config-file access and hardcoded key are unexpected.
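One way to see what a script could read from the config file, without printing any secret, is to list only its key names. This is a minimal sketch assuming the file is valid JSON; `list_config_keys` is a hypothetical helper, and the path is the one named in the review.

```python
import json
from pathlib import Path


def _walk(node, prefix=""):
    """Collect dotted key paths from nested dicts; values are never returned."""
    keys = []
    if isinstance(node, dict):
        for key, value in node.items():
            dotted = f"{prefix}.{key}" if prefix else str(key)
            keys.append(dotted)
            keys.extend(_walk(value, dotted))
    return keys


def list_config_keys(path):
    """Return the key names found in a JSON config file.

    Raises FileNotFoundError if the file does not exist and
    json.JSONDecodeError if it is not valid JSON.
    """
    text = Path(path).expanduser().read_text(encoding="utf-8")
    return _walk(json.loads(text))
```

For example, `list_config_keys("~/.openclaw/config.json")` would show whether entries other than the Volcengine ones live in the same file.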
Persistence & Privilege
The skill does not request always:true and does not modify other skills' configurations. However, scripts will read ~/.openclaw/config.json and write copied audio into /root/.openclaw/media/qqbot/downloads — writing into a root path assumes elevated privileges and is an operational concern. This is not an 'always' persistence problem, but it is a side-effect that could fail or cause unexpected file writes when run with different user permissions.
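A configurable, user-writable output directory would avoid the root-path assumption. The sketch below is one possible approach, not the skill's implementation: the environment variable `PODCAST_SEND_DIR` and the default `~/podcast_output` are invented for illustration.

```python
import os
import tempfile
from pathlib import Path


def resolve_send_dir(env=None, default=None) -> Path:
    """Pick an output directory and verify that it is writable.

    PODCAST_SEND_DIR is a hypothetical override; the skill does not define it.
    Raises OSError (for example PermissionError) if the directory cannot be
    created or written to.
    """
    env = os.environ if env is None else env
    fallback = Path.home() / "podcast_output" if default is None else Path(default)
    target = Path(env.get("PODCAST_SEND_DIR") or fallback).expanduser()
    target.mkdir(parents=True, exist_ok=True)
    # Probe writability by creating and auto-removing a temporary file.
    with tempfile.NamedTemporaryFile(dir=target):
        pass
    return target
```

Failing early with an `OSError` makes the permission problem visible before any audio is generated, rather than at the final copy step.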
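How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install volcano-engine-podcast
  3. Once installed, invoke the skill by name or trigger it with /volcano-engine-podcast
  4. Provide the inputs described in the skill's parameter documentation to receive structured output.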
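Version history
v1.1.1
v1.1.1 re-release: supports two-speaker dialogue podcast generation, including intro music, chunked streaming delivery, resumable transfer, and audio post-processing.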
v1.1.0
- Added multi-step user flow: skill now always asks if reference materials are available before generating a podcast.
- Improved podcast generation logic for both cases: with or without provided materials.
- Clearly defined required trigger keywords for easier activation.
- Updated API usage examples and configuration guidelines.
- Outlined output structure and return data details.
- Clarified instructions to always send the final MP3 file to the user.
Metadata
Slug volcano-engine-podcast
Version 1.1.1
License MIT-0
Total installs 0
Current installs 0
Versions 2
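FAQ

What is Volcengine Doubao Voice Podcast?

It generates podcasts with Volcengine Doubao voice (PodcastTTS): given topic text as input, it automatically produces two-speaker, dialogue-style podcast audio. It is an AI Agent Skill plugin for Claude Code / OpenClaw and has 146 total downloads to date.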

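How do I install Volcengine Doubao Voice Podcast?

Run the command "/install volcano-engine-podcast" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no further setup, but the skill expects Volcengine credentials (VOLC_APPID and VOLC_ACCESS_TOKEN) before it can generate audio.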

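Is Volcengine Doubao Voice Podcast free?

Yes. Volcengine Doubao Voice Podcast is completely free and released under the MIT-0 license; you can download, install, and use it freely.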

Which platforms does Volcengine Doubao Voice Podcast support?

Volcengine Doubao Voice Podcast is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Volcengine Doubao Voice Podcast?

It is developed and maintained by Cindypapa (@cindypapa); the current version is v1.1.1.
