patches429

Speech

Author: Parker · GitHub · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
478 total downloads · 0 favorites · 6 current installs · 1 version
Install in OpenClaw
/install speech
Description
Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; ru...
Security recommendations
This appears to be a legitimate TTS skill that uses the OpenAI Audio API and a bundled Python CLI. Before installing: (1) confirm how your agent platform expects required credentials to be declared and stored — SKILL.md and the script require OPENAI_API_KEY but the registry metadata does not list it; (2) provide the API key via an environment variable or platform secret store (do not paste the key into chat); (3) review the bundled script locally (it supports --dry-run which prints payloads) and consider running a dry-run first to inspect behavior and outputs without network calls; (4) be cautious about enabling network access or relaxing sandbox approvals — speech generation requires outbound network access to OpenAI. If you need higher assurance, ask the author/registry maintainer to update the metadata to declare OPENAI_API_KEY as a required credential.
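The dry-run idea in point (3) can be sketched as follows. This is a minimal illustration, not the bundled script's code: the helper names `build_speech_payload` and `run` are hypothetical, and the model, voice, and format values are placeholder assumptions rather than the skill's documented defaults.

```python
import json


def build_speech_payload(text, model="gpt-4o-mini-tts", voice="alloy",
                         response_format="mp3"):
    """Assemble the parameters for one speech request (hypothetical helper)."""
    if not text or not text.strip():
        raise ValueError("input text must not be empty")
    return {
        "model": model,                  # placeholder value, an assumption
        "voice": voice,                  # placeholder value, an assumption
        "input": text.strip(),
        "response_format": response_format,
    }


def run(text, dry_run=True):
    """Print the payload in dry-run mode instead of sending it."""
    payload = build_speech_payload(text)
    if dry_run:
        # No network call and no API key are needed on this path.
        print(json.dumps(payload, indent=2))
        return payload
    raise NotImplementedError("the live API call is omitted from this sketch")
```

Calling `run("Welcome to the demo.")` prints the payload that would be sent, so it can be reviewed before network access is enabled.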
Functional analysis
Type: OpenClaw Skill · Name: speech · Version: 0.1.0
The speech generation skill is a well-structured tool for converting text to audio via the OpenAI Audio API. The bundled CLI script (scripts/text_to_speech.py) uses the official OpenAI SDK, implements sensible rate limiting (50 RPM), and includes proper input validation and error handling. The documentation (SKILL.md and references/) provides clear guidance for the AI agent without any evidence of prompt injection, data exfiltration, or unauthorized execution; it even includes proactive security advice regarding the handling of API keys.
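A 50-requests-per-minute cap like the one mentioned above could be enforced with a sliding-window limiter. The sketch below shows one common approach under that assumption; how the bundled script actually implements its limit is not shown in this listing, and the `RateLimiter` class is hypothetical.

```python
import time
from collections import deque


class RateLimiter:
    """Allow at most max_calls within any rolling window of period seconds."""

    def __init__(self, max_calls=50, period=60.0,
                 clock=time.monotonic, sleep=time.sleep):
        self.max_calls = max_calls
        self.period = period
        self.clock = clock      # injectable so the limiter can be tested
        self.sleep = sleep
        self.calls = deque()    # timestamps of recent calls, oldest first

    def _evict(self, now):
        # Forget calls that have left the rolling window.
        while self.calls and now - self.calls[0] >= self.period:
            self.calls.popleft()

    def acquire(self):
        """Block until another call is allowed, then record it."""
        now = self.clock()
        self._evict(now)
        if len(self.calls) >= self.max_calls:
            # Wait until the oldest recorded call falls out of the window.
            self.sleep(self.period - (now - self.calls[0]))
            now = self.clock()
            self._evict(now)
        self.calls.append(now)
```

Calling `limiter.acquire()` before each request keeps a batch job under the cap without any per-request bookkeeping in the caller.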
Capability assessment
Purpose & Capability
Name, description, SKILL.md, references, and the bundled CLI script all align: this is a TTS skill that uses the OpenAI Audio API and built-in voices. However, the registry metadata claims no required environment variables or primary credential while the runtime instructions and script require OPENAI_API_KEY — an inconsistency in declared requirements.
Instruction Scope
SKILL.md instructs the agent to use the bundled CLI (scripts/text_to_speech.py), collect inputs, optionally write transient JSONL under tmp/, and write outputs under output/speech/. It requires an API key for live network calls and explicitly discourages pasting the key in chat. The instructions do not ask for unrelated files, additional credentials, or external endpoints beyond the OpenAI API.
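The file conventions described above (transient JSONL under tmp/, audio under output/speech/) can be sketched like this. The record fields `input` and `out`, the file name `jobs.jsonl`, and the `clip-NNN.mp3` naming are illustrative assumptions; the schema the bundled script actually expects is not shown in this listing.

```python
import json
from pathlib import Path


def write_batch_jobs(texts, root="."):
    """Write one JSON object per line describing each speech job."""
    root = Path(root)
    tmp_dir = root / "tmp"                  # transient job files
    out_dir = root / "output" / "speech"    # generated audio
    tmp_dir.mkdir(parents=True, exist_ok=True)
    out_dir.mkdir(parents=True, exist_ok=True)

    jobs_path = tmp_dir / "jobs.jsonl"      # hypothetical file name
    with jobs_path.open("w", encoding="utf-8") as fh:
        for index, text in enumerate(texts, start=1):
            record = {
                "input": text,                                     # assumed field
                "out": str(out_dir / f"clip-{index:03d}.mp3"),     # assumed field
            }
            fh.write(json.dumps(record, ensure_ascii=False) + "\n")
    return jobs_path
```

Keeping the job file under tmp/ makes it easy to delete after a run, while the audio stays in a predictable location.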
Install Mechanism
There is no install spec (instruction-only), and the one bundled script relies on the public openai Python package. Installation guidance recommends pip (or uv pip). There are no downloads from arbitrary URLs or archive extraction steps in the repo.
Credentials
The runtime requires OPENAI_API_KEY for live API calls, and the script checks the environment for it. The skill metadata, however, lists no required environment variables or primary credential. This omission is a red flag because the agent platform may not surface or protect the API key as expected. No other unrelated credentials or sensitive config paths are requested.
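An environment check of the kind described above might look like the following sketch. The function names `require_api_key` and `describe_key` are hypothetical and the bundled script's real check may differ. The point is that the key is read from the environment and never echoed.

```python
import os


def require_api_key(env=None):
    """Return OPENAI_API_KEY from the environment or fail with a clear message."""
    env = os.environ if env is None else env
    key = env.get("OPENAI_API_KEY", "").strip()
    if not key:
        raise RuntimeError(
            "OPENAI_API_KEY is not set; provide it via an environment "
            "variable or the platform secret store, not in chat."
        )
    return key


def describe_key(key):
    """Return a log-safe description that never reveals the key itself."""
    return f"OPENAI_API_KEY is set ({len(key)} characters)"
```

Logging only `describe_key(...)` confirms the credential is present without exposing it in transcripts or output files.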
Persistence & Privilege
The skill does not request always:true, does not modify other skills' configs, and has normal ephemeral behavior (it writes outputs and temporary JSONL). Autonomous invocation is allowed by default, which is normal for the platform, and is not combined with any other elevated privilege here.
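How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install speech
  3. Once installation finishes, invoke the skill by name or trigger it with /speech.
  4. Provide the inputs listed in the skill's parameter description to get structured output.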
Version history
v0.1.0
Initial release of the speech skill.
- Provides text-to-speech narration, voiceover, and batch speech generation using the OpenAI Audio API and bundled CLI.
- Supports single or batch audio generation workflows with clear decision logic.
- Covers default voice/model selection, instruction formatting, file conventions, and dependency setup.
- Enforces environment checks, API key requirements, and output organization.
- Includes comprehensive instruction on user input augmentation and delivery customization.
- References sample templates and modules for common use cases (narration, IVR, accessibility, etc.).
Metadata
Slug: speech
Version: 0.1.0
License: MIT-0
Total installs: 6
Current installs: 6
Versions released: 1
FAQ

What is Speech?

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; ru... It is an AI agent skill plugin for Claude Code / OpenClaw, with 478 total downloads to date.

How do I install Speech?

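Run the command "/install speech" in the OpenClaw or Claude Code chat box for a one-click install. Installation itself needs no extra configuration, but live speech generation requires OPENAI_API_KEY to be set in the environment.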

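Is Speech free?

Yes. Speech is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.

Which platforms does Speech support?

Speech is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed Speech?

Speech is developed and maintained by Parker (@patches429). The current version is v0.1.0.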
