← Back to Skills Marketplace
patches429

Speech

by Parker · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
478
Downloads
0
Stars
6
Active Installs
1
Versions
Install in OpenClaw
/install speech
Description
Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; ru...
Usage Guidance
This appears to be a legitimate TTS skill that uses the OpenAI Audio API and a bundled Python CLI. Before installing: (1) confirm how your agent platform expects required credentials to be declared and stored — SKILL.md and the script require OPENAI_API_KEY but the registry metadata does not list it; (2) provide the API key via an environment variable or platform secret store (do not paste the key into chat); (3) review the bundled script locally (it supports --dry-run which prints payloads) and consider running a dry-run first to inspect behavior and outputs without network calls; (4) be cautious about enabling network access or relaxing sandbox approvals — speech generation requires outbound network access to OpenAI. If you need higher assurance, ask the author/registry maintainer to update the metadata to declare OPENAI_API_KEY as a required credential.
Capability Analysis
Type: OpenClaw Skill Name: speech Version: 0.1.0 The speech generation skill is a well-structured tool for converting text to audio via the OpenAI Audio API. The bundled CLI script (scripts/text_to_speech.py) uses the official OpenAI SDK, implements sensible rate limiting (50 RPM), and includes proper input validation and error handling. The documentation (SKILL.md and references/) provides clear guidance for the AI agent without any evidence of prompt injection, data exfiltration, or unauthorized execution; it even includes proactive security advice regarding the handling of API keys.
Capability Assessment
Purpose & Capability
Name, description, SKILL.md, references, and the bundled CLI script all align: this is a TTS skill that uses the OpenAI Audio API and built-in voices. However, the registry metadata claims no required environment variables or primary credential while the runtime instructions and script require OPENAI_API_KEY — an inconsistency in declared requirements.
Instruction Scope
SKILL.md instructs the agent to use the bundled CLI (scripts/text_to_speech.py), collect inputs, optionally write transient JSONL under tmp/, and write outputs under output/speech/. It requires an API key for live network calls and explicitly discourages pasting the key in chat. The instructions do not ask for unrelated files, additional credentials, or external endpoints beyond the OpenAI API.
Install Mechanism
There is no install spec (instruction-only), and the one bundled script relies on the public openai Python package. Installation guidance recommends pip (or uv pip). There are no downloads from arbitrary URLs or archive extraction steps in the repo.
Credentials
The runtime requires OPENAI_API_KEY for live API calls (and the script checks env). The skill metadata, however, lists no required env vars or primary credential — this omission is a red flag because the agent platform may not surface or protect the API key as expected. No other unrelated credentials or sensitive config paths are requested.
Persistence & Privilege
The skill does not request always:true, does not modify other skills' configs, and has normal ephemeral behavior (writes outputs and temporary JSONL). Autonomous invocation is allowed by default (platform normal) but not combined with other elevated privileges here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install speech
  3. After installation, invoke the skill by name or use /speech
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Initial release of the speech skill. - Provides text-to-speech narration, voiceover, and batch speech generation using the OpenAI Audio API and bundled CLI. - Supports single or batch audio generation workflows with clear decision logic. - Covers default voice/model selection, instruction formatting, file conventions, and dependency setup. - Enforces environment checks, API key requirements, and output organization. - Includes comprehensive instruction on user input augmentation and delivery customization. - References sample templates and modules for common use cases (narration, IVR, accessibility, etc.).
Metadata
Slug speech
Version 0.1.0
License MIT-0
All-time Installs 6
Active Installs 6
Total Versions 1
Frequently Asked Questions

What is Speech?

Use when the user asks for text-to-speech narration or voiceover, accessibility reads, audio prompts, or batch speech generation via the OpenAI Audio API; ru... It is an AI Agent Skill for Claude Code / OpenClaw, with 478 downloads so far.

How do I install Speech?

Run "/install speech" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Speech free?

Yes, Speech is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Speech support?

Speech is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Speech?

It is built and maintained by Parker (@patches429); the current version is v0.1.0.

💬 Comments