← 返回 Skills 市场
nissan

Kokoro Agent Voices

作者 Nissan Dookeran · GitHub ↗ · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
462
总下载
0
收藏
1
当前安装
2
版本数
在 OpenClaw 中安装
/install kokoro-agent-voices
功能描述
Local zero-cost text-to-speech with per-agent voice profiles using Kokoro TTS (82M params). 54 voices available, named agent mappings, WAV output. Use when b...
使用说明 (SKILL.md)

Kokoro Agent Voices

Give each AI agent a distinct voice using Kokoro TTS — a lightweight 82M parameter model that runs locally with zero API costs. 54 voices across American, British, and other accents.

Agent Voice Profiles

AGENT_VOICES = {
    "loki": "am_fenrir",      # Deep, authoritative
    "archie": "bm_george",    # British analytical
    "sara": "af_bella",       # Warm, creative
    "kit": "am_echo",         # Clear, technical
    "liv": "af_nova",         # Bright, energetic
    "belle": "bf_emma",       # Refined, thoughtful
}

Usage

python3 scripts/speak.py --agent loki "System check complete"
python3 scripts/speak.py --voice af_bella "Hello world" --output /tmp/greeting.wav
python3 scripts/speak.py --list-voices    # Show all 54 voices
python3 scripts/speak.py --list-agents    # Show configured agent profiles

Setup

Requires a Python environment with kokoro, soundfile, and espeak-ng installed. The model downloads automatically from Hugging Face on first use (~350MB).

Files

  • scripts/speak.py — TTS script with agent profiles and voice selection
安全使用建议
This skill appears to do what it says: it downloads Kokoro model weights from Hugging Face once (≈350MB) and runs inference locally. Before installing, consider: (1) ensure you have sufficient disk space and are okay with a one-time outbound download from the public 'hexgrad/Kokoro-82M' repo; (2) install the required Python packages (kokoro, soundfile, numpy, huggingface_hub) in a virtualenv to avoid system contamination; (3) the script's shebang references a developer's local virtualenv path — invoke it with your python (python3 scripts/speak.py) rather than relying on the shebang; (4) the declared requirement for espeak/espeak-ng in metadata is not used in the script and can likely be ignored unless you plan to use those tools separately; (5) if you need to ensure absolute offline behavior, pre-download the model into the expected cache (or vendor the model) so the skill will not attempt network access at runtime. If you need more assurance, inspect or run the script in an isolated environment (VM/container) and verify the model source and license on Hugging Face before use.
功能分析
Type: OpenClaw Skill Name: kokoro-agent-voices Version: 1.0.1 The skill provides local text-to-speech capabilities but contains a shell injection vulnerability in `scripts/speak.py`. The `--output` argument is passed directly into a shell command via `os.system(f"afplay {out}")` when the `--play` flag is used, allowing for arbitrary command execution if the filename is manipulated. Additionally, the script contains a hardcoded shebang path (`/Users/loki/.kokoro-venv/bin/python3`) specific to the author's local environment.
能力评估
Purpose & Capability
Name/description match the included code: the script uses the Kokoro KPipeline and downloads the 'hexgrad/Kokoro-82M' weights from Hugging Face for local TTS. One minor mismatch: the metadata/requirements list espeak-ng and espeak as required binaries, but the included script does not invoke those binaries (it uses Python libraries and macOS 'afplay' only for playback). The espeak requirement appears unnecessary or at least not used by the provided code.
Instruction Scope
SKILL.md explicitly documents that the model is downloaded from Hugging Face on first run (~350MB) and that subsequent inference is local. The script only reads its arguments and writes WAV output (defaults to /tmp). It does not read unrelated files or environment variables, nor does it transmit text/audio to external services beyond the one-time model download.
Install Mechanism
There is no install spec — the skill is instruction- and script-based. That means nothing arbitrary is downloaded/installed by the platform on install time. Runtime dependencies (Python packages like kokoro, soundfile, huggingface_hub, numpy) must be installed by the user/environment; model weights are fetched from a known public Hugging Face repo at runtime.
Credentials
The skill requests no environment variables or credentials. It uses huggingface_hub to snapshot_download a public repo; no auth is required for a public model. There are no unrelated credential requests or config path accesses.
Persistence & Privilege
The skill is not marked always:true, does not request persistent system-wide changes, and does not modify other skills. Autonomous invocation is allowed but is the platform default and not a special privilege here.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install kokoro-agent-voices
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /kokoro-agent-voices 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.1
Add security_notes: one-time model download only, all TTS inference is local
v1.0.0
Initial release — local zero-cost TTS with per-agent voice profiles
元数据
Slug kokoro-agent-voices
版本 1.0.1
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 2
常见问题

Kokoro Agent Voices 是什么?

Local zero-cost text-to-speech with per-agent voice profiles using Kokoro TTS (82M params). 54 voices available, named agent mappings, WAV output. Use when b... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 462 次。

如何安装 Kokoro Agent Voices?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install kokoro-agent-voices」即可一键安装,无需额外配置。

Kokoro Agent Voices 是免费的吗?

是的,Kokoro Agent Voices 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Kokoro Agent Voices 支持哪些平台?

Kokoro Agent Voices 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Kokoro Agent Voices?

由 Nissan Dookeran(@nissan)开发并维护,当前版本 v1.0.1。

💬 留言讨论