← Back to Skills Marketplace
conanwhf

Voice Clone Bot

by conanwhf · GitHub ↗ · v1.1.0 · MIT-0
cross-platform ⚠ suspicious
116
Downloads
0
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install voice-clone-bot
Description
Synthesize speech by cloning a user's voice from a reference audio sample, then reading generated text aloud in that cloned voice. Use this skill whenever th...
Usage Guidance
This skill does what it says (local voice cloning) but is intrusive: it auto-creates a Python venv, clones and pip-installs third-party repos, downloads multi‑GB model weights, registers itself into ~/.openclaw/skills, and launches a background daemon. Only install/run it if you trust the upstream repos and are comfortable with those filesystem and network changes. Recommended precautions: (1) review the install scripts and the cloned repos (ChatTTS, CosyVoice, OpenVoice) before running; (2) run the skill in an isolated environment or VM/container; (3) do not provide sensitive audio files until you verify where models and outputs are stored and who can access them; (4) if you install, keep an eye on ~/.openclaw/ and the daemon process and be prepared to run uninstall.sh --purge to remove model weights. If you want a less intrusive option, prefer a hosted TTS API or demand an explicit install step rather than letting the skill auto‑install itself.
Capability Analysis
Type: OpenClaw Skill Name: voice-clone-bot Version: 1.1.0 The voice-clone-bot skill bundle is a legitimate and well-structured tool for zero-shot voice cloning and text-to-speech synthesis. It supports multiple engines (F5-TTS, CosyVoice, ChatTTS, OpenVoice) and includes comprehensive automation for environment setup, dependency management, and model weight retrieval from reputable sources like GitHub and HuggingFace. The code follows safe practices, such as using list-based subprocess calls to prevent shell injection and implementing a local FastAPI daemon to manage heavy model weights in memory. The instructions in SKILL.md are purely functional, guiding the AI agent on how to handle audio inputs and generate spoken responses without any evidence of prompt injection or malicious intent.
Capability Assessment
Purpose & Capability
The declared purpose (voice cloning / TTS) matches the code: it installs heavy TTS engines, downloads model weights, and runs a local inference server. Creating a venv, installing torch/torchaudio/safetensors/f5-tts and cloning TTS repos is proportionate to implementing a local voice‑cloning daemon. However the skill also auto-registers itself into the host (~/.openclaw/skills) and creates a global model directory (~/.openclaw/models/voice-clone), which are broader, persistent host changes that go beyond a transient helper script.
Instruction Scope
SKILL.md instructs agents to call scripts/run_tts.sh with absolute ref_audio paths and optional output_dir. That script will auto-install dependencies, source .env, spawn a background FastAPI daemon, and accept arbitrary absolute paths for ref_audio and output_dir. The agent-provided absolute paths could allow the skill process to read arbitrary local files (if the agent passes such paths). The instructions grant the skill wide discretion to modify and run local services and write files outside the repo.
Install Mechanism
Although the registry shows no install spec, the repository includes auto_installer.sh and engine install scripts that: create venv, pip install server/requirements.txt (including torch, f5-tts, safetensors), git clone multiple external GitHub projects, and use huggingface_hub snapshot_download to fetch model weights. Those actions cause network downloads and execution of third‑party code inside the venv (pip install -e from cloned repos). This is moderate-to-high risk unless you trust the referenced upstream projects.
Credentials
The skill doesn't declare required env vars or credentials, which seems appropriate on paper. In practice it reads and writes .env, sets HF_HOME/MODELSCOPE_CACHE to ~/.openclaw/models/voice-clone, and will attempt to download models from Hugging Face and GitHub. If private models or private repos were needed, the installer could use user-supplied tokens (it doesn't explicitly require them), so credential access may be needed implicitly. No explicit unrelated credentials are requested by the skill.
Persistence & Privilege
The auto_installer.sh creates persistent state on the host: a venv, a global model directory (~/.openclaw/models/voice-clone), and registers the skill by creating a symlink in ~/.openclaw/skills. run_tts.sh will start a background daemon (nohup) that remains running. While that behavior is convenient for low-latency TTS, it gives the skill persistent presence and modifies host skill registration — both are privileged operations worth explicit user consent.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install voice-clone-bot
  3. After installation, invoke the skill by name or use /voice-clone-bot
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
Rename synchronization to voice-clone-bot, skill metadata alignment, docs/scripts naming sync, and skill-creator formatted release package (no TTS logic changes).
v1.0.0
Local Telegram voice-clone bot skill with model selection and long-form synthesis guidance.
Metadata
Slug voice-clone-bot
Version 1.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 2
Frequently Asked Questions

What is Voice Clone Bot?

Synthesize speech by cloning a user's voice from a reference audio sample, then reading generated text aloud in that cloned voice. Use this skill whenever th... It is an AI Agent Skill for Claude Code / OpenClaw, with 116 downloads so far.

How do I install Voice Clone Bot?

Run "/install voice-clone-bot" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Voice Clone Bot free?

Yes, Voice Clone Bot is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Voice Clone Bot support?

Voice Clone Bot is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Voice Clone Bot?

It is built and maintained by conanwhf (@conanwhf); the current version is v1.1.0.

💬 Comments