Downloads: 808
Stars: 0
Active Installs: 0
Versions: 1
Install in OpenClaw
/install vibevoice
Description
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
Usage Guidance
This skill is internally consistent with its stated purpose, but before installing consider:
- The runtime will likely download large model weights from Hugging Face (microsoft/VibeVoice-Realtime-0.5B) unless you already have them locally; expect heavy network use and large disk usage.
- Installing torch/torchaudio can be large and may require CUDA/tooling matching your GPU; follow official install docs for your environment.
- The skill runs local Python code which will execute on your machine; only install from trusted sources and inspect the VibeVoice repo you clone.
- No credentials are required, but ensure you have sufficient GPU/VRAM, disk space, and bandwidth.
If you want to be stricter, clone and verify the upstream microsoft/VibeVoice repository yourself and run the script in an isolated environment (container or dedicated VM).
Capability Analysis
Type: OpenClaw Skill
Name: vibevoice
Version: 1.0.0
The skill is classified as suspicious due to multiple critical shell injection vulnerabilities in `scripts/vv.sh`. User-controlled variables like `$VOICE`, `$SPEED`, and `$OUTPUT` are directly embedded into a Python heredoc and `ffmpeg` commands without proper sanitization, allowing for arbitrary command execution. Additionally, the `-f` argument allows reading arbitrary files, which could be combined with the shell injection for data exfiltration. While there is no clear evidence of intentional malicious behavior (e.g., self-exfiltration or backdoors), these severe vulnerabilities pose a significant security risk.
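One common way to close this class of injection is to validate each user-supplied value and pass arguments as a list, so that no shell ever parses them. The following is a minimal Python sketch of that idea, not the code in `scripts/vv.sh`. The function names, the voice-name pattern, the speed bounds, and the use of ffmpeg's `atempo` filter for speed are all assumptions made for illustration.

```python
import re
import subprocess
from pathlib import Path

# Hypothetical allow-lists; the real script's accepted values are not documented here.
VOICE_PATTERN = re.compile(r"[A-Za-z0-9_-]+")
ALLOWED_SUFFIXES = {".ogg", ".mp3", ".wav"}


def validate_inputs(voice: str, speed: str, output: str) -> tuple[str, float, Path]:
    """Accept only a plain voice name, a bounded number, and an audio file path."""
    if not VOICE_PATTERN.fullmatch(voice):
        raise ValueError(f"invalid voice name: {voice!r}")
    speed_value = float(speed)  # raises ValueError for non-numeric input
    if not 0.5 <= speed_value <= 2.0:  # assumed bounds; also rejects nan and inf
        raise ValueError(f"speed out of range: {speed!r}")
    out_path = Path(output)
    if out_path.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported output type: {output!r}")
    return voice, speed_value, out_path


def build_ffmpeg_command(wav_path: Path, out_path: Path, speed: float) -> list[str]:
    """Build an argument list; each element reaches ffmpeg as a single argument."""
    return [
        "ffmpeg", "-y",
        "-i", str(wav_path.resolve()),   # absolute paths cannot be mistaken for options
        "-filter:a", f"atempo={speed}",  # assumed way of applying the speed setting
        str(out_path.resolve()),
    ]


def convert(wav_path: Path, out_path: Path, speed: float) -> None:
    """Run ffmpeg without a shell, so metacharacters in values are never interpreted."""
    subprocess.run(build_ffmpeg_command(wav_path, out_path, speed), check=True)
```

With this shape, a value such as `x; rm -rf ~` is rejected by `validate_inputs` before any process is started, and even an accepted value is never interpolated into a command string or a heredoc.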
Capability Assessment
Purpose & Capability
Name/description (local Spanish TTS using Microsoft VibeVoice) match the provided scripts and README: it expects a local VibeVoice repo, Python + torch, and ffmpeg to produce .ogg/.mp3/.wav audio.
Instruction Scope
SKILL.md and scripts instruct cloning the official Microsoft VibeVoice repo and creating a venv. The runtime Python snippet calls from_pretrained('microsoft/VibeVoice-Realtime-0.5B'), which will attempt to download model weights from Hugging Face if they are not already present; this network activity and the size of the download are not explicitly documented in SKILL.md. Otherwise the script stays within the TTS scope and only reads provided text and local voice .pt files.
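To anticipate that download, one can check whether the weights already appear in the local Hugging Face cache before running the skill. The sketch below uses only the standard library and assumes the usual hub cache layout (`models--<org>--<name>` under the directory given by `HF_HUB_CACHE`, or `HF_HOME/hub`, or `~/.cache/huggingface/hub`). The helper names are hypothetical, and a positive result only suggests that a snapshot exists, not that it is complete.

```python
import os
from pathlib import Path

MODEL_ID = "microsoft/VibeVoice-Realtime-0.5B"


def hub_cache_dir() -> Path:
    """Resolve the hub cache directory from the commonly used environment variables."""
    explicit = os.environ.get("HF_HUB_CACHE")
    if explicit:
        return Path(explicit)
    # Assumed default location when HF_HOME is not set.
    hf_home = os.environ.get("HF_HOME") or str(Path.home() / ".cache" / "huggingface")
    return Path(hf_home) / "hub"


def model_cache_path(model_id: str = MODEL_ID) -> Path:
    """Map 'org/name' to the 'models--org--name' folder used by the hub cache."""
    return hub_cache_dir() / ("models--" + model_id.replace("/", "--"))


def is_probably_cached(model_id: str = MODEL_ID) -> bool:
    """True if at least one snapshot directory exists for the model."""
    snapshots = model_cache_path(model_id) / "snapshots"
    return snapshots.is_dir() and any(snapshots.iterdir())


if __name__ == "__main__":
    state = "found in cache" if is_probably_cached() else "not cached; expect a download"
    print(f"{MODEL_ID}: {state}")
```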
Install Mechanism
There is no automated install spec; the manual install steps clone the official GitHub repo and pip-install dependencies. This is a low-risk, expected install pattern (no obscure URLs or archives). Note: pip installing torch/torchaudio can be heavyweight and may pull CUDA-specific packages depending on environment.
Credentials
The skill requests no credentials or special env vars. It uses optional env vars (VIBEVOICE_DIR, VIBEVOICE_VOICE, VIBEVOICE_SPEED) which are appropriate for configuration. No unrelated secrets or system paths are requested.
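As an illustration of how these optional variables might be consumed, here is a small Python sketch that reads them into a typed configuration object. The variable names come from the assessment above; the default values (`~/VibeVoice`, `default`, `1.0`) and the `VibeVoiceConfig` and `load_config` names are hypothetical and are not taken from the skill.

```python
import os
from dataclasses import dataclass
from pathlib import Path
from typing import Mapping, Optional


@dataclass(frozen=True)
class VibeVoiceConfig:
    repo_dir: Path
    voice: str
    speed: float


def load_config(env: Optional[Mapping[str, str]] = None) -> VibeVoiceConfig:
    """Read the optional VIBEVOICE_* variables, falling back to placeholder defaults."""
    env = os.environ if env is None else env
    return VibeVoiceConfig(
        # All three defaults below are illustrative placeholders.
        repo_dir=Path(env.get("VIBEVOICE_DIR", "~/VibeVoice")).expanduser(),
        voice=env.get("VIBEVOICE_VOICE", "default"),
        speed=float(env.get("VIBEVOICE_SPEED", "1.0")),  # ValueError if not numeric
    )
```

Parsing the speed with `float` at load time means a malformed `VIBEVOICE_SPEED` fails early with a clear error instead of being passed along as an unchecked string.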
Persistence & Privilege
Skill does not request always:true and does not modify other skills or system-wide settings. It's instruction-only plus a CLI script that runs locally — no elevated persistence or privilege escalations are apparent.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install vibevoice
- After installation, invoke the skill by name or use /vibevoice
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: Local Spanish TTS with Microsoft VibeVoice
Frequently Asked Questions
What is VibeVoice TTS?
VibeVoice TTS is an AI agent skill for Claude Code / OpenClaw that provides local Spanish TTS using Microsoft VibeVoice. It generates natural voice audio from text, optimized for WhatsApp voice messages, and has 808 downloads so far.
How do I install VibeVoice TTS?
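Run "/install vibevoice" in the OpenClaw or Claude Code chat to install the skill itself. The manual setup noted in the capability assessment (cloning the VibeVoice repo, creating a venv, pip-installing dependencies, and having ffmpeg available) is still required before it can generate audio.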
Is VibeVoice TTS free?
Yes, VibeVoice TTS is completely free (open-source). You can download, install and use it at no cost.
Which platforms does VibeVoice TTS support?
VibeVoice TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created VibeVoice TTS?
It is built and maintained by Hoddix (@javier887); the current version is v1.0.0.