
VibeVoice TTS

Name: VibeVoice TTS
Author: javier887

by Hoddix · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

Downloads: 808

Install in OpenClaw

/install vibevoice

Description

Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.

Usage Guidance

This skill is internally consistent with its stated purpose, but consider the following before installing: (1) Unless the model weights are already present locally, the runtime will likely download them from Hugging Face (microsoft/VibeVoice-Realtime-0.5B), so expect heavy network use and large disk usage. (2) The torch/torchaudio install is large and may require CUDA tooling that matches your GPU; follow the official install docs for your environment. (3) The skill runs local Python code on your machine, so install only from trusted sources and inspect the VibeVoice repo you clone. (4) No credentials are required, but make sure you have sufficient GPU/VRAM, disk space, and bandwidth. For stricter isolation, clone and verify the upstream microsoft/VibeVoice repository yourself and run the script in an isolated environment (a container or dedicated VM).

Capability Analysis

Type: OpenClaw Skill
Name: vibevoice
Version: 1.0.0

The skill is classified as suspicious due to multiple critical shell injection vulnerabilities in `scripts/vv.sh`. User-controlled variables like `$VOICE`, `$SPEED`, and `$OUTPUT` are directly embedded into a Python heredoc and `ffmpeg` commands without proper sanitization, allowing for arbitrary command execution. Additionally, the `-f` argument allows reading arbitrary files, which could be combined with the shell injection for data exfiltration. While there is no clear evidence of intentional malicious behavior (e.g., self-exfiltration or backdoors), these severe vulnerabilities pose a significant security risk.
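
The injection risk described above comes from pasting user input into code or command strings. The following is a minimal Python sketch of one possible mitigation, not the skill's actual code: it validates each value against a strict policy and passes arguments to ffmpeg as a list, so no shell ever parses them. The function names, the voice-name pattern, and the 0.5 to 2.0 speed range are assumptions made for illustration; only the .ogg/.mp3/.wav formats come from the listing.

```python
import re
import subprocess
from pathlib import Path

# Hypothetical policy: a voice name is a plain identifier, never a path or code.
VOICE_RE = re.compile(r"[A-Za-z0-9_][A-Za-z0-9_.-]*")
# Output formats mentioned in the listing.
ALLOWED_SUFFIXES = {".ogg", ".mp3", ".wav"}


def validate_voice(voice: str) -> str:
    """Reject anything that is not a simple identifier (no slashes, quotes, spaces)."""
    if not VOICE_RE.fullmatch(voice):
        raise ValueError(f"invalid voice name: {voice!r}")
    return voice


def validate_speed(speed: str, low: float = 0.5, high: float = 2.0) -> float:
    """Parse speed as a number; the allowed range is an assumed example."""
    value = float(speed)  # raises ValueError for non-numeric input
    if not low <= value <= high:  # also rejects NaN and infinity
        raise ValueError(f"speed out of range: {speed!r}")
    return value


def validate_output(output: str) -> str:
    """Allow only known audio suffixes and refuse option-like paths."""
    if output.startswith("-"):
        raise ValueError("output path must not look like a command-line option")
    if Path(output).suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported output format: {output!r}")
    return output


def build_ffmpeg_argv(src_wav: str, output: str) -> list[str]:
    """Build an argument list; each element reaches ffmpeg verbatim, unparsed by a shell."""
    return ["ffmpeg", "-y", "-i", src_wav, validate_output(output)]


def convert(src_wav: str, output: str) -> None:
    """Run ffmpeg without shell=True, so metacharacters in paths are inert."""
    subprocess.run(build_ffmpeg_argv(src_wav, output), check=True)
```

The same idea applies to the Python heredoc: passing values through argv or environment variables, and reading them inside the Python code, avoids interpolating them into source text.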

Capability Assessment

✓ Purpose & Capability

Name/description (local Spanish TTS using Microsoft VibeVoice) match the provided scripts and README: it expects a local VibeVoice repo, Python + torch, and ffmpeg to produce .ogg/.mp3/.wav audio.

ℹ Instruction Scope

SKILL.md and the scripts instruct the user to clone the official Microsoft VibeVoice repo and create a venv. The runtime Python snippet calls from_pretrained('microsoft/VibeVoice-Realtime-0.5B'), which will attempt to download the model weights from Hugging Face if they are not present. This network activity and large download are not explicitly documented in SKILL.md. Otherwise the script stays within the TTS scope and reads only the provided text and local voice .pt files.
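
To find out in advance whether a run would trigger that download, one option is to look for the model in the local Hugging Face cache. The sketch below uses only the standard library and assumes the default hub cache layout (a `models--<org>--<name>/snapshots` folder under the hub cache directory, with `HF_HUB_CACHE`, `HF_HOME`, and `XDG_CACHE_HOME` as overrides). The helper names are hypothetical, and a present snapshot folder does not guarantee that every weight file is complete.

```python
import os
from pathlib import Path

MODEL_ID = "microsoft/VibeVoice-Realtime-0.5B"  # model id named in the listing


def hf_hub_cache_dir(env=None) -> Path:
    """Resolve the hub cache directory from the usual environment overrides."""
    env = os.environ if env is None else env
    if env.get("HF_HUB_CACHE"):
        return Path(env["HF_HUB_CACHE"]).expanduser()
    if env.get("HF_HOME"):
        return Path(env["HF_HOME"]).expanduser() / "hub"
    cache_root = env.get("XDG_CACHE_HOME") or str(Path.home() / ".cache")
    return Path(cache_root).expanduser() / "huggingface" / "hub"


def model_is_cached(repo_id: str = MODEL_ID, env=None) -> bool:
    """Return True if at least one snapshot of the model exists in the cache."""
    repo_dir = hf_hub_cache_dir(env) / ("models--" + repo_id.replace("/", "--"))
    snapshots = repo_dir / "snapshots"
    return snapshots.is_dir() and any(snapshots.iterdir())


if __name__ == "__main__":
    if model_is_cached():
        print("Model snapshot found locally; no large download expected.")
    else:
        print("Model not cached; the first run will likely download the weights.")
```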

✓ Install Mechanism

There is no automated install spec; the manual install steps clone the official GitHub repo and pip-install dependencies. This is a low-risk, expected install pattern (no obscure URLs or archives). Note: pip installing torch/torchaudio can be heavyweight and may pull CUDA-specific packages depending on environment.

✓ Credentials

The skill requests no credentials or special env vars. It uses optional env vars (VIBEVOICE_DIR, VIBEVOICE_VOICE, VIBEVOICE_SPEED) which are appropriate for configuration. No unrelated secrets or system paths are requested.
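
As an illustration of how these three optional variables could be consumed, here is a small Python sketch that reads them with fallbacks. Only the variable names come from the listing; the `VibeVoiceConfig` class, the `load_config` helper, and all default values are hypothetical placeholders, since the skill's real defaults are not stated here.

```python
import os
from dataclasses import dataclass
from pathlib import Path


@dataclass(frozen=True)
class VibeVoiceConfig:
    repo_dir: Path  # location of the cloned VibeVoice repo
    voice: str      # name of the voice preset to use
    speed: float    # playback speed multiplier


def load_config(env=None) -> VibeVoiceConfig:
    """Read the optional VIBEVOICE_* variables, falling back to placeholder defaults."""
    env = os.environ if env is None else env
    return VibeVoiceConfig(
        repo_dir=Path(env.get("VIBEVOICE_DIR", "~/VibeVoice")).expanduser(),  # assumed default
        voice=env.get("VIBEVOICE_VOICE", "default"),                          # assumed default
        speed=float(env.get("VIBEVOICE_SPEED", "1.0")),                       # assumed default
    )


if __name__ == "__main__":
    print(load_config())
```

Parsing the speed with `float` at load time means a malformed value fails early with a `ValueError` instead of reaching the synthesis step.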

✓ Persistence & Privilege

The skill does not request always:true and does not modify other skills or system-wide settings. It consists of instructions plus a CLI script that runs locally; no elevated persistence or privilege escalation is apparent.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install vibevoice
After installation, invoke the skill by name or use /vibevoice
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: Local Spanish TTS with Microsoft VibeVoice

Metadata

Slug vibevoice

Version 1.0.0

License —

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is VibeVoice TTS?

Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages. It is an AI Agent Skill for Claude Code / OpenClaw, with 808 downloads so far.

How do I install VibeVoice TTS?

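Run "/install vibevoice" in the OpenClaw or Claude Code chat to install the skill. The skill has no automated install spec, so the manual steps still apply: clone the official VibeVoice repo, create a venv, pip-install the dependencies, and make sure ffmpeg is available.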

Is VibeVoice TTS free?

Yes, VibeVoice TTS is free to download, install, and use. The listing does not state a license.

Which platforms does VibeVoice TTS support?

VibeVoice TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created VibeVoice TTS?

It is built and maintained by Hoddix (@javier887); the current version is v1.0.0.
