← 返回 Skills 市场
808
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install vibevoice
功能描述
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages.
安全使用建议
This skill is internally consistent with its stated purpose, but before installing consider: (1) The runtime will likely download large model weights from Hugging Face (microsoft/VibeVoice-Realtime-0.5B) unless you already have them locally — expect heavy network use and large disk usage. (2) Installing torch/torchaudio can be large and may require CUDA/tooling matching your GPU; follow official install docs for your environment. (3) The skill runs local Python code which will execute on your machine — only install from trusted sources and inspect the VibeVoice repo you clone. (4) No credentials are required, but ensure you have sufficient GPU/VRAM, disk space, and bandwidth. If you want to be stricter, clone and verify the upstream microsoft/VibeVoice repository yourself and run the script in an isolated environment (container or dedicated VM).
功能分析
Type: OpenClaw Skill
Name: vibevoice
Version: 1.0.0
The skill is classified as suspicious due to multiple critical shell injection vulnerabilities in `scripts/vv.sh`. User-controlled variables like `$VOICE`, `$SPEED`, and `$OUTPUT` are directly embedded into a Python heredoc and `ffmpeg` commands without proper sanitization, allowing for arbitrary command execution. Additionally, the `-f` argument allows reading arbitrary files, which could be combined with the shell injection for data exfiltration. While there is no clear evidence of intentional malicious behavior (e.g., self-exfiltration or backdoors), these severe vulnerabilities pose a significant security risk.
能力评估
Purpose & Capability
Name/description (local Spanish TTS using Microsoft VibeVoice) match the provided scripts and README: it expects a local VibeVoice repo, Python + torch, and ffmpeg to produce .ogg/.mp3/.wav audio.
Instruction Scope
SKILL.md and scripts instruct cloning the official Microsoft VibeVoice repo and creating a venv. The runtime python snippet calls from_pretrained('microsoft/VibeVoice-Realtime-0.5B') which will attempt to download model weights from Hugging Face if not present — this network activity and large download is not explicitly documented in SKILL.md. Otherwise the script stays within the TTS scope and only reads provided text and local voice .pt files.
Install Mechanism
There is no automated install spec; the manual install steps clone the official GitHub repo and pip-install dependencies. This is a low-risk, expected install pattern (no obscure URLs or archives). Note: pip installing torch/torchaudio can be heavyweight and may pull CUDA-specific packages depending on environment.
Credentials
The skill requests no credentials or special env vars. It uses optional env vars (VIBEVOICE_DIR, VIBEVOICE_VOICE, VIBEVOICE_SPEED) which are appropriate for configuration. No unrelated secrets or system paths are requested.
Persistence & Privilege
Skill does not request always:true and does not modify other skills or system-wide settings. It's instruction-only plus a CLI script that runs locally — no elevated persistence or privilege escalations are apparent.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install vibevoice - 安装完成后,直接呼叫该 Skill 的名称或使用
/vibevoice触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: Local Spanish TTS with Microsoft VibeVoice
元数据
常见问题
VibeVoice TTS 是什么?
Local Spanish TTS using Microsoft VibeVoice. Generate natural voice audio from text, optimized for WhatsApp voice messages. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 808 次。
如何安装 VibeVoice TTS?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install vibevoice」即可一键安装,无需额外配置。
VibeVoice TTS 是免费的吗?
是的,VibeVoice TTS 完全免费(开源免费),可自由下载、安装和使用。
VibeVoice TTS 支持哪些平台?
VibeVoice TTS 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 VibeVoice TTS?
由 Hoddix(@javier887)开发并维护,当前版本 v1.0.0。
推荐 Skills