← 返回 Skills 市场
Voice Agent Pro
作者
Wesley Armando
· GitHub ↗
· v1.0.7
· MIT-0
220
总下载
0
收藏
0
当前安装
7
版本数
在 OpenClaw 中安装
/install voice-agent-pro-v1
功能描述
Gives any OpenClaw agent a complete voice layer via ElevenLabs. Clones the principal's voice from audio samples, converts any text to MP3 audio (VSL, podcast...
安全使用建议
This skill can do what it says (clone voices, generate TTS, place calls) but there are red flags you should address before installing:
- Resolve the metadata mismatch: the registry lists NO required env vars but SKILL.md requests ELEVENLABS_*, TWILIO_*, and TELEGRAM_* — assume those credentials are needed and treat them as sensitive.
- Do NOT let the skill or agent write credentials into your host-wide .env or other global config unless you understand the impact. Prefer creating dedicated, limited-scope API keys and store them in a separate, isolated config for this skill.
- Avoid enabling the 'virtual-desktop' automated path unless you trust the virtual-desktop skill and accept that the agent may access an active browser session (this can expose Google sessions and other credentials).
- Test in an isolated environment (throwaway container/VPS) first. Use temporary API keys, restrict Twilio numbers and callback URLs, and monitor network traffic.
- Review the included voice_generator.py and any Twilio/Telegram integration code (the repository contains code) to confirm it only calls expected endpoints and that logs/audits do not leak sensitive inputs.
- If you plan to make calls, check legal/compliance implications for cloned voices and automated calling in your jurisdiction; ensure consent and opt-out handling.
If you cannot validate the above, treat this skill as high-risk and consider alternative, officially supported integrations or keep it disabled until you can audit and scope its permissions.
功能分析
Type: OpenClaw Skill
Name: voice-agent-pro-v1
Version: 1.0.7
The voice-agent-pro-v3 skill provides a legitimate voice layer for OpenClaw agents, enabling voice cloning, text-to-speech, and automated calling via ElevenLabs and Twilio. The bundle includes a well-structured Python CLI (voice_generator.py) that handles API interactions and local file management securely, avoiding shell injection vulnerabilities. While the SKILL.md instructions include high-privilege setup steps like automated browser navigation for API key retrieval and system-level package installation, these are transparently documented as necessary configuration paths for the stated functionality. No evidence of malicious intent, unauthorized data exfiltration, or hidden backdoors was found.
能力评估
Purpose & Capability
The skill claims to be a voice/TTS/calling layer (ElevenLabs + Twilio + Telegram), which explains needing those credentials — but the registry metadata (top-level requirements) lists no required env vars or config paths while SKILL.md metadata and README explicitly require multiple secrets and many read/write paths. This mismatch (registry says 'none' but SKILL.md requires ELEVENLABS_*, TWILIO_*, TELEGRAM_*, and specific host/workspace paths) is an incoherence that should be resolved.
Instruction Scope
SKILL.md instructs the agent to install packages inside the OpenClaw container (pip install elevenlabs, twilio; apt-get ffmpeg), to read/write /workspace and host .env files, and offers an automated 'virtual-desktop' path that has the agent use an active Google browser session to create an API key. The instructions therefore go beyond simple TTS: they instruct host-level modifications and recommend automated browser actions that could access third‑party sessions — this expands the attack surface and scope of what the agent may touch.
Install Mechanism
There is no formal install spec in the registry (instruction-only) but SKILL.md expects the agent to run pip/apt commands inside the container. Installing third-party Python packages (elevenlabs, twilio) and apt packages via instructions is common but carries supply-chain risk; no external arbitrary download URLs are used. The use of pip with --break-system-packages is noted (less safe on some systems).
Credentials
The requested environment access is extensive but mostly aligned with the declared functionality (ElevenLabs API key/voice id for cloning/TTS, Twilio creds for calls, Telegram token/chat for notifications). However SKILL.md and README also instruct writing credentials into host-level .env (/docker/openclaw-yyvg/.env) and to read many workspace files; the registry incorrectly declared no env requirements. Writing/updating a host .env for the agent is a privilege that affects other services and is disproportionate unless the user intends that change.
Persistence & Privilege
The skill is not always:true, but instructions explicitly tell the agent (or operator) to write credentials into host docker .env and to restart the OpenClaw container, which modifies system-wide agent configuration. The SKILL.md also suggests using an automated 'virtual-desktop' to create keys from an active Google session — combining autonomous agent actions with access to browser sessions and host env files increases risk. The skill does write audit/error logs to workspace paths, further persisting state.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install voice-agent-pro-v1 - 安装完成后,直接呼叫该 Skill 的名称或使用
/voice-agent-pro-v1触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.7
- Added required Twilio environment variables (TWILIO_ACCOUNT_SID, TWILIO_AUTH_TOKEN, TWILIO_PHONE_NUMBER) to enable full integration for voice call features.
- Updated SKILL metadata to ensure all necessary environment variables are documented for both ElevenLabs and Twilio.
- Expanded write permissions on /workspace/.learnings/ to allow broader logging or error capture.
- No code changes; updates are limited to improved environment documentation.
v1.0.6
No file changes detected in this version.
- No code or documentation changes; functionality remains unchanged.
- Version or metadata may have been updated without modifications to skill files.
v1.0.5
No file changes detected in this version.
- No new features, fixes, or updates in this release.
- Skill version, structure, and requirements remain unchanged.
v1.0.4
No functional or documentation changes in this release.
- No file changes detected for version 1.0.4.
- Skill behavior and documentation remain unchanged.
v1.0.3
Version 1.0.3
- Added detailed setup instructions in a new file: setup_guide.md
- Users now have a dedicated guide for configuring credentials and preparing required voice samples.
v1.0.2
No file changes detected; no updates in this release.
v1.0.1
**Version 1.0.1 — Major transition to autonomous voice configuration and deployment via ElevenLabs.**
- Rebranded skill as "voice-agent-pro" and removed legacy files related to Vapi/Retell/Bland.ai.
- Added autonomous self-configuration: agent now navigates ElevenLabs using the virtual-desktop skill to create API keys, clone voices, and store credentials without manual API setup.
- Added detailed agent prompt, configuration file, and readme to support new workflow.
- Integrated direct text-to-speech (MP3), voice cloning, and Twilio-based conversational call handling.
- Implemented automated Telegram notifications for configuration steps, voice readiness, and call outcomes.
元数据
常见问题
Voice Agent Pro 是什么?
Gives any OpenClaw agent a complete voice layer via ElevenLabs. Clones the principal's voice from audio samples, converts any text to MP3 audio (VSL, podcast... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 220 次。
如何安装 Voice Agent Pro?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install voice-agent-pro-v1」即可一键安装,无需额外配置。
Voice Agent Pro 是免费的吗?
是的,Voice Agent Pro 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Voice Agent Pro 支持哪些平台?
Voice Agent Pro 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Voice Agent Pro?
由 Wesley Armando(@georges91560)开发并维护,当前版本 v1.0.7。
推荐 Skills