← 返回 Skills 市场
Voice Agent
作者
Wesley Armando
· GitHub ↗
· v1.0.0
· MIT-0
212
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install voice-agent-v1
功能描述
Enables autonomous cloning of your voice via ElevenLabs, converts text to speech, and deploys AI voice agents for automated inbound/outbound calls with Twili...
安全使用建议
Before installing: 1) Verify the registry metadata is updated to declare the exact env vars and credential requirements (ElevenLabs, Twilio, Telegram, or email/password). 2) Inspect the full SKILL.md and voice_generator.py (including truncated sections) to confirm there are no hidden endpoints or undisclosed notification channels. 3) Understand and consent to the skill using the virtual-desktop's active Google session (this lets it act as a logged-in user). 4) Run this skill in an isolated workspace or sandbox if you will allow it to create/write API keys and .env files. 5) If you don't trust automatic browser-based credential creation, opt for manual API key creation and supply only the minimally necessary credentials. 6) If Telegram notifications are required, confirm where the token is stored and that it won't be used to exfiltrate secrets. If any of these points are unclear or the registry metadata doesn't match the instructions, do not enable the skill until clarified.
功能分析
Type: OpenClaw Skill
Name: voice-agent-v1
Version: 1.0.0
The skill implements a 'self-configuration' routine in SKILL.md that instructs the AI agent to autonomously navigate the ElevenLabs dashboard via a virtual desktop to extract API keys and write them to the .env file. While aligned with the stated purpose of a 'voice agent,' this pattern of UI-based credential scraping and automated secret management is high-risk. Additionally, the skill possesses broad capabilities for automated outbound calling and workspace-wide file access (voice_generator.py, config.json), which could be leveraged for unauthorized activities if the agent's instructions are compromised.
能力评估
Purpose & Capability
The skill's stated purpose (voice cloning, TTS, Twilio calls) matches the included code and instructions, but the registry metadata omits the sensitive credentials and paths the SKILL.md actually uses. SKILL.md expects ElevenLabs API keys, optional Twilio credentials, Google browser sessions, and Telegram notifications — none of which are declared in the top-level requirements. That mismatch is a red flag for either sloppy packaging or hidden assumptions about available credentials/sessions.
Instruction Scope
Runtime instructions tell the agent to autonomously navigate elevenlabs.io via a virtual-desktop, perform Google OAuth sign-in using the active browser session or fill in email/password, create API keys, copy them, and write secrets into /workspace/voice/config.json and a .env file. The SKILL.md also references Telegram notifications and many workspace read/write paths. These instructions involve accessing and storing sensitive credentials and an external browser session — scope beyond simple TTS generation and worth explicit user consent and review.
Install Mechanism
There is no install spec (instruction-only) and only one Python script is included. That reduces supply-chain risk compared to arbitrary downloads, but the script performs network calls (ElevenLabs API) and writes files to the workspace. The lack of an install step does not remove the need to audit the included code for data exfiltration or hidden endpoints.
Credentials
The skill will create and store ELEVENLABS_API_KEY and voice IDs and may require ELEVENLABS_EMAIL/ELEVENLABS_PASSWORD and TWILIO_ACCOUNT_SID/AUTH_TOKEN, yet the registry metadata lists no required environment variables. Telegram notifications are described but no Telegram credential is declared. Requesting or grabbing an active Google session via virtual-desktop is sensitive and should be explicitly declared — its absence in the registry is disproportionate and inconsistent.
Persistence & Privilege
The skill writes secrets and config to workspace files (.env and config.json) and creates audit/error logs under the workspace; it also relies on a virtual-desktop skill to access browser sessions. While always:false (so it's not force-included), writing credentials and using another skill's browser session increases persistence/privilege — verify you want a component that can create and store API keys and use an active Google session on your behalf.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install voice-agent-v1 - 安装完成后,直接呼叫该 Skill 的名称或使用
/voice-agent-v1触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Voice Agent 1.0.0 — Major overhaul: now fully automates voice cloning, ElevenLabs setup, and conversational AI calling, with self-configuration and Telegram notifications.
- Full migration from local API & Whisper/AWS Polly to ElevenLabs: agent now clones voice, generates MP3s, and runs calls via Twilio.
- Autonomous self-configuration: navigates ElevenLabs dashboard via virtual-desktop to generate API keys, clone principal’s voice, and create agents.
- Local scripts and legacy audio handling removed; all config handled in config.json, voice_generator.py, and supporting docs.
- Sends Telegram updates on configuration, voice readiness, audio generation, and call outcomes.
- No manual API key setup required: agent fetches credentials and completes onboarding autonomously.
- Supports audio-driven workflows, scalable text-to-speech, and automated inbound/outbound calls.
元数据
常见问题
Voice Agent 是什么?
Enables autonomous cloning of your voice via ElevenLabs, converts text to speech, and deploys AI voice agents for automated inbound/outbound calls with Twili... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 212 次。
如何安装 Voice Agent?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install voice-agent-v1」即可一键安装,无需额外配置。
Voice Agent 是免费的吗?
是的,Voice Agent 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Voice Agent 支持哪些平台?
Voice Agent 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Voice Agent?
由 Wesley Armando(@georges91560)开发并维护,当前版本 v1.0.0。
推荐 Skills