← Back to Skills Marketplace

Voice Agent Pro

Name: Voice Agent Pro
Author: georges91560

by Wesley Armando · GitHub ↗ · v1.0.7 · MIT-0

cross-platform ⚠ suspicious

220

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install voice-agent-pro-v1

Description

Gives any OpenClaw agent a complete voice layer via ElevenLabs. Clones the principal's voice from audio samples, converts any text to MP3 audio (VSL, podcast...

Usage Guidance

This skill can do what it says (clone voices, generate TTS, place calls) but there are red flags you should address before installing: - Resolve the metadata mismatch: the registry lists NO required env vars but SKILL.md requests ELEVENLABS_*, TWILIO_*, and TELEGRAM_* — assume those credentials are needed and treat them as sensitive. - Do NOT let the skill or agent write credentials into your host-wide .env or other global config unless you understand the impact. Prefer creating dedicated, limited-scope API keys and store them in a separate, isolated config for this skill. - Avoid enabling the 'virtual-desktop' automated path unless you trust the virtual-desktop skill and accept that the agent may access an active browser session (this can expose Google sessions and other credentials). - Test in an isolated environment (throwaway container/VPS) first. Use temporary API keys, restrict Twilio numbers and callback URLs, and monitor network traffic. - Review the included voice_generator.py and any Twilio/Telegram integration code (the repository contains code) to confirm it only calls expected endpoints and that logs/audits do not leak sensitive inputs. - If you plan to make calls, check legal/compliance implications for cloned voices and automated calling in your jurisdiction; ensure consent and opt-out handling. If you cannot validate the above, treat this skill as high-risk and consider alternative, officially supported integrations or keep it disabled until you can audit and scope its permissions.

Capability Analysis

Type: OpenClaw Skill Name: voice-agent-pro-v1 Version: 1.0.7 The voice-agent-pro-v3 skill provides a legitimate voice layer for OpenClaw agents, enabling voice cloning, text-to-speech, and automated calling via ElevenLabs and Twilio. The bundle includes a well-structured Python CLI (voice_generator.py) that handles API interactions and local file management securely, avoiding shell injection vulnerabilities. While the SKILL.md instructions include high-privilege setup steps like automated browser navigation for API key retrieval and system-level package installation, these are transparently documented as necessary configuration paths for the stated functionality. No evidence of malicious intent, unauthorized data exfiltration, or hidden backdoors was found.

Capability Assessment

⚠ Purpose & Capability

The skill claims to be a voice/TTS/calling layer (ElevenLabs + Twilio + Telegram), which explains needing those credentials — but the registry metadata (top-level requirements) lists no required env vars or config paths while SKILL.md metadata and README explicitly require multiple secrets and many read/write paths. This mismatch (registry says 'none' but SKILL.md requires ELEVENLABS_*, TWILIO_*, TELEGRAM_*, and specific host/workspace paths) is an incoherence that should be resolved.

⚠ Instruction Scope

SKILL.md instructs the agent to install packages inside the OpenClaw container (pip install elevenlabs, twilio; apt-get ffmpeg), to read/write /workspace and host .env files, and offers an automated 'virtual-desktop' path that has the agent use an active Google browser session to create an API key. The instructions therefore go beyond simple TTS: they instruct host-level modifications and recommend automated browser actions that could access third‑party sessions — this expands the attack surface and scope of what the agent may touch.

ℹ Install Mechanism

There is no formal install spec in the registry (instruction-only) but SKILL.md expects the agent to run pip/apt commands inside the container. Installing third-party Python packages (elevenlabs, twilio) and apt packages via instructions is common but carries supply-chain risk; no external arbitrary download URLs are used. The use of pip with --break-system-packages is noted (less safe on some systems).

⚠ Credentials

The requested environment access is extensive but mostly aligned with the declared functionality (ElevenLabs API key/voice id for cloning/TTS, Twilio creds for calls, Telegram token/chat for notifications). However SKILL.md and README also instruct writing credentials into host-level .env (/docker/openclaw-yyvg/.env) and to read many workspace files; the registry incorrectly declared no env requirements. Writing/updating a host .env for the agent is a privilege that affects other services and is disproportionate unless the user intends that change.

⚠ Persistence & Privilege

The skill is not always:true, but instructions explicitly tell the agent (or operator) to write credentials into host docker .env and to restart the OpenClaw container, which modifies system-wide agent configuration. The SKILL.md also suggests using an automated 'virtual-desktop' to create keys from an active Google session — combining autonomous agent actions with access to browser sessions and host env files increases risk. The skill does write audit/error logs to workspace paths, further persisting state.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install voice-agent-pro-v1
After installation, invoke the skill by name or use /voice-agent-pro-v1
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.7

- Added required Twilio environment variables (TWILIO_ACCOUNT_SID, TWILIO_AUTH_TOKEN, TWILIO_PHONE_NUMBER) to enable full integration for voice call features. - Updated SKILL metadata to ensure all necessary environment variables are documented for both ElevenLabs and Twilio. - Expanded write permissions on /workspace/.learnings/ to allow broader logging or error capture. - No code changes; updates are limited to improved environment documentation.

v1.0.6

No file changes detected in this version. - No code or documentation changes; functionality remains unchanged. - Version or metadata may have been updated without modifications to skill files.

v1.0.5

No file changes detected in this version. - No new features, fixes, or updates in this release. - Skill version, structure, and requirements remain unchanged.

v1.0.4

No functional or documentation changes in this release. - No file changes detected for version 1.0.4. - Skill behavior and documentation remain unchanged.

v1.0.3

Version 1.0.3 - Added detailed setup instructions in a new file: setup_guide.md - Users now have a dedicated guide for configuring credentials and preparing required voice samples.

v1.0.2

No file changes detected; no updates in this release.

v1.0.1

**Version 1.0.1 — Major transition to autonomous voice configuration and deployment via ElevenLabs.** - Rebranded skill as "voice-agent-pro" and removed legacy files related to Vapi/Retell/Bland.ai. - Added autonomous self-configuration: agent now navigates ElevenLabs using the virtual-desktop skill to create API keys, clone voices, and store credentials without manual API setup. - Added detailed agent prompt, configuration file, and readme to support new workflow. - Integrated direct text-to-speech (MP3), voice cloning, and Twilio-based conversational call handling. - Implemented automated Telegram notifications for configuration steps, voice readiness, and call outcomes.

Metadata

Slug voice-agent-pro-v1

Version 1.0.7

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 7

Frequently Asked Questions

What is Voice Agent Pro?

Gives any OpenClaw agent a complete voice layer via ElevenLabs. Clones the principal's voice from audio samples, converts any text to MP3 audio (VSL, podcast... It is an AI Agent Skill for Claude Code / OpenClaw, with 220 downloads so far.

How do I install Voice Agent Pro?

Run "/install voice-agent-pro-v1" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Voice Agent Pro free?

Yes, Voice Agent Pro is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Voice Agent Pro support?

Voice Agent Pro is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Voice Agent Pro?

It is built and maintained by Wesley Armando (@georges91560); the current version is v1.0.7.

More Skills