← Back to Skills Marketplace
Voice Chat Skill
by
fangkelvin
· GitHub ↗
· v1.0.0
364
Downloads
0
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install voice-chat-skill
Description
语音对话集成技能,支持双向语音交流。使用TTS和STT实现完整的语音对话功能。
Usage Guidance
This skill appears to implement the advertised voice chat features, but there are several inconsistencies you should address before installing or running it:
- Binaries and runtime: The skill metadata only requires Python, but the docs/examples show calling a Node-based OpenClaw TTS tool. If you plan to use OpenClaw TTS as shown, ensure Node and the referenced tts-tool script exist and come from a trusted source.
- Environment variables / API keys: The code references ELEVENLABS_API_KEY (and mentions OpenAI) but the skill doesn't declare or require any env vars. Do not provide API keys unless you trust the maintainer and have confirmed which keys are actually needed. Consider running the skill without keys to verify local-only behavior first.
- Missing files & hardcoded paths: README mentions install_deps.ps1 and launcher scripts that are not included, and a Windows-specific openclaw_path is hardcoded. Ask the publisher to clarify and remove or parameterize hardcoded paths.
- Network activity: Some modes check network connectivity (urllib to baidu) and the ElevenLabs example does outbound requests. If you need to keep audio data local for privacy, avoid enabling cloud STT/TTS or supplying cloud keys.
- Safe testing: Run the package in a controlled environment (sandbox or VM) first. Inspect or search for any unexpected subprocess invocations or external endpoints (the visible code shows only typical TTS/STT and benign subprocess/play commands, but the node path and example subprocess calls would execute external code if present).
If you want to proceed, request that the maintainer update the skill metadata to list all required binaries (node?), required env vars (ELEVENLABS_API_KEY, OPENAI_API_KEY if used), and provide the missing install/launcher scripts or remove references to them. That will make the skill's behavior transparent and easier to judge.
Capability Analysis
Type: OpenClaw Skill
Name: voice-chat-skill
Version: 1.0.0
The skill bundle implements a voice chat system using speech-to-text (STT) and text-to-speech (TTS) functionalities. All network calls (Google STT, ElevenLabs TTS) and file system operations (temporary files for TTS, audio playback) are directly related to the stated purpose. There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the AI agent in any of the analyzed files. The use of `subprocess.run` for audio playback in `voice_chat.py` is done securely by passing arguments as a list, and the `shell=True` example in `SKILL.md` is contained with controlled inputs, not indicating malicious intent.
Capability Assessment
Purpose & Capability
The name/description (双向语音对话 using TTS/STT) matches the included Python code (speech_recognition, pyaudio, local TTS hooks). However the documentation and examples also reference Node-based OpenClaw TTS tooling and cloud APIs (ElevenLabs, OpenAI, Whisper) while the skill's declared requirements list only python — a partial mismatch. The code hardcodes an OpenClaw path pointing at a Windows npm global folder (C:\Users\41728\AppData\Roaming\npm\node_modules\openclaw), which is unusual but could be an environment-specific convenience rather than malicious intent.
Instruction Scope
SKILL.md and code direct the agent to access the microphone, list devices, create temporary files, and optionally call external services (Google STT, Whisper local model, ElevenLabs TTS). The docs include a subprocess example that invokes `node path/to/openclaw/tts-tool.js` and the ElevenLabs example reads ELEVENLABS_API_KEY from the environment — but the skill metadata declares no required env vars. The README also references install_deps.ps1 and voice_chat_launcher.ps1 that are not present in the package. These gaps expand runtime behavior beyond what's declared and could surprise a user (e.g., network calls, usage of Node, optional API keys).
Install Mechanism
There is no install spec (instruction-only install), so nothing is automatically downloaded or executed by the platform. That lowers install-time risk. The included source is plain Python (no obfuscated code).
Credentials
The skill declares no required environment variables, but SKILL.md/code reference environment keys (e.g., ELEVENLABS_API_KEY via os.environ.get) and mention OpenAI API use. It also shows an example calling a Node tts-tool (which implies Node must be present). These undeclared dependencies/credentials are disproportionate to the declared metadata and should be declared explicitly so a user can decide whether to provide them.
Persistence & Privilege
The skill does not request always:true and is user-invocable only. It does not attempt to modify other skills or system-wide agent settings. No elevated persistence or privilege escalation is requested in the package.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install voice-chat-skill - After installation, invoke the skill by name or use
/voice-chat-skill - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
voice-chat-skill v1.0.0
- Initial release of voice-chat skill, enabling bi-directional voice conversation.
- Supports speech-to-text (STT) using speech_recognition (Google, Whisper, etc.).
- Supports text-to-speech (TTS) through OpenClaw built-in tools and optional ElevenLabs API.
- Maintains conversation context and includes voice activity detection.
- Includes robust setup instructions, advanced configuration options, troubleshooting, and security guidelines.
Metadata
Frequently Asked Questions
What is Voice Chat Skill?
语音对话集成技能,支持双向语音交流。使用TTS和STT实现完整的语音对话功能。 It is an AI Agent Skill for Claude Code / OpenClaw, with 364 downloads so far.
How do I install Voice Chat Skill?
Run "/install voice-chat-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Voice Chat Skill free?
Yes, Voice Chat Skill is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Voice Chat Skill support?
Voice Chat Skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Voice Chat Skill?
It is built and maintained by fangkelvin (@fangkelvin); the current version is v1.0.0.
More Skills