← 返回 Skills 市场
Voice Chat Skill
作者
fangkelvin
· GitHub ↗
· v1.0.0
364
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install voice-chat-skill
功能描述
语音对话集成技能,支持双向语音交流。使用TTS和STT实现完整的语音对话功能。
安全使用建议
This skill appears to implement the advertised voice chat features, but there are several inconsistencies you should address before installing or running it:
- Binaries and runtime: The skill metadata only requires Python, but the docs/examples show calling a Node-based OpenClaw TTS tool. If you plan to use OpenClaw TTS as shown, ensure Node and the referenced tts-tool script exist and come from a trusted source.
- Environment variables / API keys: The code references ELEVENLABS_API_KEY (and mentions OpenAI) but the skill doesn't declare or require any env vars. Do not provide API keys unless you trust the maintainer and have confirmed which keys are actually needed. Consider running the skill without keys to verify local-only behavior first.
- Missing files & hardcoded paths: README mentions install_deps.ps1 and launcher scripts that are not included, and a Windows-specific openclaw_path is hardcoded. Ask the publisher to clarify and remove or parameterize hardcoded paths.
- Network activity: Some modes check network connectivity (urllib to baidu) and the ElevenLabs example does outbound requests. If you need to keep audio data local for privacy, avoid enabling cloud STT/TTS or supplying cloud keys.
- Safe testing: Run the package in a controlled environment (sandbox or VM) first. Inspect or search for any unexpected subprocess invocations or external endpoints (the visible code shows only typical TTS/STT and benign subprocess/play commands, but the node path and example subprocess calls would execute external code if present).
If you want to proceed, request that the maintainer update the skill metadata to list all required binaries (node?), required env vars (ELEVENLABS_API_KEY, OPENAI_API_KEY if used), and provide the missing install/launcher scripts or remove references to them. That will make the skill's behavior transparent and easier to judge.
功能分析
Type: OpenClaw Skill
Name: voice-chat-skill
Version: 1.0.0
The skill bundle implements a voice chat system using speech-to-text (STT) and text-to-speech (TTS) functionalities. All network calls (Google STT, ElevenLabs TTS) and file system operations (temporary files for TTS, audio playback) are directly related to the stated purpose. There is no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the AI agent in any of the analyzed files. The use of `subprocess.run` for audio playback in `voice_chat.py` is done securely by passing arguments as a list, and the `shell=True` example in `SKILL.md` is contained with controlled inputs, not indicating malicious intent.
能力评估
Purpose & Capability
The name/description (双向语音对话 using TTS/STT) matches the included Python code (speech_recognition, pyaudio, local TTS hooks). However the documentation and examples also reference Node-based OpenClaw TTS tooling and cloud APIs (ElevenLabs, OpenAI, Whisper) while the skill's declared requirements list only python — a partial mismatch. The code hardcodes an OpenClaw path pointing at a Windows npm global folder (C:\Users\41728\AppData\Roaming\npm\node_modules\openclaw), which is unusual but could be an environment-specific convenience rather than malicious intent.
Instruction Scope
SKILL.md and code direct the agent to access the microphone, list devices, create temporary files, and optionally call external services (Google STT, Whisper local model, ElevenLabs TTS). The docs include a subprocess example that invokes `node path/to/openclaw/tts-tool.js` and the ElevenLabs example reads ELEVENLABS_API_KEY from the environment — but the skill metadata declares no required env vars. The README also references install_deps.ps1 and voice_chat_launcher.ps1 that are not present in the package. These gaps expand runtime behavior beyond what's declared and could surprise a user (e.g., network calls, usage of Node, optional API keys).
Install Mechanism
There is no install spec (instruction-only install), so nothing is automatically downloaded or executed by the platform. That lowers install-time risk. The included source is plain Python (no obfuscated code).
Credentials
The skill declares no required environment variables, but SKILL.md/code reference environment keys (e.g., ELEVENLABS_API_KEY via os.environ.get) and mention OpenAI API use. It also shows an example calling a Node tts-tool (which implies Node must be present). These undeclared dependencies/credentials are disproportionate to the declared metadata and should be declared explicitly so a user can decide whether to provide them.
Persistence & Privilege
The skill does not request always:true and is user-invocable only. It does not attempt to modify other skills or system-wide agent settings. No elevated persistence or privilege escalation is requested in the package.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install voice-chat-skill - 安装完成后,直接呼叫该 Skill 的名称或使用
/voice-chat-skill触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
voice-chat-skill v1.0.0
- Initial release of voice-chat skill, enabling bi-directional voice conversation.
- Supports speech-to-text (STT) using speech_recognition (Google, Whisper, etc.).
- Supports text-to-speech (TTS) through OpenClaw built-in tools and optional ElevenLabs API.
- Maintains conversation context and includes voice activity detection.
- Includes robust setup instructions, advanced configuration options, troubleshooting, and security guidelines.
元数据
常见问题
Voice Chat Skill 是什么?
语音对话集成技能,支持双向语音交流。使用TTS和STT实现完整的语音对话功能。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 364 次。
如何安装 Voice Chat Skill?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install voice-chat-skill」即可一键安装,无需额外配置。
Voice Chat Skill 是免费的吗?
是的,Voice Chat Skill 完全免费(开源免费),可自由下载、安装和使用。
Voice Chat Skill 支持哪些平台?
Voice Chat Skill 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Voice Chat Skill?
由 fangkelvin(@fangkelvin)开发并维护,当前版本 v1.0.0。
推荐 Skills