← 返回 Skills 市场
patrickgeek

Bidirectional Voice Chat System

作者 patrickgeek · GitHub ↗ · v1.1.0
cross-platform ⚠ suspicious
426
总下载
0
收藏
1
当前安装
1
版本数
在 OpenClaw 中安装
/install voice-chat-bridge
功能描述
双向语音对话系统 - 语音识别转文字 + Edge TTS语音合成 + Cloudflare Tunnel公网访问
安全使用建议
What to check before installing or running this skill: - Missing scripts: SKILL.md references hotkey_recorder.py, voice_chat_loop.py, chat.py and other runtime components that are not included. Ask the author for the missing files or disable features that call them. Running instructions that reference absent scripts will fail or be misleading. - Public exposure risk: If you enable 'ngrok' / 'cloudflared' modes or set a real domain, generated voice files under ~/.openclaw/workspace/voice_output will become reachable from the Internet. The bundled HTTP server suppresses access logs, so traffic may not be visible locally. Only expose this if you understand who can access the URLs and you are comfortable with voice data being public. - Credentials and tokens: The skill metadata does not declare any credentials, but ngrok/cloudflared require auth tokens (and Cloudflare Tunnel may require zone credentials). Manage those tokens carefully; do not paste them into untrusted code. The skill does not automatically upload data to any remote service in the provided code, but Edge TTS (edge-tts CLI) likely uses an online service — check its privacy policy. - Telemetry / monitoring: daily_monitor.py writes local reports and runs a local test that invokes generate_voice.py. It does not appear to exfiltrate telemetry, but the code refers to ClawHub stats ('需手动从 ClawHub 获取') without automated upload. If you are uncomfortable with local reports under ~/.openclaw/workspace/memory, inspect or remove that script. - Run in a sandbox first: Execute the scripts in a controlled environment (VM/container) to confirm behavior. Inspect generated URLs and verify that public-tunnel steps are manual and require your explicit tokens/configuration before you go public. - Review edge-tts & third-party binaries: edge-tts and 'hear' are third-party programs; verify their source, CLI behavior (whether they send audio/text to external servers), and install them intentionally. The SKILL does recommend fetching hear from GitHub releases — confirm checksums/limits before placing binaries into ~/.local/bin. If you want, I can enumerate the specific missing script names found in the SKILL.md and produce a minimal checklist of commands to safely test the local-only mode (server bound to localhost, no tunnels) in a sandbox.
功能分析
Type: OpenClaw Skill Name: voice-chat-bridge Version: 1.1.0 The skill is classified as suspicious due to several risky capabilities and potential vulnerabilities, though without clear malicious intent. The `SKILL.md` contains instructions for the AI agent to modify its internal state (`connection` and `habits.json`), which is a form of prompt injection vulnerability. Additionally, the installation instructions in `SKILL.md` for the `hear` tool involve downloading and executing a binary from a remote URL (`curl -LO ... unzip ... cp`), posing a supply chain risk if the source were compromised. Finally, the skill leverages tools like Cloudflare Tunnel and Ngrok to expose local services to the public internet, a powerful capability that, while intended for benign purposes (serving voice files), introduces a significant attack surface if misused or if the exposed service were to become vulnerable.
能力评估
Purpose & Capability
Name/description, included scripts (transcribe, generate_voice, voice_server), and declared tools (ffmpeg, edge-tts, optional cloudflared/ngrok) are consistent with a bidirectional voice chat bridge that converts speech→text and text→speech and can serve files over HTTP.
Instruction Scope
SKILL.md refers to many runtime scripts and features (hotkey_recorder.py, voice_chat_loop.py, chat.py, chat-related behavior, habits.json updates, .voice_trigger file) that are not present in the package. It instructs users to open public tunnels (ngrok/cloudflared) and to serve voice files with a HTTP server that intentionally suppresses access logs — this combination raises privacy/exposure concerns because generated voice files could become publicly accessible without obvious logging. The instructions also instruct adding AGENTS.md behaviors (writing to habits.json and emotion updates) that are not implemented here.
Install Mechanism
No formal install spec (instruction-only with shipped scripts). The instructions recommend installing third-party binaries (ffmpeg, cloudflared, ngrok, hear) via brew/npm and downloading hear from a GitHub release — these are common but still involve executing fetched binaries. No archive downloads from obscure servers are present in the provided files.
Credentials
The skill requests no environment variables or credentials in metadata, which matches the included code. However, optional deployment modes (ngrok/cloudflared) require external tokens/credentials that are not listed or discussed in the skill metadata; daily_monitor mentions collecting 'installation data' but only writes local reports (no remote exfiltration in code).
Persistence & Privilege
The skill does not request always:true and does not modify other skills. It creates local state under ~/.openclaw/workspace and ~/.openclaw/workspace/memory. The HTTP server suppresses logging (QuietHTTPRequestHandler), which reduces visibility into external access when used with public tunnels — a design choice with privacy implications but not an explicit elevated privilege.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install voice-chat-bridge
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /voice-chat-bridge 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.1.0
Voice Chat Bridge Skill 1.1.0 adds robust, flexible two-way voice assistant capabilities. - Supports bi-directional voice interaction: speech-to-text transcription + Edge TTS natural speech synthesis. - Multiple deployment options: local playback, web interface, Cloudflare Tunnel, Ngrok, and LocalTunnel for global access. - Integrates with Telegram, Discord, Slack, webhook, and supports both command-line and GUI modes. - Customizable voice, hotkey recording, language selection, and local/online recognition engine support. - Quick setup guides and full conversational loop for hands-free, voice-first experiences.
元数据
Slug voice-chat-bridge
版本 1.1.0
许可证
累计安装 1
当前安装数 1
历史版本数 1
常见问题

Bidirectional Voice Chat System 是什么?

双向语音对话系统 - 语音识别转文字 + Edge TTS语音合成 + Cloudflare Tunnel公网访问. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 426 次。

如何安装 Bidirectional Voice Chat System?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install voice-chat-bridge」即可一键安装,无需额外配置。

Bidirectional Voice Chat System 是免费的吗?

是的,Bidirectional Voice Chat System 完全免费(开源免费),可自由下载、安装和使用。

Bidirectional Voice Chat System 支持哪些平台?

Bidirectional Voice Chat System 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Bidirectional Voice Chat System?

由 patrickgeek(@patrickgeek)开发并维护,当前版本 v1.1.0。

💬 留言讨论