← 返回 Skills 市场
neldar

WebChat Voice GUI

作者 neldar · GitHub ↗ · v0.1.1 · MIT-0
cross-platform ⚠ suspicious
547
总下载
0
收藏
5
当前安装
2
版本数
在 OpenClaw 中安装
/install webchat-voice-gui
功能描述
Voice input and microphone button for OpenClaw WebChat Control UI. Adds a mic button to chat, records audio via browser MediaRecorder, transcribes locally vi...
安全使用建议
This skill appears to do what it says: add a local mic button and send audio to a local faster-whisper endpoint. Before installing: 1) Back up your openclaw/dist/control-ui/index.html (and assets) so you can revert easily. 2) Confirm the registry metadata bug (the UI showed "Required env vars: [object Object]") and verify there are no hidden env/credential requirements. 3) Ensure you trust the local faster-whisper service and webchat-https-proxy it depends on (they receive audio data). 4) Review the included scripts (deploy.sh, inject.sh, handler.ts, uninstall.sh) — they are readable, idempotent, and user-level but will modify the global Control UI and install a hook under ~/.openclaw/hooks. 5) To remove the changes, run scripts/uninstall.sh (it attempts to remove the hook and injected files). If you are uncomfortable with persistent modifications to the Control UI or you cannot easily restore the original index.html, do not install until you have a backup or can test in an isolated environment.
功能分析
Type: OpenClaw Skill Name: webchat-voice-gui Version: 0.1.1 The webchat-voice-gui skill provides a local voice-to-text interface for the OpenClaw WebChat UI. It functions by injecting a client-side script (voice-input.js) into the application's index.html and establishing a gateway startup hook (handler.ts) to ensure the modification persists after updates. The code follows security best practices, such as using same-origin requests for transcription, avoiding eval(), and using safe execution methods (execFileSync) in its hooks. All audio processing is directed to a local faster-whisper service (127.0.0.1:18790), and there is no evidence of data exfiltration or unauthorized remote access.
能力评估
Purpose & Capability
The skill's declared dependencies (webchat-https-proxy and a local faster-whisper STT service) match its description of providing local speech-to-text and using a same-origin /transcribe endpoint. It modifies the Control UI (index.html and assets) and installs a gateway hook so the injection survives updates — this is coherent for a UI-injection skill. One inconsistency: the registry metadata shows "Required env vars: [object Object]" (likely a serialization bug) while the SKILL.md only requires an optional VOICE_LANG; confirm the metadata before installing.
Instruction Scope
SKILL.md and included scripts limit actions to copying JS/i18n files, injecting a <script> tag into the Control UI index.html, and installing a user-level startup hook that reruns the injector. The client JS uses MediaRecorder and posts audio to /transcribe (or localhost:18790 for dev). There are no instructions to read unrelated system files or to send data off-host. The skill does perform DOM operations to insert text/send messages in the chat UI, which is expected for this feature.
Install Mechanism
There is no remote download/install step — deployment is handled by local scripts that copy bundled assets into ~/.openclaw/workspace and the global openclaw control-ui directory (found via npm -g root). That is lower risk than fetching remote code, but the scripts will write into the global Control UI (npm-global) and user hooks (~/.openclaw/hooks). This is expected for a UI patch but is persistent and affects files outside the skill's own directory.
Credentials
No secrets/credentials are requested. The only declared environment-related inputs are optional VOICE_LANG and the optional OPENCLAW_UI_DIR/WORKSPACE overrides used to locate Control UI and workspace paths. The skill checks systemctl state of local services but does not require tokens or keys. The malformed registry metadata for env vars should be clarified.
Persistence & Privilege
The skill installs a gateway startup hook under ~/.openclaw/hooks/voice-input-inject to re-run the injector on gateway startup, and it edits the Control UI index.html and copies an asset into the global openclaw install. This grants persistent, user-level presence that survives updates (intended for this plugin). It does not request always:true or root/sudo. Because it modifies the main Control UI, you should be aware the change affects the broader UI and persists until uninstalled.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install webchat-voice-gui
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /webchat-voice-gui 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.1
MIT license added; remove browser token scraping; restore clean control-ui compatibility.
v0.1.0
Split from webchat-voice-proxy: standalone voice GUI skill with hardened deploy scripts, improved docs, accessibility and i18n sync
元数据
Slug webchat-voice-gui
版本 0.1.1
许可证 MIT-0
累计安装 5
当前安装数 5
历史版本数 2
常见问题

WebChat Voice GUI 是什么?

Voice input and microphone button for OpenClaw WebChat Control UI. Adds a mic button to chat, records audio via browser MediaRecorder, transcribes locally vi... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 547 次。

如何安装 WebChat Voice GUI?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install webchat-voice-gui」即可一键安装,无需额外配置。

WebChat Voice GUI 是免费的吗?

是的,WebChat Voice GUI 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

WebChat Voice GUI 支持哪些平台?

WebChat Voice GUI 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 WebChat Voice GUI?

由 neldar(@neldar)开发并维护,当前版本 v0.1.1。

💬 留言讨论