Walkie-Talkie Mode
Author
Rubén Fernández Boullón
· v1.0.0
Total downloads: 1747
Favorites: 1
Current installs: 2
Versions: 1
Install in OpenClaw
/install walkie-talkie-mode
Description
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.
Security Recommendations
This skill's behavior (transcribe incoming audio, produce local TTS, send .ogg back) matches its description, but the SKILL.md depends on local tools and scripts that are not declared anywhere. Before installing or enabling:
1) Verify that the agent environment actually has the required binaries (ffmpeg, whisper-cpp, sherpa-onnx-tts) and the helper script paths (tools/transcribe_voice.sh, bin/sherpa-onnx-tts).
2) Ask the author to update the metadata to list required binaries, exact paths, and any model files or hardware needs.
3) Confirm that the 'message' tool used to send files is the authorized platform tool (so audio is sent only to the intended chat) and that no unexpected external endpoints are contacted.
4) Review file permissions around /tmp and any model data to avoid exposing unrelated data.
5) Test in a sandboxed agent first. If the required local tools are missing, the skill will fail, and if programs are later created at those paths, it may run them without any vetting.
If you cannot verify or supply these undeclared dependencies, treat this skill as untrusted.
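The dependency check recommended above could be automated along these lines. This is a minimal sketch, not part of the skill: the binary and path names are copied from the review text, the actual executable names on a given system may differ, and `find_missing` is a hypothetical helper introduced only for illustration.

```python
import shutil
from pathlib import Path

# Names copied from the review; real executable names may differ (assumption).
REQUIRED_BINARIES = ["ffmpeg", "whisper-cpp", "sherpa-onnx-tts"]
# Paths are relative to the skill's working directory (assumption).
REQUIRED_PATHS = ["tools/transcribe_voice.sh", "bin/sherpa-onnx-tts"]


def find_missing(binaries=REQUIRED_BINARIES, paths=REQUIRED_PATHS, root="."):
    """Return the binaries not found on PATH plus the files not found under root."""
    missing = [name for name in binaries if shutil.which(name) is None]
    base = Path(root)
    missing += [p for p in paths if not (base / p).is_file()]
    return missing
```

Calling `find_missing()` from the agent's working directory before enabling the skill returns an empty list only when every listed dependency is present; any other result names what still has to be supplied.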
Functional Analysis
Type: OpenClaw Skill
Name: walkie-talkie-mode
Version: 1.0.0
The skill bundle describes a 'walkie-talkie' mode for voice-to-voice conversations, primarily using local tools for transcription and text-to-speech. The `SKILL.md` file clearly outlines the workflow, triggers, and constraints, instructing the agent to execute local scripts (`tools/transcribe_voice.sh`) and binaries (`bin/sherpa-onnx-tts`) for its stated purpose. There is no evidence of prompt injection attempting to subvert the agent, exfiltrate data, establish persistence, or perform other malicious actions. The instructions are straightforward and align with the benign functionality described.
Capability Assessment
Purpose & Capability
The name/description (voice-to-voice WhatsApp) matches the SKILL.md workflow, but the skill metadata declares no required binaries, env vars, or installs while the instructions explicitly require local tooling (ffmpeg, whisper-cpp, sherpa-onnx-tts), a helper script (tools/transcribe_voice.sh), and a local TTS binary (bin/sherpa-onnx-tts). That inconsistency means the skill either omits necessary requirements or assumes access to arbitrary local executables.
Instruction Scope
Runtime instructions tell the agent to run local scripts/binaries and read/write files (e.g., /tmp/reply.ogg) and to use a 'message' tool to send files. These actions are coherent with the stated purpose, but they reference specific local paths and tools not declared in metadata. This grants the skill broad discretion to execute unspecified local programs and rely on local model artifacts.
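One way an operator might reduce the exposure of a fixed path such as /tmp/reply.ogg is to write the reply audio into a private per-run directory instead. The sketch below is an illustration under that assumption, not what the skill does; `make_private_reply_path` and `is_private` are hypothetical helpers, and the permission check applies to POSIX systems.

```python
import tempfile
from pathlib import Path


def make_private_reply_path(filename="reply.ogg"):
    """Create a fresh directory only the current user can access; return a file path in it."""
    # tempfile.mkdtemp creates the directory readable, writable, and
    # searchable only by the creating user.
    workdir = Path(tempfile.mkdtemp(prefix="walkie-"))
    return workdir / filename


def is_private(path):
    """True if neither group nor others have any permission bits set (POSIX)."""
    return (Path(path).stat().st_mode & 0o077) == 0
```

The returned path can then be passed to the TTS step in place of the shared /tmp location, and the directory removed once the audio has been sent.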
Install Mechanism
There is no install spec (lowest install risk), which is fine for an instruction-only skill — but here it's problematic because the skill expects several local binaries and scripts. Because nothing will be installed by the skill, the operator must supply these dependencies; the missing install/dependency declarations are an integrity/usability risk.
Credentials
The skill requests no environment variables or credentials (appropriate). However, it implicitly requires access to local filesystem paths and local model binaries; the SKILL.md does not request or document any permissions or configuration for those resources.
Persistence & Privilege
The skill does not request always:true and does not declare persistent/system-wide changes. It appears to be user-invocable only and does not request elevated persistent privileges.
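How to Use
- Make sure OpenClaw is installed (local or Docker deployment).
- Enter the install command in the chat box: /install walkie-talkie-mode
- Once installation completes, call the Skill by name or trigger it with /walkie-talkie-mode.
- Provide the required input according to the Skill's parameter description to get structured output.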
Version History
v1.0.0
Initial release of walkie-talkie-mode: enables seamless voice-to-voice conversations on WhatsApp.
- Automatically transcribes incoming WhatsApp audio messages using local tools.
- Generates voice note replies using local TTS and replies with both audio and text.
- Activates when users send audio messages or give a command with a phrase such as "activa modo walkie-talkie".
- Prioritizes fast, fully offline processing for privacy and speed.
- Includes manual execution instructions for internal use.
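The flow described in these release notes can be sketched roughly as follows. This is an illustration only: the `Incoming` message shape, the matching rule, and the injected `transcribe`, `compose_reply`, `synthesize`, and `send` callables are all hypothetical stand-ins for the skill's local scripts and the platform's 'message' tool, whose real interfaces the listing does not document.

```python
from dataclasses import dataclass
from typing import Callable

# Trigger phrase quoted in the release notes; the matching rule is an assumption.
TRIGGER_PHRASE = "activa modo walkie-talkie"


@dataclass
class Incoming:
    """Hypothetical message shape, not the platform's real type."""
    kind: str             # "audio" or "text"
    text: str = ""
    audio_path: str = ""


def should_activate(msg: Incoming) -> bool:
    """Activate on any voice note, or on text containing the trigger phrase."""
    if msg.kind == "audio":
        return True
    return TRIGGER_PHRASE in msg.text.casefold()


def handle(
    msg: Incoming,
    transcribe: Callable[[str], str],
    compose_reply: Callable[[str], str],
    synthesize: Callable[[str], str],
    send: Callable[[str, str], None],
) -> bool:
    """Transcribe, compose a reply, synthesize it, then send audio and text together."""
    if not should_activate(msg):
        return False
    heard = transcribe(msg.audio_path) if msg.kind == "audio" else msg.text
    reply_text = compose_reply(heard)
    reply_audio = synthesize(reply_text)   # path of the generated voice note
    send(reply_audio, reply_text)
    return True
```

Passing the steps in as callables keeps the sketch independent of any particular transcription or TTS tool, so each step can be replaced by a stub when testing in a sandbox.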
FAQ
What is Walkie-Talkie Mode?
Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 1747 total downloads to date.
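How do I install Walkie-Talkie Mode?
Run the command `/install walkie-talkie-mode` in the OpenClaw or Claude Code chat box for a one-click install. The install step itself needs no extra configuration, but as the assessment above notes, the local tools the skill relies on (ffmpeg, whisper-cpp, sherpa-onnx-tts) are not installed by the skill and must be supplied by the operator.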
Is Walkie-Talkie Mode free?
Yes. Walkie-Talkie Mode is completely free (free and open source) and can be freely downloaded, installed, and used.
Which platforms does Walkie-Talkie Mode support?
Walkie-Talkie Mode is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed Walkie-Talkie Mode?
Developed and maintained by Rubén Fernández Boullón (@rubenfb23). The current version is v1.0.0.