← Back to Skills Marketplace

Walkie-Talkie Mode

Name: Walkie-Talkie Mode
Author: rubenfb23

by Rubén Fernández Boullón · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

2585

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install walkie-talkie

Description

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

Usage Guidance

This skill's behavior is plausible but inconsistent: SKILL.md requires local scripts/binaries (tools/transcribe_voice.sh, bin/sherpa-onnx-tts, ffmpeg, whisper-cpp) that are neither included nor declared. Before installing or enabling it, verify: 1) where those binaries/scripts will come from and that they are from trusted sources; 2) the exact content of tools/transcribe_voice.sh (so it doesn't run unexpected commands); 3) that you are comfortable the agent can execute local binaries on the host. If you can't audit or control the referenced scripts/binaries, consider not installing or asking the author for a clear dependency list and safe installation instructions. Providing the missing files or explicit dependency declarations (and ideally checksums or official sources) would reduce the concern and could change the assessment to benign.

Capability Analysis

Type: OpenClaw Skill Name: walkie-talkie Version: 1.0.0 The skill bundle describes a 'walkie-talkie' mode for WhatsApp, using local tools for voice transcription and text-to-speech. The `SKILL.md` file outlines a workflow that involves executing local scripts (`tools/transcribe_voice.sh`) and binaries (`bin/sherpa-onnx-tts`) and writing temporary files to `/tmp`. Crucially, the constraints explicitly state 'Use local tools only', which mitigates concerns about remote execution or data exfiltration. There is no evidence of prompt injection, malicious execution, data exfiltration, or other harmful behaviors in the provided content. The described actions are consistent with the stated purpose.

Capability Assessment

⚠ Purpose & Capability

The description (voice-to-voice on WhatsApp using local transcription and TTS) matches the actions described in SKILL.md. However, the skill metadata declares no required binaries or files while the instructions explicitly reference tools/transcribe_voice.sh, bin/sherpa-onnx-tts, ffmpeg, whisper-cpp, and sherpa-onnx-tts. Those are necessary for the stated purpose but are neither included nor declared, which is an inconsistency.

⚠ Instruction Scope

Instructions tell the agent to execute local scripts/binaries and to send .ogg files via the message tool. They do not ask for extra env vars or unrelated files, but they require executing code at host paths (tools/transcribe_voice.sh, bin/sherpa-onnx-tts). Because those files are not provided, the skill will rely on whatever binaries exist on the host—this gives the agent power to run arbitrary local code if those paths are populated.

✓ Install Mechanism

This is instruction-only (no install spec or code). That reduces the risk of the skill dropping arbitrary code during installation. However, the runtime depends on externally installed local binaries which the user must provide.

✓ Credentials

The skill requests no environment variables or credentials, which is proportionate to its described local-only operation. There is no unexplained request for unrelated secrets. Be aware that sending messages via the agent's messaging integration still requires whatever platform credentials the agent normally uses, but those are not requested by this skill.

✓ Persistence & Privilege

The skill is not always-enabled and does not request elevated platform privileges or attempt to modify other skills. Autonomous invocation is allowed (platform default), which is normal and not by itself a red flag.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install walkie-talkie
After installation, invoke the skill by name or use /walkie-talkie
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Walkie-Talkie skill initial release: - Enables automatic voice-to-voice conversations on WhatsApp via local transcription and text-to-speech. - Transcribes incoming audio messages and processes them as prompts. - Responds with both text and synthetic voice audio using local TTS. - Activates when receiving audio messages or upon user request ("activa modo walkie-talkie", "hablemos por voz"). - Uses only local tools for processing and aims to ensure rapid response times.

Metadata

Slug walkie-talkie

Version 1.0.0

License —

All-time Installs 8

Active Installs 6

Total Versions 1

Frequently Asked Questions

What is Walkie-Talkie Mode?

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type. It is an AI Agent Skill for Claude Code / OpenClaw, with 2585 downloads so far.

How do I install Walkie-Talkie Mode?

Run "/install walkie-talkie" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Walkie-Talkie Mode free?

Yes, Walkie-Talkie Mode is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Walkie-Talkie Mode support?

Walkie-Talkie Mode is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Walkie-Talkie Mode?

It is built and maintained by Rubén Fernández Boullón (@rubenfb23); the current version is v1.0.0.

More Skills