← 返回 Skills 市场

Vocal Chat

Name: Vocal Chat
Author: rubenfb23

作者 Rubén Fernández Boullón · GitHub ↗ · v1.0.0

cross-platform ⚠ suspicious

3639

总下载

当前安装

版本数

在 OpenClaw 中安装

/install vocal-chat

功能描述

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type.

安全使用建议

Before installing or enabling this skill, verify the following: (1) Confirm which binaries and scripts it requires (ffmpeg, whisper-cpp, sherpa-onnx-tts, tools/transcribe_voice.sh, bin/sherpa-onnx-tts) and install them from trusted sources — the manifest currently lists none. (2) Ensure your agent actually has a 'message' tool and WhatsApp integration set up and understand what credentials or API access that requires; the skill does not declare any credentials. (3) Ask the publisher to update the manifest to list required binaries, install instructions, and any needed credentials. (4) Consider running the skill in a sandbox or test account first — audio processing can involve sensitive content, and the skill assumes local filesystem access which could fail or be abused. (5) Note the performance constraint (RTF < 0.5) may be unrealistic for local models and could lead to degraded behavior; confirm resource needs. If the publisher cannot clarify these gaps, treat the skill as untrusted.

功能分析

Type: OpenClaw Skill Name: vocal-chat Version: 1.0.0 The skill bundle describes a 'walkie-talkie' mode for an AI agent, enabling voice-to-voice conversations using local transcription and text-to-speech. The `SKILL.md` clearly outlines the workflow, triggers, and constraints, explicitly stating the use of 'local tools only' (ffmpeg, whisper-cpp, sherpa-onnx-tts). There is no evidence of intentional harmful behavior such as data exfiltration, malicious execution, persistence, or prompt injection attempts against the agent from the skill's instructions themselves. The instructions are aligned with the stated purpose and do not exhibit high-risk behaviors.

能力评估

⚠ Purpose & Capability

The description (voice-to-voice on WhatsApp) is plausible, but the manifest declares no required binaries, no install steps, and no WhatsApp integration credentials or endpoints. The SKILL.md explicitly requires local tools (ffmpeg, whisper-cpp, sherpa-onnx-tts) and scripts (tools/transcribe_voice.sh, bin/sherpa-onnx-tts) which are not declared in the registry metadata. That mismatch is disproportionate to the claimed purpose and means the skill may fail or assume access it hasn't requested.

⚠ Instruction Scope

The instructions tell the agent to run local scripts and binaries and to send audio via a `message` tool, but they do not explain how incoming audio is surfaced to the agent, where the scripts come from, or what the `message` tool's required parameters/permissions are. The SKILL.md restricts use to 'local tools only' (no cloud) and asks the agent to always return both text and audio — no steps ask to read unrelated files or environment variables, but the instructions assume filesystem and binary access that aren't guaranteed.

ℹ Install Mechanism

There is no install spec (instruction-only), which lowers install risk. However, the skill depends on external binaries and scripts that would need to be present on the host. The lack of an install mechanism or references to known release sources means the agent or operator must manually install/verify those dependencies; that operational gap is noteworthy but not inherently malicious.

ℹ Credentials

The skill declares no environment variables or credentials, which is consistent with its claim to use local-only tools. However, because it targets WhatsApp conversations, the absence of any declared messaging/WhatsApp credential or integration details is suspicious — the skill assumes the agent has access to a messaging tool capable of sending files but doesn't declare what access is required.

✓ Persistence & Privilege

The skill does not request always:true and uses default invocation settings. It does not attempt to modify system-wide settings in the provided instructions. No persistence or elevated platform privileges are requested in the manifest.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install vocal-chat
安装完成后，直接呼叫该 Skill 的名称或使用 /vocal-chat 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

- Updated skill.

元数据

Slug vocal-chat

版本 1.0.0

许可证 —

累计安装 24

当前安装数 21

历史版本数 1

常见问题

Vocal Chat 是什么？

Handles voice-to-voice conversations on WhatsApp. Automatically transcribes incoming audio and responds with local TTS audio. Use when the user wants to "talk" instead of type. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 3639 次。

如何安装 Vocal Chat？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install vocal-chat」即可一键安装，无需额外配置。

Vocal Chat 是免费的吗？

是的，Vocal Chat 完全免费（开源免费），可自由下载、安装和使用。

Vocal Chat 支持哪些平台？

Vocal Chat 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Vocal Chat？

由 Rubén Fernández Boullón（@rubenfb23）开发并维护，当前版本 v1.0.0。