← 返回 Skills 市场

iMessage Voice Reply

Name: iMessage Voice Reply
Author: bolander72

作者 Michael Boland · GitHub ↗ · v1.0.3

cross-platform ✓ 安全检测通过

621

总下载

当前安装

版本数

在 OpenClaw 中安装

/install imessage-voice-reply

功能描述

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...

使用说明 (SKILL.md)

iMessage Voice Reply

Generate and send native iMessage voice messages using local Kokoro TTS. Voice messages appear as inline playable bubbles with waveforms — identical to voice messages recorded in Messages.app.

How It Works

Your text response → Kokoro TTS (local) → afconvert (native Apple encoder) → CAF/Opus → BlueBubbles → iMessage voice bubble

Setup

bash ${baseDir}/scripts/setup.sh

Installs: kokoro-onnx, soundfile, numpy. Downloads Kokoro models (~136MB) to ~/.cache/kokoro-onnx/.

Requires: BlueBubbles channel configured in OpenClaw (channels.bluebubbles).

Generating and Sending a Voice Reply

Step 1: Generate audio

Write the response text to a temp file, then pass it via --text-file to avoid shell injection:

echo "Your response text here" > /tmp/voice_text.txt
${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text-file /tmp/voice_text.txt --output /tmp/voice_reply.caf

Alternatively, pass text directly (ensure proper shell escaping):

${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text "Your response text here" --output /tmp/voice_reply.caf

Options:

--voice af_heart — Kokoro voice (default: af_heart)
--speed 1.15 — Playback speed (default: 1.15)
--lang en-us — Language code (default: en-us)

Security note: The Python script uses argparse and subprocess.run with list arguments (no shell=True). Input is handled safely within the script. When calling from a shell, prefer --text-file for untrusted input to avoid shell metacharacter issues.

Step 2: Send via BlueBubbles

Use the message tool:

{
  "action": "sendAttachment",
  "channel": "bluebubbles",
  "target": "+1XXXXXXXXXX",
  "path": "/tmp/voice_reply.caf",
  "filename": "Audio Message.caf",
  "contentType": "audio/x-caf",
  "asVoice": true
}

Critical parameters for native voice bubble:

filename must be "Audio Message.caf"
contentType must be "audio/x-caf"
asVoice must be true

All three are required for iMessage to render the message as an inline voice bubble with waveform instead of a file attachment.

Voice Options

Language	Female	Male
English	af_heart ⭐	am_puck
Spanish	ef_dora	em_alex
French	ff_siwis	—
Japanese	jf_alpha	jm_beta
Chinese	zf_xiaobei	zm_yunjian

When to Reply with Voice

Reply with a voice message when:

The user sent you a voice message (voice-for-voice)
The user explicitly asks for an audio/voice response

Always include a text reply alongside the voice message for accessibility.

Audio Format

macOS: CAF container, Opus codec, 48kHz mono, 32kbps — encoded by Apple's native afconvert. Identical to what Messages.app produces.
Fallback: MP3 via ffmpeg (works but may not render as native voice bubble on all iMessage versions).

Cost

$0. Kokoro TTS runs entirely locally. No API calls for voice generation.

Troubleshooting

Voice message shows as file attachment — Ensure all three parameters are set: filename="Audio Message.caf", contentType="audio/x-caf", asVoice=true.

First word clipped — The script prepends 150ms silence automatically. If still clipped, increase the silence pad in the script.

Kokoro model not found — Run bash ${baseDir}/scripts/setup.sh.

afconvert not found — Only available on macOS. Script falls back to ffmpeg/MP3 on Linux.

安全使用建议

This skill appears coherent and matches its description, but review these practical points before installing: 1) setup.sh will pip install kokoro-onnx (and deps) from PyPI and will cause Kokoro to download ~136MB of models into ~/.cache/kokoro-onnx — expect network activity and disk usage. 2) Confirm you trust the kokoro-onnx PyPI package and its model source (audit upstream if needed). 3) The script uses tempfile.mktemp (potential race-condition/insecure temp path pattern) — not an immediate red flag but worth noting if you run in multi-user environments. 4) On non-macOS the setup requires ffmpeg; on macOS it uses afconvert. 5) The skill does not request credentials, but it uses your configured BlueBubbles channel to send attachments — ensure BlueBubbles is configured and trusted. If you want extra safety, run setup in an isolated environment (container or VM) and inspect/verify the kokoro-onnx package and downloaded model files before use.

功能分析

Type: OpenClaw Skill Name: imessage-voice-reply Version: 1.0.3 The OpenClaw AgentSkills skill bundle for 'imessage-voice-reply' is classified as benign. The code and documentation are transparent, well-structured, and align with the stated purpose of generating local TTS voice messages for iMessage. Key security indicators include the use of virtual environments for dependencies, safe `subprocess.run` calls with list arguments in `scripts/generate_voice_reply.py` to prevent shell injection, and explicit advice in `SKILL.md` to prefer `--text-file` for untrusted input. Model downloads are performed to a standard user cache directory (`~/.cache/kokoro-onnx/`), and no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the agent was found.

能力评估

✓ Purpose & Capability

Name/description (iMessage voice replies) align with included scripts and instructions: a local Kokoro TTS pipeline, audio encoding (afconvert/ffmpeg), and use of a BlueBubbles channel to send the resulting CAF/Opus payload. No unrelated services, credentials, or binaries are requested.

ℹ Instruction Scope

SKILL.md stays on-task: it instructs setup (venv and pip install), model download to ~/.cache/kokoro-onnx, generating audio with the Python script, and sending via BlueBubbles. It does require network access during pip install and model download; the instructions do not attempt to read unrelated system configuration or secrets. The doc correctly warns about shell escaping and recommends --text-file for untrusted input.

ℹ Install Mechanism

There is no packaged install spec, but setup.sh runs python3 -m venv and pip installs kokoro-onnx, soundfile, and numpy from PyPI. This is a standard mechanism but does involve pulling code and (indirectly) models from the network; kokoro-onnx appears to fetch models when instantiated. No arbitrary URL downloads or archive extraction from unknown hosts are present in the repo files.

✓ Credentials

The skill requests no environment variables or credentials. It expects an existing BlueBubbles channel configured in OpenClaw to send the message (that is reasonable and proportional). The scripts write to the user's home cache (~/.cache/kokoro-onnx) and /tmp, which is expected for local model storage and temp audio files.

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills, and only installs a venv under the skill directory and model files under the user's cache. It runs only when invoked by the user/agent (default behavior).

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install imessage-voice-reply
安装完成后，直接呼叫该 Skill 的名称或使用 /imessage-voice-reply 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.3

Fix display name

v1.0.2

imessage-voice-reply 1.0.2 - No file or documentation changes detected in this release. - No new features, bug fixes, or updates introduced.

v1.0.1

- Added support for reading input text from a file using the `--text-file` argument to improve security and avoid shell injection. - Updated documentation to prefer `--text-file` for untrusted input and clarified direct `--text` usage requires careful shell escaping. - Noted that the Python script now safely handles input with argparse and subprocess (no shell=True). - Minor documentation updates to reflect new usage and strengthen security guidance.

v1.0.0

Initial release of iMessage Voice Reply: - Send native iMessage voice messages generated via local Kokoro-ONNX TTS. - Produces playable voice bubbles with waveform (CAF/Opus format) instead of file attachments. - Requires BlueBubbles channel in OpenClaw for sending messages. - All TTS processing runs locally for zero ongoing cost. - Supports multiple voices and languages; customizable voice, speed, and language options. - Ensures standard iMessage voice message appearance and behavior for recipients.

元数据

Slug imessage-voice-reply

版本 1.0.3

许可证 —

累计安装 1

当前安装数 1

历史版本数 4

常见问题