← Back to Skills Marketplace

iMessage Voice Reply

Name: iMessage Voice Reply
Author: bolander72

by Michael Boland · GitHub ↗ · v1.0.3

cross-platform ✓ Security Clean

621

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install imessage-voice-reply

Description

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...

README (SKILL.md)

iMessage Voice Reply

Generate and send native iMessage voice messages using local Kokoro TTS. Voice messages appear as inline playable bubbles with waveforms — identical to voice messages recorded in Messages.app.

How It Works

Your text response → Kokoro TTS (local) → afconvert (native Apple encoder) → CAF/Opus → BlueBubbles → iMessage voice bubble

Setup

bash ${baseDir}/scripts/setup.sh

Installs: kokoro-onnx, soundfile, numpy. Downloads Kokoro models (~136MB) to ~/.cache/kokoro-onnx/.

Requires: BlueBubbles channel configured in OpenClaw (channels.bluebubbles).

Generating and Sending a Voice Reply

Step 1: Generate audio

Write the response text to a temp file, then pass it via --text-file to avoid shell injection:

echo "Your response text here" > /tmp/voice_text.txt
${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text-file /tmp/voice_text.txt --output /tmp/voice_reply.caf

Alternatively, pass text directly (ensure proper shell escaping):

${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text "Your response text here" --output /tmp/voice_reply.caf

Options:

--voice af_heart — Kokoro voice (default: af_heart)
--speed 1.15 — Playback speed (default: 1.15)
--lang en-us — Language code (default: en-us)

Security note: The Python script uses argparse and subprocess.run with list arguments (no shell=True). Input is handled safely within the script. When calling from a shell, prefer --text-file for untrusted input to avoid shell metacharacter issues.

Step 2: Send via BlueBubbles

Use the message tool:

{
  "action": "sendAttachment",
  "channel": "bluebubbles",
  "target": "+1XXXXXXXXXX",
  "path": "/tmp/voice_reply.caf",
  "filename": "Audio Message.caf",
  "contentType": "audio/x-caf",
  "asVoice": true
}

Critical parameters for native voice bubble:

filename must be "Audio Message.caf"
contentType must be "audio/x-caf"
asVoice must be true

All three are required for iMessage to render the message as an inline voice bubble with waveform instead of a file attachment.

Voice Options

Language	Female	Male
English	af_heart ⭐	am_puck
Spanish	ef_dora	em_alex
French	ff_siwis	—
Japanese	jf_alpha	jm_beta
Chinese	zf_xiaobei	zm_yunjian

When to Reply with Voice

Reply with a voice message when:

The user sent you a voice message (voice-for-voice)
The user explicitly asks for an audio/voice response

Always include a text reply alongside the voice message for accessibility.

Audio Format

macOS: CAF container, Opus codec, 48kHz mono, 32kbps — encoded by Apple's native afconvert. Identical to what Messages.app produces.
Fallback: MP3 via ffmpeg (works but may not render as native voice bubble on all iMessage versions).

Cost

$0. Kokoro TTS runs entirely locally. No API calls for voice generation.

Troubleshooting

Voice message shows as file attachment — Ensure all three parameters are set: filename="Audio Message.caf", contentType="audio/x-caf", asVoice=true.

First word clipped — The script prepends 150ms silence automatically. If still clipped, increase the silence pad in the script.

Kokoro model not found — Run bash ${baseDir}/scripts/setup.sh.

afconvert not found — Only available on macOS. Script falls back to ffmpeg/MP3 on Linux.

Usage Guidance

This skill appears coherent and matches its description, but review these practical points before installing: 1) setup.sh will pip install kokoro-onnx (and deps) from PyPI and will cause Kokoro to download ~136MB of models into ~/.cache/kokoro-onnx — expect network activity and disk usage. 2) Confirm you trust the kokoro-onnx PyPI package and its model source (audit upstream if needed). 3) The script uses tempfile.mktemp (potential race-condition/insecure temp path pattern) — not an immediate red flag but worth noting if you run in multi-user environments. 4) On non-macOS the setup requires ffmpeg; on macOS it uses afconvert. 5) The skill does not request credentials, but it uses your configured BlueBubbles channel to send attachments — ensure BlueBubbles is configured and trusted. If you want extra safety, run setup in an isolated environment (container or VM) and inspect/verify the kokoro-onnx package and downloaded model files before use.

Capability Analysis

Type: OpenClaw Skill Name: imessage-voice-reply Version: 1.0.3 The OpenClaw AgentSkills skill bundle for 'imessage-voice-reply' is classified as benign. The code and documentation are transparent, well-structured, and align with the stated purpose of generating local TTS voice messages for iMessage. Key security indicators include the use of virtual environments for dependencies, safe `subprocess.run` calls with list arguments in `scripts/generate_voice_reply.py` to prevent shell injection, and explicit advice in `SKILL.md` to prefer `--text-file` for untrusted input. Model downloads are performed to a standard user cache directory (`~/.cache/kokoro-onnx/`), and no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the agent was found.

Capability Assessment

✓ Purpose & Capability

Name/description (iMessage voice replies) align with included scripts and instructions: a local Kokoro TTS pipeline, audio encoding (afconvert/ffmpeg), and use of a BlueBubbles channel to send the resulting CAF/Opus payload. No unrelated services, credentials, or binaries are requested.

ℹ Instruction Scope

SKILL.md stays on-task: it instructs setup (venv and pip install), model download to ~/.cache/kokoro-onnx, generating audio with the Python script, and sending via BlueBubbles. It does require network access during pip install and model download; the instructions do not attempt to read unrelated system configuration or secrets. The doc correctly warns about shell escaping and recommends --text-file for untrusted input.

ℹ Install Mechanism

There is no packaged install spec, but setup.sh runs python3 -m venv and pip installs kokoro-onnx, soundfile, and numpy from PyPI. This is a standard mechanism but does involve pulling code and (indirectly) models from the network; kokoro-onnx appears to fetch models when instantiated. No arbitrary URL downloads or archive extraction from unknown hosts are present in the repo files.

✓ Credentials

The skill requests no environment variables or credentials. It expects an existing BlueBubbles channel configured in OpenClaw to send the message (that is reasonable and proportional). The scripts write to the user's home cache (~/.cache/kokoro-onnx) and /tmp, which is expected for local model storage and temp audio files.

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills, and only installs a venv under the skill directory and model files under the user's cache. It runs only when invoked by the user/agent (default behavior).

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install imessage-voice-reply
After installation, invoke the skill by name or use /imessage-voice-reply
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.3

Fix display name

v1.0.2

imessage-voice-reply 1.0.2 - No file or documentation changes detected in this release. - No new features, bug fixes, or updates introduced.

v1.0.1

- Added support for reading input text from a file using the `--text-file` argument to improve security and avoid shell injection. - Updated documentation to prefer `--text-file` for untrusted input and clarified direct `--text` usage requires careful shell escaping. - Noted that the Python script now safely handles input with argparse and subprocess (no shell=True). - Minor documentation updates to reflect new usage and strengthen security guidance.

v1.0.0

Initial release of iMessage Voice Reply: - Send native iMessage voice messages generated via local Kokoro-ONNX TTS. - Produces playable voice bubbles with waveform (CAF/Opus format) instead of file attachments. - Requires BlueBubbles channel in OpenClaw for sending messages. - All TTS processing runs locally for zero ongoing cost. - Supports multiple voices and languages; customizable voice, speed, and language options. - Ensures standard iMessage voice message appearance and behavior for recipients.

Metadata

Slug imessage-voice-reply

Version 1.0.3

License —

All-time Installs 1

Active Installs 1

Total Versions 4

Frequently Asked Questions

What is iMessage Voice Reply?

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f... It is an AI Agent Skill for Claude Code / OpenClaw, with 621 downloads so far.

How do I install iMessage Voice Reply?

Run "/install imessage-voice-reply" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is iMessage Voice Reply free?

Yes, iMessage Voice Reply is completely free (open-source). You can download, install and use it at no cost.

Which platforms does iMessage Voice Reply support?

iMessage Voice Reply is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created iMessage Voice Reply?

It is built and maintained by Michael Boland (@bolander72); the current version is v1.0.3.

More Skills

iMessage Voice Reply

iMessage Voice Reply

How It Works

Setup

Generating and Sending a Voice Reply

Step 1: Generate audio

Step 2: Send via BlueBubbles

Voice Options

When to Reply with Voice

Audio Format

Cost

Troubleshooting

What is iMessage Voice Reply?

How do I install iMessage Voice Reply?

Is iMessage Voice Reply free?

Which platforms does iMessage Voice Reply support?

Who created iMessage Voice Reply?

💬 Comments