← Back to Skills Marketplace
bolander72

iMessage Voice Reply

by Michael Boland · GitHub ↗ · v1.0.3
cross-platform ✓ Security Clean
621
Downloads
0
Stars
1
Active Installs
4
Versions
Install in OpenClaw
/install imessage-voice-reply
Description
Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f...
README (SKILL.md)

iMessage Voice Reply

Generate and send native iMessage voice messages using local Kokoro TTS. Voice messages appear as inline playable bubbles with waveforms — identical to voice messages recorded in Messages.app.

How It Works

Your text response → Kokoro TTS (local) → afconvert (native Apple encoder) → CAF/Opus → BlueBubbles → iMessage voice bubble

Setup

bash ${baseDir}/scripts/setup.sh

Installs: kokoro-onnx, soundfile, numpy. Downloads Kokoro models (~136MB) to ~/.cache/kokoro-onnx/.

Requires: BlueBubbles channel configured in OpenClaw (channels.bluebubbles).

Generating and Sending a Voice Reply

Step 1: Generate audio

Write the response text to a temp file, then pass it via --text-file to avoid shell injection:

echo "Your response text here" > /tmp/voice_text.txt
${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text-file /tmp/voice_text.txt --output /tmp/voice_reply.caf

Alternatively, pass text directly (ensure proper shell escaping):

${baseDir}/.venv/bin/python ${baseDir}/scripts/generate_voice_reply.py --text "Your response text here" --output /tmp/voice_reply.caf

Options:

  • --voice af_heart — Kokoro voice (default: af_heart)
  • --speed 1.15 — Playback speed (default: 1.15)
  • --lang en-us — Language code (default: en-us)

Security note: The Python script uses argparse and subprocess.run with list arguments (no shell=True). Input is handled safely within the script. When calling from a shell, prefer --text-file for untrusted input to avoid shell metacharacter issues.

Step 2: Send via BlueBubbles

Use the message tool:

{
  "action": "sendAttachment",
  "channel": "bluebubbles",
  "target": "+1XXXXXXXXXX",
  "path": "/tmp/voice_reply.caf",
  "filename": "Audio Message.caf",
  "contentType": "audio/x-caf",
  "asVoice": true
}

Critical parameters for native voice bubble:

  • filename must be "Audio Message.caf"
  • contentType must be "audio/x-caf"
  • asVoice must be true

All three are required for iMessage to render the message as an inline voice bubble with waveform instead of a file attachment.

Voice Options

Language Female Male
English af_heart ⭐ am_puck
Spanish ef_dora em_alex
French ff_siwis
Japanese jf_alpha jm_beta
Chinese zf_xiaobei zm_yunjian

When to Reply with Voice

Reply with a voice message when:

  • The user sent you a voice message (voice-for-voice)
  • The user explicitly asks for an audio/voice response

Always include a text reply alongside the voice message for accessibility.

Audio Format

  • macOS: CAF container, Opus codec, 48kHz mono, 32kbps — encoded by Apple's native afconvert. Identical to what Messages.app produces.
  • Fallback: MP3 via ffmpeg (works but may not render as native voice bubble on all iMessage versions).

Cost

$0. Kokoro TTS runs entirely locally. No API calls for voice generation.

Troubleshooting

Voice message shows as file attachment — Ensure all three parameters are set: filename="Audio Message.caf", contentType="audio/x-caf", asVoice=true.

First word clipped — The script prepends 150ms silence automatically. If still clipped, increase the silence pad in the script.

Kokoro model not found — Run bash ${baseDir}/scripts/setup.sh.

afconvert not found — Only available on macOS. Script falls back to ffmpeg/MP3 on Linux.

Usage Guidance
This skill appears coherent and matches its description, but review these practical points before installing: 1) setup.sh will pip install kokoro-onnx (and deps) from PyPI and will cause Kokoro to download ~136MB of models into ~/.cache/kokoro-onnx — expect network activity and disk usage. 2) Confirm you trust the kokoro-onnx PyPI package and its model source (audit upstream if needed). 3) The script uses tempfile.mktemp (potential race-condition/insecure temp path pattern) — not an immediate red flag but worth noting if you run in multi-user environments. 4) On non-macOS the setup requires ffmpeg; on macOS it uses afconvert. 5) The skill does not request credentials, but it uses your configured BlueBubbles channel to send attachments — ensure BlueBubbles is configured and trusted. If you want extra safety, run setup in an isolated environment (container or VM) and inspect/verify the kokoro-onnx package and downloaded model files before use.
Capability Analysis
Type: OpenClaw Skill Name: imessage-voice-reply Version: 1.0.3 The OpenClaw AgentSkills skill bundle for 'imessage-voice-reply' is classified as benign. The code and documentation are transparent, well-structured, and align with the stated purpose of generating local TTS voice messages for iMessage. Key security indicators include the use of virtual environments for dependencies, safe `subprocess.run` calls with list arguments in `scripts/generate_voice_reply.py` to prevent shell injection, and explicit advice in `SKILL.md` to prefer `--text-file` for untrusted input. Model downloads are performed to a standard user cache directory (`~/.cache/kokoro-onnx/`), and no evidence of data exfiltration, malicious execution, persistence mechanisms, or prompt injection attempts against the agent was found.
Capability Assessment
Purpose & Capability
Name/description (iMessage voice replies) align with included scripts and instructions: a local Kokoro TTS pipeline, audio encoding (afconvert/ffmpeg), and use of a BlueBubbles channel to send the resulting CAF/Opus payload. No unrelated services, credentials, or binaries are requested.
Instruction Scope
SKILL.md stays on-task: it instructs setup (venv and pip install), model download to ~/.cache/kokoro-onnx, generating audio with the Python script, and sending via BlueBubbles. It does require network access during pip install and model download; the instructions do not attempt to read unrelated system configuration or secrets. The doc correctly warns about shell escaping and recommends --text-file for untrusted input.
Install Mechanism
There is no packaged install spec, but setup.sh runs python3 -m venv and pip installs kokoro-onnx, soundfile, and numpy from PyPI. This is a standard mechanism but does involve pulling code and (indirectly) models from the network; kokoro-onnx appears to fetch models when instantiated. No arbitrary URL downloads or archive extraction from unknown hosts are present in the repo files.
Credentials
The skill requests no environment variables or credentials. It expects an existing BlueBubbles channel configured in OpenClaw to send the message (that is reasonable and proportional). The scripts write to the user's home cache (~/.cache/kokoro-onnx) and /tmp, which is expected for local model storage and temp audio files.
Persistence & Privilege
The skill does not request always:true, does not modify other skills, and only installs a venv under the skill directory and model files under the user's cache. It runs only when invoked by the user/agent (default behavior).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install imessage-voice-reply
  3. After installation, invoke the skill by name or use /imessage-voice-reply
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.3
Fix display name
v1.0.2
imessage-voice-reply 1.0.2 - No file or documentation changes detected in this release. - No new features, bug fixes, or updates introduced.
v1.0.1
- Added support for reading input text from a file using the `--text-file` argument to improve security and avoid shell injection. - Updated documentation to prefer `--text-file` for untrusted input and clarified direct `--text` usage requires careful shell escaping. - Noted that the Python script now safely handles input with argparse and subprocess (no shell=True). - Minor documentation updates to reflect new usage and strengthen security guidance.
v1.0.0
Initial release of iMessage Voice Reply: - Send native iMessage voice messages generated via local Kokoro-ONNX TTS. - Produces playable voice bubbles with waveform (CAF/Opus format) instead of file attachments. - Requires BlueBubbles channel in OpenClaw for sending messages. - All TTS processing runs locally for zero ongoing cost. - Supports multiple voices and languages; customizable voice, speed, and language options. - Ensures standard iMessage voice message appearance and behavior for recipients.
Metadata
Slug imessage-voice-reply
Version 1.0.3
License
All-time Installs 1
Active Installs 1
Total Versions 4
Frequently Asked Questions

What is iMessage Voice Reply?

Send voice message replies in iMessage using local Kokoro-ONNX TTS. Generates native iMessage voice bubbles (CAF/Opus) that play inline with waveform — not f... It is an AI Agent Skill for Claude Code / OpenClaw, with 621 downloads so far.

How do I install iMessage Voice Reply?

Run "/install imessage-voice-reply" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is iMessage Voice Reply free?

Yes, iMessage Voice Reply is completely free (open-source). You can download, install and use it at no cost.

Which platforms does iMessage Voice Reply support?

iMessage Voice Reply is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created iMessage Voice Reply?

It is built and maintained by Michael Boland (@bolander72); the current version is v1.0.3.

💬 Comments