← Back to Skills Marketplace

Voice Memo

Name: Voice Memo
Author: amzzzzzzz

by amzzzzzzz · GitHub ↗ · v0.1.2

cross-platform ⚠ suspicious

482

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install imessage-voice-memo-skill

Description

Send native iMessage voice bubbles with ElevenLabs TTS via BlueBubbles. Use when: user asks to send a voice message, wants something spoken aloud, storytelli...

README (SKILL.md)

Voice Memo

Send native iMessage voice bubbles (not file attachments) using ElevenLabs TTS and BlueBubbles.

Quick Start

Run the script with text and recipient:

scripts/send-voice-memo.sh "Your message here" +14169060839

This will:

Generate TTS audio via ElevenLabs (Rachel voice by default)
Convert to Opus CAF @ 24kHz (iMessage native format)
Send as native voice bubble via BlueBubbles

Requirements

BlueBubbles running locally with Private API enabled
ElevenLabs API key (for TTS)
macOS (for afconvert audio conversion)

Environment variables in ~/.openclaw/.env:

ELEVENLABS_API_KEY=your-key-here
BLUEBUBBLES_PASSWORD=your-password-here
# Optional overrides:
ELEVENLABS_VOICE_ID=21m00Tcm4TlvDq8ikWAM  # Rachel (default)
ELEVENLABS_MODEL_ID=eleven_turbo_v2_5      # Turbo v2.5 (default)

The Working Formula

Critical parameters discovered 2026-02-19:

Parameter	Value	Why
chatGuid	`any;-;+PHONE`	NOT `iMessage;-;` (causes timeouts)
method	`private-api`	Required for native bubble
isAudioMessage	`true`	Required
Audio format	Opus @ 24kHz in CAF	iMessage native format
Pre-convert	Yes	Don't let BlueBubbles convert (wrong codec)

Voice Options

Default voice: Rachel (ElevenLabs)

Voice ID: 21m00Tcm4TlvDq8ikWAM
Model: eleven_turbo_v2_5 (fast, natural)
Cost: ~$0.04 per 30s message

Expressive tags:

[laughs] — natural laughter
[sighs] — expressive sigh
[excited] — energetic delivery

Example: "[excited] Oh my god, it worked!"

For full voice list and IDs, see VOICES.md.

Bidirectional Voice Memos

Sending (Amz → Amy): Use this skill. Native voice bubbles appear with waveform UI.

Receiving (Amy → Amz): BlueBubbles auto-converts incoming voice memos to MP3. OpenClaw transcribes via Whisper. Transcribed text flows into conversation context automatically.

Memory note: Incoming voice memo transcriptions flow into conversation context like any text message. They are NOT automatically persisted to memory or files — the agent must explicitly choose to store them, same as any conversation content. If you want to prevent transcriptions from being retained, instruct the agent not to record voice memo content in memory.

Troubleshooting

Voice bubble arrives as file attachment:

Check method=private-api is set
Verify chatGuid uses any;-; prefix (not iMessage;-;)
Check response has "isAudioMessage": true

API times out:

Use any;-;+PHONE format for chatGuid
Verify BlueBubbles Private API is enabled
Restart BlueBubbles if consistently slow

Audio is 0 seconds / unplayable:

Ensure pre-conversion to Opus @ 24kHz
Don't let BlueBubbles convert (uses wrong codec)
Verify with: afinfo output.caf (should show opus @ 24000 Hz)

Usage Guidance

This skill appears to do what it claims: generate ElevenLabs TTS, convert to Opus CAF, and post to a BlueBubbles Private API to create native iMessage voice bubbles. Before installing: (1) verify the skill's source (homepage is a GitHub repo) and inspect ~/.openclaw/.env to ensure it only contains the expected ELEVENLABS_API_KEY and BLUEBUBBLES_PASSWORD; (2) ensure BLUEBUBBLES_URL points to a trusted local BlueBubbles instance (default is http://127.0.0.1:1234); (3) treat the ElevenLabs API key as billable/privileged — restrict its scope and rotate if shared; (4) be aware that if you set BLUEBUBBLES_URL to a remote endpoint, audio and metadata will be sent there (so only use trusted endpoints); and (5) note the metadata/registry inconsistency (registry omitted required env vars) — this is likely a packaging oversight but worth confirming with the author before granting credentials.

Capability Analysis

Type: OpenClaw Skill Name: imessage-voice-memo-skill Version: 0.1.2 The `scripts/send-voice-memo.sh` file contains a shell injection vulnerability. The `RECIPIENT` variable, which can be controlled by user input, is directly embedded into the `CHAT_GUID` string and then passed as a `--form-string` argument to `curl` without proper shell escaping. This allows an attacker to inject arbitrary shell commands by crafting the recipient phone number, leading to potential remote code execution. There is no evidence of intentional malicious behavior such as data exfiltration or persistence mechanisms.

Capability Assessment

✓ Purpose & Capability

The script implements exactly what the skill describes: it calls ElevenLabs TTS, converts audio with afconvert to Opus CAF, and posts to a BlueBubbles Private API to produce native iMessage voice bubbles. Required binaries (curl, afconvert) and the two service credentials are appropriate for this functionality.

✓ Instruction Scope

SKILL.md and the script confine actions to generating audio, converting it, and sending it to the BlueBubbles endpoint. The script sources ~/.openclaw/.env for credentials and does not attempt to read unrelated system files or exfiltrate arbitrary data. It documents how transcriptions integrate into conversation context and notes that transcriptions are not auto-persisted.

✓ Install Mechanism

No install spec (instruction-only) and a small included shell script. No downloads, archive extraction, or third-party package installs — low install-time risk.

ℹ Credentials

The script requires ELEVENLABS_API_KEY and BLUEBUBBLES_PASSWORD (proportionate to the task). However, registry metadata at the top of the submission lists no required env vars while SKILL.md and the script do — an inconsistency. Also note the script blindly sources ~/.openclaw/.env, which may contain additional secrets; ensure that file contains only intended credentials and is protected.

✓ Persistence & Privilege

always:false and no system-wide configuration changes. The skill can be invoked autonomously (platform default), which is expected for skills; it does not request persistent elevated privileges or modify other skills' configs.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install imessage-voice-memo-skill
After installation, invoke the skill by name or use /imessage-voice-memo-skill
Provide required inputs per the skill's parameter spec and get structured output

Version History

v0.1.2

Version 0.1.2 of imessage-voice-memo-skill - No file changes detected in this release. - No functional, documentation, or metadata updates. - Behavior and usage remain unchanged from the previous version.

v0.1.1

Voice memo skill 0.1.1: metadata updated and memory note clarified. - Updated SKILL metadata: added platform info, refined required bins, and listed necessary credentials. - Clarified treatment of incoming voice memo transcriptions: not automatically persisted to memory; explicit agent action required to save. - No behavioral changes to scripts or user-facing commands.

v0.1.0

imessage-voice-memo-skill v0.1.0 - Initial release: Send native iMessage voice bubbles using ElevenLabs TTS via BlueBubbles. - Supports expressive voice tags and configurable voices. - Ensures proper audio format (Opus CAF @ 24kHz) for native iMessage compatibility. - Includes troubleshooting section for common issues like file attachments or playback errors. - Requires BlueBubbles with Private API and macOS audio tools.

Metadata

Slug imessage-voice-memo-skill

Version 0.1.2

License —

All-time Installs 0

Active Installs 0

Total Versions 3

Frequently Asked Questions

What is Voice Memo?

Send native iMessage voice bubbles with ElevenLabs TTS via BlueBubbles. Use when: user asks to send a voice message, wants something spoken aloud, storytelli... It is an AI Agent Skill for Claude Code / OpenClaw, with 482 downloads so far.

How do I install Voice Memo?

Run "/install imessage-voice-memo-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Voice Memo free?

Yes, Voice Memo is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Voice Memo support?

Voice Memo is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Voice Memo?

It is built and maintained by amzzzzzzz (@amzzzzzzz); the current version is v0.1.2.

More Skills

Voice Memo

Voice Memo

Quick Start

Requirements

The Working Formula

Voice Options

Bidirectional Voice Memos

Troubleshooting

What is Voice Memo?

How do I install Voice Memo?

Is Voice Memo free?

Which platforms does Voice Memo support?

Who created Voice Memo?

💬 Comments