Multimodal Content Creator
/install multimodal-content-creator
A WhatsApp-powered content creation workflow that lets customers send text or voice messages and receive AI-generated images in return.
How It Works
- Receive a WhatsApp message (text or voice note)
- Transcribe voice notes using OpenAI Whisper
- Generate an image from the prompt using DALL-E 3
- Reply to the customer with the generated image
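The four steps above can be sketched as a single function. This is a minimal illustration, not the code in scripts/workflow.py: the `IncomingMessage` fields and the three injected callables (`transcribe`, `generate_image`, `send_image`) are hypothetical names chosen here. In a real run they would presumably wrap the Whisper call, the DALL-E 3 call, and the WhatsApp client.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class IncomingMessage:
    """Hypothetical shape of a WhatsApp message; field names are assumptions."""
    sender: str
    text: Optional[str] = None        # set for text messages
    audio_path: Optional[str] = None  # set for voice notes


def handle_message(
    msg: IncomingMessage,
    transcribe: Callable[[str], str],
    generate_image: Callable[[str], str],
    send_image: Callable[[str, str], None],
) -> str:
    """Run one message through the four steps and return the prompt used."""
    # Steps 1-2: use the text directly, or transcribe the voice note.
    if msg.text:
        prompt = msg.text.strip()
    elif msg.audio_path:
        prompt = transcribe(msg.audio_path).strip()
    else:
        raise ValueError("message has neither text nor audio")
    if not prompt:
        raise ValueError("empty prompt")
    # Step 3: turn the prompt into an image reference (URL or local path).
    image_ref = generate_image(prompt)
    # Step 4: reply to the original sender.
    send_image(msg.sender, image_ref)
    return prompt
```

Passing the three stages in as callables keeps the orchestration testable without network access or API keys.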
Prerequisites
- OpenAI API key set as the OPENAI_API_KEY environment variable
- WhatsApp CLI authentication (python wacli.py login <token>)
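A script can check the first prerequisite up front so that a missing key fails early with a readable message. The helper below is a hypothetical sketch (the name `require_env` is not taken from the skill's scripts):

```python
import os


def require_env(name: str) -> str:
    """Return the variable's value, or raise with a readable message."""
    value = os.environ.get(name, "").strip()
    if not value:
        raise RuntimeError(f"{name} is not set; export it before running the scripts")
    return value


# Typical use at the top of a script (left as a comment so that importing
# this sketch does not fail on machines where the key is not configured):
# api_key = require_env("OPENAI_API_KEY")
```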
Usage
# Process all unread WhatsApp messages
python scripts/workflow.py process-all
# Generate a single image
python scripts/generate_images.py "a cat riding a skateboard"
# Batch generate from prompts file
python scripts/generate_images.py prompts.txt
# Transcribe an audio file
python scripts/transcribe.py recording.mp3
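The usage above shows generate_images.py accepting either a literal prompt or a prompts file as its one argument. How the script tells them apart is not documented here; one plausible approach, sketched below under that assumption, is to treat the argument as a file when it exists on disk and as a single prompt otherwise. The function name `load_prompts` and the one-prompt-per-line file format are hypothetical.

```python
import os
from typing import List


def load_prompts(arg: str) -> List[str]:
    """Treat arg as a prompts file if it exists on disk, else as one prompt."""
    if os.path.isfile(arg):
        with open(arg, encoding="utf-8") as handle:
            lines = handle.read().splitlines()
        # Assumed format: one prompt per line; skip blanks and '#' comments.
        return [
            line.strip()
            for line in lines
            if line.strip() and not line.lstrip().startswith("#")
        ]
    return [arg]
```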
Files
- scripts/workflow.py: Main orchestration script
- scripts/generate_images.py: DALL-E 3 image generation
- scripts/transcribe.py: Whisper audio transcription (with chunking for large files)
- scripts/wacli.py: WhatsApp CLI client
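The "chunking for large files" noted for scripts/transcribe.py is presumably there because the hosted Whisper API limits upload size (25 MB per file at the time of writing, as far as OpenAI's documentation states). The script's actual strategy is not shown, so the following is only a sketch of one common approach: plan fixed-length time windows, transcribe each window separately, and join the text. `plan_chunks`, `join_transcripts`, and the 600-second default are hypothetical.

```python
from typing import List, Tuple


def plan_chunks(total_seconds: float, chunk_seconds: float = 600.0) -> List[Tuple[float, float]]:
    """Split [0, total_seconds) into consecutive (start, end) windows."""
    if total_seconds < 0 or chunk_seconds <= 0:
        raise ValueError("total_seconds must be >= 0 and chunk_seconds > 0")
    windows: List[Tuple[float, float]] = []
    start = 0.0
    while start < total_seconds:
        # The final window is clipped to the end of the recording.
        end = min(start + chunk_seconds, total_seconds)
        windows.append((start, end))
        start = end
    return windows


def join_transcripts(parts: List[str]) -> str:
    """Concatenate per-chunk transcripts, dropping empty pieces."""
    return " ".join(part.strip() for part in parts if part.strip())
```

Cutting at fixed times can split a word across two chunks; a more careful version might overlap windows slightly or cut at silences.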
Installation
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install multimodal-content-creator
- After installation, invoke the skill by name or use /multimodal-content-creator
- Provide the required inputs per the skill's parameter spec and get structured output
What is Multimodal Content Creator?
Multimodal content creation workflow — receive WhatsApp messages (text or voice), transcribe audio via Whisper, generate images with DALL-E 3, and reply auto... It is an AI Agent Skill for Claude Code / OpenClaw, with 33 downloads so far.
How do I install Multimodal Content Creator?
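Run "/install multimodal-content-creator" in the OpenClaw or Claude Code chat to install it in one step. Before first use, set the OPENAI_API_KEY environment variable and authenticate the WhatsApp CLI, as listed under Prerequisites.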
Is Multimodal Content Creator free?
Yes, Multimodal Content Creator is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Multimodal Content Creator support?
Multimodal Content Creator is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created Multimodal Content Creator?
It is built and maintained by terrycarter1985 (@terrycarter1985); the current version is v1.0.0.