/install comfyui-tts
ComfyUI TTS Skill
Generate speech audio using ComfyUI's Qwen-TTS service. This skill allows you to convert text to speech through ComfyUI's API.
Configuration
Environment Variables
Set these environment variables to configure the ComfyUI connection:
export COMFYUI_HOST="localhost" # ComfyUI server host
export COMFYUI_PORT="8188" # ComfyUI server port
export COMFYUI_OUTPUT_DIR="" # Optional: Custom output directory
Usage
Basic Text-to-Speech
Generate audio from text using default settings:
scripts/tts.sh "你好,世界"
Advanced Options
Customize voice characteristics:
# Specify character and style
scripts/tts.sh "你好" --character "Girl" --style "Emotional"
# Change model size
scripts/tts.sh "你好" --model "3B"
# Specify output file
scripts/tts.sh "你好" --output "/path/to/output.wav"
# Combine options
scripts/tts.sh "你好,这是测试" \
--character "Girl" \
--style "Emotional" \
--model "1.7B" \
--output "~/audio/test.wav"
Available Options
| Option | Description | Default |
|---|---|---|
--character |
Voice character (Girl/Boy/etc.) | "Girl" |
--style |
Speaking style (Emotional/Neutral/etc.) | "Emotional" |
--model |
Model size (0.5B/1.7B/3B) | "1.7B" |
--output |
Output file path | Auto-generated |
--temperature |
Generation temperature (0-1) | 0.9 |
--top-p |
Top-p sampling | 0.9 |
--top-k |
Top-k sampling | 50 |
Workflow
The skill performs these steps:
- Construct Workflow: Builds a ComfyUI workflow JSON with your text and settings
- Submit Job: Sends the workflow to ComfyUI's
/promptendpoint - Poll Status: Monitors job completion via
/historyendpoint - Retrieve Audio: Returns the path to the generated audio file
Troubleshooting
Connection Refused
- Verify ComfyUI is running:
curl http://$COMFYUI_HOST:$COMFYUI_PORT/system_stats - Check host and port settings
Job Timeout
- Large models (3B) take longer to generate
- Try smaller models (0.5B, 1.7B) for faster results
Output Not Found
- Check ComfyUI's output directory configuration
- Verify file permissions
API Reference
The skill uses ComfyUI's native API endpoints:
POST /prompt- Submit workflowGET /history- Check job status- Output files are saved to ComfyUI's configured output directory
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install comfyui-tts - After installation, invoke the skill by name or use
/comfyui-tts - Provide required inputs per the skill's parameter spec and get structured output
What is ComfyUI TTS?
Convert text to speech audio via ComfyUI's Qwen-TTS API, supporting customizable voice, style, model, and output options. It is an AI Agent Skill for Claude Code / OpenClaw, with 908 downloads so far.
How do I install ComfyUI TTS?
Run "/install comfyui-tts" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is ComfyUI TTS free?
Yes, ComfyUI TTS is completely free (open-source). You can download, install and use it at no cost.
Which platforms does ComfyUI TTS support?
ComfyUI TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created ComfyUI TTS?
It is built and maintained by YHSI5358 (@yhsi5358); the current version is v1.0.0.