← Back to Skills Marketplace
onimka

Free voice from Comfy UI + Qwen3 TTS

by Onimka · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
2252
Downloads
0
Stars
8
Active Installs
1
Versions
Install in OpenClaw
/install free-voice
Description
Generate Russian male voice audio using ComfyUI with Qwen3 TTS node and save as MP3 for voice messages.
README (SKILL.md)

TTS через ComfyUI

Описание

Навык для генерации аудио через ComfyUI с использованием узла AILab_Qwen3TTSVoiceDesign_Advanced.

Требования

  • ComfyUI с установленным плагином AILab_Qwen3TTSVoiceDesign_Advanced
  • Модель Qwen3
  • Папка E:\Ai\Comfy UI\output\ для сохранения аудио

Запуск ComfyUI

Если ComfyUI не запущена, использовать:

cmd /c start "" "C:\Users\user\Desktop\ComfyUI.lnk"

Ожидание готовности

Проверять доступность по http://localhost:8000 каждые 10 секунд до готовности.

Формат запроса

{
  "prompt": {
    "50": {
      "inputs": {
        "filename_prefix": "qwen3-tts/[UNIQUE_ID]",
        "quality": "320k",
        "audioUI": "",
        "audio": ["55", 0]
      },
      "class_type": "SaveAudioMP3",
      "_meta": {"title": "Сохранить аудио (MP3)"}
    },
    "55": {
      "inputs": {
        "text": "[TEXT]",
        "instruct": "A male voice with a slightly hoarse, warm tone, speaking in a confident and friendly manner.",
        "model_size": "1.7B",
        "device": "auto",
        "precision": "bf16",
        "language": "Russian",
        "max_new_tokens": 2048,
        "do_sample": true,
        "top_p": 0.9,
        "top_k": 50,
        "temperature": 0.9,
        "repetition_penalty": 1,
        "unload_models": false,
        "seed": -1
      },
      "class_type": "AILab_Qwen3TTSVoiceDesign_Advanced",
      "_meta": {"title": "Qwen3 TTS VoiceDesign (Advanced)"}
    }
  }
}

Обработка

  1. Отправить POST-запрос на http://localhost:8000/prompt
  2. Получить prompt_id
  3. Ждать завершения в /history/[prompt_id]
  4. Найти файл в E:\Ai\Comfy UI\output\[filename_prefix]*.mp3
  5. Отправить как голосовое сообщение через message(asVoice=true)
Usage Guidance
This skill is coherent for local TTS generation, but check these before installing: 1) Ensure you actually have ComfyUI, the AILab_Qwen3TTS plugin, and the Qwen3 model installed and licensed as required. 2) Confirm the hard-coded paths/shortcut match your machine (the SKILL.md uses C:\Users\user\Desktop\ComfyUI.lnk and E:\Ai\Comfy UI\output\); if not, edit the instructions or don't install. 3) Be aware the skill will try to start a local application and read a local folder (it needs permission to launch processes and access the filesystem). 4) The skill only talks to localhost, but generated audio may be sent out by the agent via your messaging function (message(asVoice=true))—consider whether that's acceptable for the content you plan to synthesize. 5) If you run this in an environment without those local resources, the skill will fail rather than silently do something else.
Capability Analysis
Type: OpenClaw Skill Name: free-voice Version: 1.0.0 The skill is classified as suspicious due to the direct command execution instruction `cmd /c start "" "C:\Users\user\Desktop\ComfyUI.lnk"` found in `SKILL.md`. While this command is plausibly intended to launch a local application (ComfyUI) required for the skill's function, direct shell command execution by an AI agent represents a significant security risk. Additionally, the skill instructs the agent to access the local file system to retrieve generated audio from `E:\Ai\Comfy UI\output\`, which, while necessary for its purpose, contributes to the overall elevated risk profile.
Capability Assessment
Purpose & Capability
The skill's name (ComfyUI + Qwen3 TTS) matches the instructions: it needs ComfyUI, the AILab_Qwen3TTSVoiceDesign_Advanced node, the Qwen3 model, and an output folder to save mp3 files. Nothing requested appears unrelated to the stated TTS purpose.
Instruction Scope
Instructions are narrowly scoped: start ComfyUI (via a desktop shortcut command), poll http://localhost:8000, POST a JSON prompt, wait for completion, then read the generated MP3 from a local output folder and send it as a voice message. Minor concerns: the instructions use hard-coded, system-specific paths (C:\Users\user\Desktop\ComfyUI.lnk and E:\Ai\Comfy UI\output\) which will fail or behave unexpectedly on other machines; the skill will attempt to launch a local shortcut and read a local folder, so it requires the agent/runtime to have access to launch processes and read that path.
Install Mechanism
No install spec or downloaded code; this is instruction-only, so nothing is written to disk by the skill itself and no external packages are fetched.
Credentials
No environment variables, credentials, or config paths are requested. The only resources referenced are local ComfyUI service on localhost and a local output directory—these are proportional to generating and retrieving TTS output.
Persistence & Privilege
always is false and the skill does not request persistent system-wide changes or cross-skill configuration. It will launch a local process (ComfyUI) and access local files, which is expected for this functionality.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install free-voice
  3. After installation, invoke the skill by name or use /free-voice
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release of TTS skill for generating audio via ComfyUI using the AILab_Qwen3TTSVoiceDesign_Advanced node. - Requires pre-configured ComfyUI with the specified plugin and Qwen3 model. - Defines detailed audio generation and saving steps, including request/response workflow. - Output audio is saved to a specific folder and delivered as a voice message. - Instruction for automatic ComfyUI startup and server readiness monitoring included.
Metadata
Slug free-voice
Version 1.0.0
License
All-time Installs 8
Active Installs 8
Total Versions 1
Frequently Asked Questions

What is Free voice from Comfy UI + Qwen3 TTS?

Generate Russian male voice audio using ComfyUI with Qwen3 TTS node and save as MP3 for voice messages. It is an AI Agent Skill for Claude Code / OpenClaw, with 2252 downloads so far.

How do I install Free voice from Comfy UI + Qwen3 TTS?

Run "/install free-voice" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Free voice from Comfy UI + Qwen3 TTS free?

Yes, Free voice from Comfy UI + Qwen3 TTS is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Free voice from Comfy UI + Qwen3 TTS support?

Free voice from Comfy UI + Qwen3 TTS is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Free voice from Comfy UI + Qwen3 TTS?

It is built and maintained by Onimka (@onimka); the current version is v1.0.0.

💬 Comments