← 返回 Skills 市场
xiaoyaner0201

Qwen3-TTS VoiceDesign

作者 xiaoyaner0201 · GitHub ↗ · v1.0.0
cross-platform ⚠ suspicious
653
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install qwen3-tts-voicedesign
功能描述
Text-to-speech with Qwen3-TTS VoiceDesign. Design custom voices via natural language descriptions + seed-based timbre fixation. Includes OpenAI-compatible AP...
安全使用建议
This package appears to do what it says: set up a local TTS server, download a voice model, and provide client scripts. Before installing: 1) Expect large downloads (~3.5GB) and pip installing many packages (including torch/CUDA) — run in a controlled environment or VM/container if you don't want changes to your main system. 2) The server clears HTTP(S)_PROXY env vars at startup — if you are on a corporate network that requires a proxy for outbound connections, that may change routing; run behind a firewall or bind the server to 127.0.0.1 (TTS_HOST) if you only need local access. 3) The setup and server will download model data from ModelScope/HuggingFace and install packages from PyPI — verify you trust those sources and the specified model repo. 4) The client shell scripts construct JSON via simple interpolation — avoid passing untrusted/unsanitized text that could break the shell invocation. 5) If you plan to expose the server beyond localhost, secure it (firewall, reverse proxy, auth) because it exposes an HTTP API. If you want more assurance, run setup in an isolated container, inspect the pip-installed packages and the model repo, and avoid enabling systemd/scheduled-task instructions unless you understand the implications.
功能分析
Type: OpenClaw Skill Name: qwen3-tts-voicedesign Version: 1.0.0 The skill bundle is classified as suspicious due to a critical shell injection vulnerability in `scripts/batch_seeds.sh` where the `TEXT` variable is unsafely interpolated into a `curl -d` argument, allowing arbitrary command execution if user input contains shell metacharacters. Additionally, the `SKILL.md` documentation instructs users to set up a Windows scheduled task with `highest privileges` for server auto-restart, which is a significant security risk and persistence mechanism. The `tts_server.py` also defaults to binding on `0.0.0.0`, exposing the service to the network by default.
能力评估
Purpose & Capability
Name/description (Qwen3-TTS VoiceDesign TTS server + client tools) matches the included scripts: a FastAPI server, client helpers, setup script and seed-batching tooling. The declared behavior (model download, one-click setup, OpenAI-compatible API) is consistent with the code.
Instruction Scope
SKILL.md instructs running setup.sh which creates a venv, pip-installs dependencies, downloads the model (ModelScope or Hugging Face), and runs the server; the runtime scripts only reference their .env and local files. Notable scope items: the server code clears proxy environment variables at start (potentially bypassing a corporate proxy), and the docs show guidance to register scheduled tasks or systemd units (these are only instructions, not executed automatically). The client scripts build JSON bodies via shell interpolation (potential for malformed input/escaping issues if used with untrusted text).
Install Mechanism
There is no platform install spec, but setup.sh will pip-install packages (qwen-tts, soundfile, pydub, uvicorn, fastapi, numpy and possibly modelscope and torch from the official PyTorch index). It downloads the ~3.5GB model via ModelScope or Hugging Face. These are expected for a local TTS runtime but do involve network access and large binary downloads; the sources used (ModelScope/HuggingFace, PyTorch wheel index) are standard release hosts rather than arbitrary shorteners.
Credentials
The skill requests no credentials and exposes only environment variables relevant to running a local TTS server (seed, instruct, model path, host/port, format). The only surprising behavior is that the server explicitly clears HTTP(S) proxy environment variables at startup, which may affect network routing on hosts that rely on proxies; this is operational (not credential) behavior and not an attempt to read secrets.
Persistence & Privilege
The skill is not always-enabled and does not attempt to change other skills' config. setup.sh suggests how to create systemd units or a Windows scheduled task, but it does not automatically create system-level services or elevate privileges. You must run setup/start manually, so persistence is user-controlled.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install qwen3-tts-voicedesign
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /qwen3-tts-voicedesign 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release: VoiceDesign voice design via natural language + seed fixation, OpenAI-compatible API server, one-click setup, batch seed exploration
元数据
Slug qwen3-tts-voicedesign
版本 1.0.0
许可证
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Qwen3-TTS VoiceDesign 是什么?

Text-to-speech with Qwen3-TTS VoiceDesign. Design custom voices via natural language descriptions + seed-based timbre fixation. Includes OpenAI-compatible AP... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 653 次。

如何安装 Qwen3-TTS VoiceDesign?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install qwen3-tts-voicedesign」即可一键安装,无需额外配置。

Qwen3-TTS VoiceDesign 是免费的吗?

是的,Qwen3-TTS VoiceDesign 完全免费(开源免费),可自由下载、安装和使用。

Qwen3-TTS VoiceDesign 支持哪些平台?

Qwen3-TTS VoiceDesign 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Qwen3-TTS VoiceDesign?

由 xiaoyaner0201(@xiaoyaner0201)开发并维护,当前版本 v1.0.0。

💬 留言讨论