← 返回 Skills 市场
aliyx

qwen-audio-lab

作者 aliyx · GitHub ↗ · v0.0.1 · MIT-0
cross-platform ⚠ suspicious
221
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install qwen-audio-lab
功能描述
Hybrid text-to-speech, reusable voice cloning, and narrated audio generation for macOS plus Aliyun Qwen. Use when the user wants to convert text into speech,...
安全使用建议
What to consider before installing: - The skill does what it claims (local macOS 'say' + remote Qwen/DashScope TTS and voice-clone). However, the package metadata did NOT declare the required DASHSCOPE_API_KEY even though SKILL.md and the script require it — treat that as a red flag (metadata should match runtime requirements). - The script will make network calls to DashScope endpoints (https://dashscope.aliyuncs.com and https://dashscope-intl.aliyuncs.com). Only provide an API key if you trust the endpoint and the skill source. - The skill stores outputs and remembered-voice state under ~/.openclaw/data/qwen-audio-lab; verify you are comfortable with that directory being created/written. - For some operations (audio trimming) ffmpeg is required, and local playback uses macOS 'say' — these are normal but will invoke subprocesses. - Voice cloning can have legal/consent implications. The SKILL.md recommends asking for permission; you should enforce that policy yourself before cloning third-party voices. - Because the skill source is 'unknown' and the registry metadata is inconsistent, prefer to inspect the full script locally (ensure the truncated portion contains only TTS/manage-voice logic) or obtain the skill from a trusted publisher before supplying credentials. If you proceed, limit the scope/permissions of the API key (if possible) and monitor network activity.
功能分析
Type: OpenClaw Skill Name: qwen-audio-lab Version: 0.0.1 The skill bundle provides a legitimate interface for Aliyun Qwen's text-to-speech and voice cloning services, including macOS local speech integration. The script `scripts/qwen_audio.py` uses standard Python libraries (urllib, subprocess, zipfile) to interact with the DashScope API and process audio files. It includes proper input sanitization for filenames and uses list-based subprocess calls to prevent shell injection. No evidence of data exfiltration, malicious execution, or prompt injection was found.
能力评估
Purpose & Capability
The name/description (macOS + Aliyun Qwen TTS, voice cloning, narrated PPTs) matches what the code and SKILL.md implement: local 'say' playback, Qwen TTS calls, voice cloning/design endpoints, and local storage of outputs and remembered voices. However, the registry metadata lists no required environment variables or primary credential while both SKILL.md and the code require DASHSCOPE_API_KEY — this metadata omission is an incoherence to be aware of.
Instruction Scope
The SKILL.md instructions and the included script remain focused on TTS/voice workflows. They reference only task-relevant files/paths (user home ~/.openclaw/data/qwen-audio-lab for outputs/state), optional ffmpeg for trimming, and network calls to DashScope (Aliyun) APIs. There is no instruction to read unrelated system files, shell history, or to exfiltrate arbitrary data.
Install Mechanism
This is an instruction-only skill with an included Python script and no install spec; nothing is downloaded from external URLs during install. Runtime will execute local scripts and may call external network endpoints. No archive downloads or remote installers were specified.
Credentials
The code and SKILL.md require DASHSCOPE_API_KEY (plus optional QWEN_AUDIO_REGION, QWEN_AUDIO_OUTPUT_DIR, QWEN_AUDIO_STATE_DIR), but the registry metadata declared no required env vars or primary credential. This mismatch is concerning because the skill needs an API key to access remote TTS/voice-cloning services; the package should declare that requirement explicitly. Aside from the missing declaration, the environment access requested by the script (API key + optional dirs) is proportionate to the stated purpose.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or global configs. It writes state and outputs under ~/.openclaw/data/qwen-audio-lab (its own directory) which is normal for persistent skill state.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install qwen-audio-lab
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /qwen-audio-lab 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.0.1
Initial release: Hybrid text-to-speech skill for macOS and Aliyun Qwen, with support for voice cloning and narrated file generation. - Provides text-to-speech via both local macOS and Aliyun Qwen backends. - Supports cloning and reusing voices from user-supplied audio samples. - Generates narration audio from plain text, text files, or PPT speaker notes. - Offers easy high-level commands for narration, as well as legacy commands for backward compatibility. - Adds environment variables for API keys, output directories, and state management.
元数据
Slug qwen-audio-lab
版本 0.0.1
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

qwen-audio-lab 是什么?

Hybrid text-to-speech, reusable voice cloning, and narrated audio generation for macOS plus Aliyun Qwen. Use when the user wants to convert text into speech,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 221 次。

如何安装 qwen-audio-lab?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install qwen-audio-lab」即可一键安装,无需额外配置。

qwen-audio-lab 是免费的吗?

是的,qwen-audio-lab 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

qwen-audio-lab 支持哪些平台?

qwen-audio-lab 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 qwen-audio-lab?

由 aliyx(@aliyx)开发并维护,当前版本 v0.0.1。

💬 留言讨论