← 返回 Skills 市场

qwen-audio-lab

Name: qwen-audio-lab
Author: aliyx

作者 aliyx · GitHub ↗ · v0.0.1 · MIT-0

cross-platform ⚠ suspicious

221

总下载

当前安装

版本数

在 OpenClaw 中安装

/install qwen-audio-lab

功能描述

Hybrid text-to-speech, reusable voice cloning, and narrated audio generation for macOS plus Aliyun Qwen. Use when the user wants to convert text into speech,...

安全使用建议

What to consider before installing: - The skill does what it claims (local macOS 'say' + remote Qwen/DashScope TTS and voice-clone). However, the package metadata did NOT declare the required DASHSCOPE_API_KEY even though SKILL.md and the script require it — treat that as a red flag (metadata should match runtime requirements). - The script will make network calls to DashScope endpoints (https://dashscope.aliyuncs.com and https://dashscope-intl.aliyuncs.com). Only provide an API key if you trust the endpoint and the skill source. - The skill stores outputs and remembered-voice state under ~/.openclaw/data/qwen-audio-lab; verify you are comfortable with that directory being created/written. - For some operations (audio trimming) ffmpeg is required, and local playback uses macOS 'say' — these are normal but will invoke subprocesses. - Voice cloning can have legal/consent implications. The SKILL.md recommends asking for permission; you should enforce that policy yourself before cloning third-party voices. - Because the skill source is 'unknown' and the registry metadata is inconsistent, prefer to inspect the full script locally (ensure the truncated portion contains only TTS/manage-voice logic) or obtain the skill from a trusted publisher before supplying credentials. If you proceed, limit the scope/permissions of the API key (if possible) and monitor network activity.

功能分析

Type: OpenClaw Skill Name: qwen-audio-lab Version: 0.0.1 The skill bundle provides a legitimate interface for Aliyun Qwen's text-to-speech and voice cloning services, including macOS local speech integration. The script `scripts/qwen_audio.py` uses standard Python libraries (urllib, subprocess, zipfile) to interact with the DashScope API and process audio files. It includes proper input sanitization for filenames and uses list-based subprocess calls to prevent shell injection. No evidence of data exfiltration, malicious execution, or prompt injection was found.

能力评估

ℹ Purpose & Capability

The name/description (macOS + Aliyun Qwen TTS, voice cloning, narrated PPTs) matches what the code and SKILL.md implement: local 'say' playback, Qwen TTS calls, voice cloning/design endpoints, and local storage of outputs and remembered voices. However, the registry metadata lists no required environment variables or primary credential while both SKILL.md and the code require DASHSCOPE_API_KEY — this metadata omission is an incoherence to be aware of.

✓ Instruction Scope

The SKILL.md instructions and the included script remain focused on TTS/voice workflows. They reference only task-relevant files/paths (user home ~/.openclaw/data/qwen-audio-lab for outputs/state), optional ffmpeg for trimming, and network calls to DashScope (Aliyun) APIs. There is no instruction to read unrelated system files, shell history, or to exfiltrate arbitrary data.

✓ Install Mechanism

This is an instruction-only skill with an included Python script and no install spec; nothing is downloaded from external URLs during install. Runtime will execute local scripts and may call external network endpoints. No archive downloads or remote installers were specified.

⚠ Credentials

The code and SKILL.md require DASHSCOPE_API_KEY (plus optional QWEN_AUDIO_REGION, QWEN_AUDIO_OUTPUT_DIR, QWEN_AUDIO_STATE_DIR), but the registry metadata declared no required env vars or primary credential. This mismatch is concerning because the skill needs an API key to access remote TTS/voice-cloning services; the package should declare that requirement explicitly. Aside from the missing declaration, the environment access requested by the script (API key + optional dirs) is proportionate to the stated purpose.

✓ Persistence & Privilege

The skill does not request always:true and does not modify other skills or global configs. It writes state and outputs under ~/.openclaw/data/qwen-audio-lab (its own directory) which is normal for persistent skill state.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install qwen-audio-lab
安装完成后，直接呼叫该 Skill 的名称或使用 /qwen-audio-lab 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.0.1

Initial release: Hybrid text-to-speech skill for macOS and Aliyun Qwen, with support for voice cloning and narrated file generation. - Provides text-to-speech via both local macOS and Aliyun Qwen backends. - Supports cloning and reusing voices from user-supplied audio samples. - Generates narration audio from plain text, text files, or PPT speaker notes. - Offers easy high-level commands for narration, as well as legacy commands for backward compatibility. - Adds environment variables for API keys, output directories, and state management.

元数据

Slug qwen-audio-lab

版本 0.0.1

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题

qwen-audio-lab 是什么？

Hybrid text-to-speech, reusable voice cloning, and narrated audio generation for macOS plus Aliyun Qwen. Use when the user wants to convert text into speech,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 221 次。

如何安装 qwen-audio-lab？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install qwen-audio-lab」即可一键安装，无需额外配置。

qwen-audio-lab 是免费的吗？

是的，qwen-audio-lab 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

qwen-audio-lab 支持哪些平台？

qwen-audio-lab 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 qwen-audio-lab？

由 aliyx（@aliyx）开发并维护，当前版本 v0.0.1。