
Aliyun Cosyvoice Voice Design

Author: cinience · GitHub · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 102
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install aliyun-cosyvoice-voice-design
Description
Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
Usage instructions (SKILL.md)

Category: provider

Model Studio CosyVoice Voice Design

Use the CosyVoice voice enrollment API to create designed voices from a natural-language voice description.

Critical model names

Use model="voice-enrollment" and one of these target_model values:

  • cosyvoice-v3.5-plus
  • cosyvoice-v3.5-flash
  • cosyvoice-v3-plus
  • cosyvoice-v3-flash

Recommended default in this repo:

  • target_model="cosyvoice-v3.5-plus"

Region and compatibility

  • cosyvoice-v3.5-plus and cosyvoice-v3.5-flash are available only in China mainland deployment mode (Beijing endpoint).
  • In international deployment mode (Singapore endpoint), cosyvoice-v3-plus and cosyvoice-v3-flash do not support voice cloning or voice design.
  • The target_model must be the same model that is later used for speech synthesis with the designed voice.
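
The compatibility rules above can be checked before any request is sent. The following is a minimal sketch, not part of the skill: the function names and the deployment labels "china-mainland" and "international" are hypothetical, and it assumes that all four listed models are accepted in China mainland mode, which the notes imply but do not state outright.

```python
# Model names are copied from the lists above; the deployment labels
# "china-mainland" and "international" are hypothetical names for this sketch.
V35_MODELS = frozenset({"cosyvoice-v3.5-plus", "cosyvoice-v3.5-flash"})
V3_MODELS = frozenset({"cosyvoice-v3-plus", "cosyvoice-v3-flash"})


def check_voice_design_support(target_model: str, deployment: str) -> None:
    """Raise ValueError when the combination is ruled out by the notes above."""
    if target_model not in V35_MODELS | V3_MODELS:
        raise ValueError(f"unknown target_model: {target_model!r}")
    if deployment == "international":
        if target_model in V35_MODELS:
            raise ValueError(f"{target_model} is only available in China mainland mode")
        raise ValueError(f"{target_model} does not support voice design in international mode")
    if deployment != "china-mainland":
        raise ValueError(f"unknown deployment mode: {deployment!r}")
    # Assumption: every listed model is accepted in China mainland mode.


def check_synthesis_match(target_model: str, synthesis_model: str) -> None:
    """A designed voice must be synthesized with the model it was created for."""
    if target_model != synthesis_model:
        raise ValueError(
            f"voice was designed for {target_model}, not {synthesis_model}"
        )
```

Under these rules, none of the listed models passes the check in international mode, so a caller would fail fast instead of waiting for an API error.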

Endpoint

  • Domestic: https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization
  • International: https://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customization
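
One way to keep the two URLs from being mixed up is a small lookup helper. This is an illustrative sketch only: the URLs are the ones listed above, while the function name and the mode keys "domestic" and "international" are assumptions made for the example.

```python
# URLs are copied from the list above; the mode keys are hypothetical labels.
CUSTOMIZATION_ENDPOINTS = {
    "domestic": "https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization",
    "international": "https://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customization",
}


def customization_endpoint(mode: str = "domestic") -> str:
    """Return the customization endpoint for the given deployment mode."""
    try:
        return CUSTOMIZATION_ENDPOINTS[mode]
    except KeyError:
        known = ", ".join(sorted(CUSTOMIZATION_ENDPOINTS))
        raise ValueError(f"unknown mode {mode!r}; expected one of: {known}") from None
```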

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
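
The lookup order described above (environment variable first, then the credentials file) might be sketched as follows. The skill does not document the format of ~/.alibabacloud/credentials, so this example assumes an INI-style file with at least one section header; that assumption, the function name load_api_key, and its parameters are all hypothetical.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def load_api_key(
    credentials_path: Optional[str] = None,
    env: Optional[Mapping[str, str]] = None,
) -> Optional[str]:
    """Return the DashScope API key, or None when it cannot be found."""
    env = os.environ if env is None else env
    key = env.get("DASHSCOPE_API_KEY")
    if key:
        return key

    path = (
        Path(credentials_path)
        if credentials_path
        else Path.home() / ".alibabacloud" / "credentials"
    )
    # Assumption: the credentials file is INI-formatted with section headers.
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path, encoding="utf-8")  # a missing file is silently skipped
    except configparser.Error:
        return None  # the file exists but is not in the assumed INI format
    for section in parser.sections():
        if parser.has_option(section, "dashscope_api_key"):
            return parser.get(section, "dashscope_api_key")
    return None
```

Returning None rather than raising leaves it to the caller to decide whether a missing key is an error, which suits the helper script's offline use.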

Normalized interface (cosyvoice.voice_design)

Request

  • model (string, optional): fixed to voice-enrollment
  • target_model (string, optional): default cosyvoice-v3.5-plus
  • prefix (string, required): letters/digits only, max 10 chars
  • voice_prompt (string, required): max 500 chars, Chinese or English only
  • preview_text (string, required): max 200 chars, Chinese or English
  • language_hints (array[string], optional): zh or en, and should match preview_text
  • sample_rate (int, optional): e.g. 24000
  • response_format (string, optional): e.g. wav
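
The field constraints above lend themselves to client-side validation before a request is built. The sketch below is not the skill's helper script: the function name is hypothetical, lengths are counted with Python's len() (Unicode code points), and "letters" is assumed to mean ASCII letters, neither of which the field list confirms.

```python
import re
from typing import Optional, Sequence

# Assumption: "letters/digits only" means ASCII letters and digits.
PREFIX_RE = re.compile(r"[A-Za-z0-9]{1,10}")


def build_voice_design_request(
    prefix: str,
    voice_prompt: str,
    preview_text: str,
    target_model: str = "cosyvoice-v3.5-plus",
    language_hints: Optional[Sequence[str]] = None,
    sample_rate: Optional[int] = None,
    response_format: Optional[str] = None,
) -> dict:
    """Validate the normalized fields and return them as a request dict."""
    if not PREFIX_RE.fullmatch(prefix):
        raise ValueError("prefix must be 1-10 letters or digits")
    if not 0 < len(voice_prompt) <= 500:
        raise ValueError("voice_prompt must be 1-500 characters")
    if not 0 < len(preview_text) <= 200:
        raise ValueError("preview_text must be 1-200 characters")
    hints = list(language_hints or [])
    if any(hint not in ("zh", "en") for hint in hints):
        raise ValueError("language_hints entries must be 'zh' or 'en'")

    request = {
        "model": "voice-enrollment",  # fixed value per the field list
        "target_model": target_model,
        "prefix": prefix,
        "voice_prompt": voice_prompt,
        "preview_text": preview_text,
    }
    # Optional fields are included only when the caller supplies them.
    if hints:
        request["language_hints"] = hints
    if sample_rate is not None:
        request["sample_rate"] = int(sample_rate)
    if response_format is not None:
        request["response_format"] = response_format
    return request
```

The returned dict follows the normalized interface only; how it maps onto the wire format of the customization endpoint is not covered here.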

Response

  • voice_id (string)
  • request_id (string)
  • status (string, optional)
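
A caller could map the normalized response onto a small typed record so that a missing voice_id fails loudly. This is a sketch under the assumption that the response has already been normalized into a flat dict with the three fields above; the class and function names are hypothetical.

```python
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class VoiceDesignResult:
    voice_id: str
    request_id: str
    status: Optional[str] = None  # optional per the field list


def parse_voice_design_response(payload: Mapping[str, object]) -> VoiceDesignResult:
    """Build a result record, rejecting payloads that lack a required field."""
    missing = [name for name in ("voice_id", "request_id") if not payload.get(name)]
    if missing:
        raise ValueError(f"response is missing: {', '.join(missing)}")
    status = payload.get("status")
    return VoiceDesignResult(
        voice_id=str(payload["voice_id"]),
        request_id=str(payload["request_id"]),
        status=None if status is None else str(status),
    )
```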

Operational guidance

  • Keep voice_prompt concrete: timbre, age range, pace, emotion, articulation, and scenario.
  • If language_hints is used, it should match the language of preview_text.
  • Designed voice names include a -vd- marker in the generated backend naming convention.
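
Two of these points can be approximated in code. The sketch below uses a rough heuristic of its own (any CJK ideograph means "zh", otherwise "en") to suggest a language hint for preview_text, and a substring test for the -vd- marker. Neither function comes from the skill or the API, and both names are hypothetical.

```python
def guess_language_hint(preview_text: str) -> str:
    """Suggest 'zh' if the text contains a CJK ideograph, otherwise 'en'.

    This is a coarse heuristic for illustration; mixed-language text is
    always reported as 'zh'.
    """
    for ch in preview_text:
        if "\u4e00" <= ch <= "\u9fff":  # CJK Unified Ideographs block
            return "zh"
    return "en"


def is_designed_voice(voice_id: str) -> bool:
    """Check for the -vd- marker that designed voice names are said to include."""
    return "-vd-" in voice_id
```

A caller might compare guess_language_hint(preview_text) with the supplied language_hints entry and warn when they differ.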

Local helper script

Prepare a normalized request JSON:

python skills/ai/audio/aliyun-cosyvoice-voice-design/scripts/prepare_cosyvoice_design_request.py \
  --target-model cosyvoice-v3.5-plus \
  --prefix announcer \
  --voice-prompt "沉稳的中年男性播音员,低沉有磁性,语速平稳,吐字清晰。" \
  --preview-text "各位听众朋友,大家好,欢迎收听晚间新闻。" \
  --language-hint zh

Validation

mkdir -p output/aliyun-cosyvoice-voice-design
for f in skills/ai/audio/aliyun-cosyvoice-voice-design/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/aliyun-cosyvoice-voice-design/validate.txt

Pass criteria: command exits 0 and output/aliyun-cosyvoice-voice-design/validate.txt is generated.

Output and evidence

  • Save artifacts, command outputs, and API response summaries under output/aliyun-cosyvoice-voice-design/.
  • Include target_model, prefix, voice_prompt, and preview_text in the evidence file.
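
An evidence file with the four required fields could be written as in the sketch below. The output directory is the one named above; the file name evidence.json, the JSON layout, and the function name are assumptions made for illustration. Since the prompt and preview text are stored verbatim, they should not contain secrets or sensitive personal data.

```python
import json
from pathlib import Path

EVIDENCE_FIELDS = ("target_model", "prefix", "voice_prompt", "preview_text")


def write_evidence(
    request: dict,
    response_summary: dict,
    out_dir: str = "output/aliyun-cosyvoice-voice-design",
    filename: str = "evidence.json",  # hypothetical file name
) -> Path:
    """Write the required request fields and a response summary as JSON."""
    missing = [field for field in EVIDENCE_FIELDS if field not in request]
    if missing:
        raise ValueError(f"request is missing: {', '.join(missing)}")

    directory = Path(out_dir)
    directory.mkdir(parents=True, exist_ok=True)

    # Copy only the documented fields so credentials never reach the file.
    record = {field: request[field] for field in EVIDENCE_FIELDS}
    record["response"] = response_summary

    path = directory / filename
    # ensure_ascii=False keeps Chinese prompts readable in the file.
    path.write_text(json.dumps(record, ensure_ascii=False, indent=2), encoding="utf-8")
    return path
```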

References

  • references/api_reference.md
  • references/sources.md
Security recommendations
This skill looks like a focused helper for creating Alibaba Cloud CosyVoice enrollment requests, but there are a few things to check before installing or running it:

  • Ask the publisher to update the package metadata to declare DASHSCOPE_API_KEY (or equivalent) as a required credential, so you know what will be needed and why. The SKILL.md currently says to set DASHSCOPE_API_KEY or use ~/.alibabacloud/credentials, but the registry lists no env vars.
  • Confirm how and when your API key will be used. The helper script only prepares JSON and does not call the network itself, so ensure your agent or other tooling will call only the listed Dashscope endpoints and that those calls are limited in scope (least-privilege API key).
  • Be aware that the skill instructs saving the provided voice_prompt and preview_text to output/aliyun-cosyvoice-voice-design/. Do not include secrets or sensitive PII in those fields if you care about privacy.
  • If you will run this in an automated agent, review how the agent invokes the skill and whether request/response artifacts are uploaded or transmitted elsewhere.

If you need higher assurance, run the included Python script locally, inspect the generated JSON, and only provide an API key with minimal permissions for voice enrollment before using it in production.
Functional analysis
Type: OpenClaw Skill
Name: aliyun-cosyvoice-voice-design
Version: 1.0.0

The skill is a legitimate utility for interacting with the Alibaba Cloud CosyVoice voice design API. The core logic in `scripts/prepare_cosyvoice_design_request.py` is limited to formatting JSON requests and writing them to a local output directory, with no evidence of data exfiltration, malicious execution, or prompt injection attacks.
Capability assessment
Purpose & Capability
The skill's name, description, SKILL.md, references, and helper script all consistently target Alibaba Cloud CosyVoice voice-design enrollment. Requiring an Alibaba Cloud API key (DASHSCOPE_API_KEY or ~/.alibabacloud/credentials) is appropriate for this purpose — however the registry metadata declares no required environment variables or primary credential, which is inconsistent.
Instruction Scope
The SKILL.md instructs building a JSON enrollment request, validating the helper script, and saving outputs under output/aliyun-cosyvoice-voice-design/. It lists only the Aliyun endpoints. This scope is narrowly focused on CosyVoice design, but it explicitly instructs saving voice_prompt and preview_text to local evidence files (possible sensitive user text), which users should be aware of.
Install Mechanism
No install spec — instruction-only plus a small helper Python script. Nothing is downloaded or written by an installer, which is low risk.
Credentials
SKILL.md requires DASHSCOPE_API_KEY or credentials in ~/.alibabacloud/credentials to call the Dashscope endpoints, which is proportionate to the stated purpose. But the skill's declared metadata lists no required env vars or primary credential — a clear mismatch. The helper script itself does not read credentials, so it's unclear when/how the credentials are used; the omission reduces transparency and is a risk.
Persistence & Privilege
always is false and the skill is user-invocable; it does not request persistent or elevated platform privileges and does not modify other skills' configs. The helper script writes files only under a local output path.
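How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install aliyun-cosyvoice-voice-design
  3. Once installation finishes, invoke the skill by name or trigger it with /aliyun-cosyvoice-voice-design
  4. Provide the inputs described in the skill's parameter documentation to get structured output.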
Version history
v1.0.0
Initial release of aliyun-cosyvoice-voice-design.

  • Provides a normalized interface to design custom voices using Alibaba Cloud Model Studio CosyVoice, especially with v3.5 models.
  • Documents required fields, supported model names, API endpoints, and deployment region compatibility.
  • Offers detailed operational guidance for effective voice prompt creation and validation.
  • Includes a local helper script for preparing and validating request JSONs.
  • Supplies output management and evidence collection instructions.
Metadata
Slug: aliyun-cosyvoice-voice-design
Version: 1.0.0
License: MIT-0
Total installs: 0
Current installs: 0
Versions: 1
FAQ

What is Aliyun Cosyvoice Voice Design?

Use when designing custom voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from... It is an AI agent skill plugin for Claude Code / OpenClaw, with 102 total downloads to date.

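How do I install Aliyun Cosyvoice Voice Design?

Run the command "/install aliyun-cosyvoice-voice-design" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra configuration, although using the skill requires the DASHSCOPE_API_KEY credential described under Prerequisites.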

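Is Aliyun Cosyvoice Voice Design free?

Yes. Aliyun Cosyvoice Voice Design is completely free and released under the MIT-0 license, so it can be freely downloaded, installed, and used.

Which platforms does Aliyun Cosyvoice Voice Design support?

Aliyun Cosyvoice Voice Design is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who develops Aliyun Cosyvoice Voice Design?

It is developed and maintained by cinience (@cinience). The current version is v1.0.0.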
