Aliyun Cosyvoice Voice Clone

by cinience · GitHub ↗ · v1.0.0 · MIT-0
cross-platform · ⚠ suspicious · 108 downloads · 0 stars · 1 active install · 1 version
Install in OpenClaw
/install aliyun-cosyvoice-voice-clone
Description
Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from...
README (SKILL.md)

Category: provider

Model Studio CosyVoice Voice Clone

Use the CosyVoice voice enrollment API to create cloned voices from public reference audio.

Critical model names

Use model="voice-enrollment" and one of these target_model values:

  • cosyvoice-v3.5-plus
  • cosyvoice-v3.5-flash
  • cosyvoice-v3-plus
  • cosyvoice-v3-flash
  • cosyvoice-v2

Recommended default in this repo:

  • target_model="cosyvoice-v3.5-plus"

Region and compatibility

  • cosyvoice-v3.5-plus and cosyvoice-v3.5-flash are available only in China mainland deployment mode (Beijing endpoint).
  • In international deployment mode (Singapore endpoint), cosyvoice-v3-plus and cosyvoice-v3-flash do not support voice clone/design.
  • The target_model used during enrollment must match the model used later in speech synthesis, otherwise synthesis fails.
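
The three rules above can be checked before any request is sent. The following is a minimal sketch, not part of the skill: the function names and the `"mainland"` / `"international"` labels are hypothetical, and the sets encode only what the list above states.

```python
from typing import List

# Transcribed from the compatibility rules above.
MAINLAND_ONLY = {"cosyvoice-v3.5-plus", "cosyvoice-v3.5-flash"}
NO_CLONE_INTERNATIONAL = {"cosyvoice-v3-plus", "cosyvoice-v3-flash"}


def check_region(target_model: str, deployment: str) -> List[str]:
    """Return the stated rules that this combination violates (empty if none).

    `deployment` is a hypothetical label: "mainland" (Beijing endpoint)
    or "international" (Singapore endpoint).
    """
    if deployment not in ("mainland", "international"):
        raise ValueError(f"unknown deployment mode: {deployment!r}")
    problems: List[str] = []
    if deployment == "international":
        if target_model in MAINLAND_ONLY:
            problems.append(f"{target_model} is available only in China mainland mode")
        if target_model in NO_CLONE_INTERNATIONAL:
            problems.append(f"{target_model} does not support voice clone in international mode")
    return problems


def check_synthesis_model(enrolled_with: str, synthesis_model: str) -> None:
    """Raise if the synthesis model differs from the enrollment target_model."""
    if enrolled_with != synthesis_model:
        raise ValueError(
            f"voice was enrolled with {enrolled_with!r}; "
            f"synthesis with {synthesis_model!r} is expected to fail"
        )
```

An empty list only means that none of the rules listed here is violated; it does not confirm that a model supports cloning in a given region.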

Endpoint

  • Domestic: https://dashscope.aliyuncs.com/api/v1/services/audio/tts/customization
  • International: https://dashscope-intl.aliyuncs.com/api/v1/services/audio/tts/customization

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
  • Provide a public audio URL for the enrollment sample.
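
One way to resolve the key in the order described above is sketched below. The skill does not document the layout of `~/.alibabacloud/credentials`, so this sketch assumes an INI-style file with `dashscope_api_key` under some section; `resolve_api_key` is a hypothetical helper, not part of the skill.

```python
import configparser
import os
from pathlib import Path
from typing import Mapping, Optional


def resolve_api_key(env: Mapping[str, str] = os.environ,
                    credentials_path: Optional[Path] = None) -> str:
    """Return the API key from the environment, else from the credentials file."""
    key = env.get("DASHSCOPE_API_KEY", "").strip()
    if key:
        return key

    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    # ASSUMPTION: the file is INI-style, e.g.
    #   [default]
    #   dashscope_api_key = sk-...
    parser = configparser.ConfigParser(interpolation=None)
    try:
        parser.read(path)  # a missing file is ignored and leaves the parser empty
    except configparser.Error:
        pass  # not INI-style after all; fall through to the error below
    for section in parser.sections():
        value = parser.get(section, "dashscope_api_key", fallback="").strip()
        if value:
            return value
    raise RuntimeError(
        "No API key found: set DASHSCOPE_API_KEY or add dashscope_api_key "
        f"to {path}"
    )
```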

Normalized interface (cosyvoice.voice_clone)

Request

  • model (string, optional): fixed to voice-enrollment
  • target_model (string, optional): default cosyvoice-v3.5-plus
  • prefix (string, required): letters/digits only, max 10 chars
  • voice_sample_url (string, required): public audio URL
  • language_hints (array[string], optional): only first item is used
  • max_prompt_audio_length (float, optional): only for cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-flash
  • enable_preprocess (bool, optional): only for cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-flash
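
The request constraints above can be enforced while building the normalized payload. This is a sketch under stated assumptions: `build_clone_request` is a hypothetical function, "letters/digits only" is read as ASCII, and the output is the flat normalized shape listed here, which is not necessarily what the bundled helper script or the wire-level API uses.

```python
import re
from typing import Any, Dict, List, Optional

TARGET_MODELS = {
    "cosyvoice-v3.5-plus", "cosyvoice-v3.5-flash",
    "cosyvoice-v3-plus", "cosyvoice-v3-flash", "cosyvoice-v2",
}
# Models that accept max_prompt_audio_length and enable_preprocess.
EXTRA_OPTION_MODELS = {"cosyvoice-v3.5-plus", "cosyvoice-v3.5-flash", "cosyvoice-v3-flash"}


def build_clone_request(prefix: str,
                        voice_sample_url: str,
                        target_model: str = "cosyvoice-v3.5-plus",
                        language_hints: Optional[List[str]] = None,
                        max_prompt_audio_length: Optional[float] = None,
                        enable_preprocess: Optional[bool] = None) -> Dict[str, Any]:
    """Validate the inputs and return a normalized request dict."""
    if target_model not in TARGET_MODELS:
        raise ValueError(f"unsupported target_model: {target_model!r}")
    # ASSUMPTION: "letters/digits only" means ASCII letters and digits.
    if not re.fullmatch(r"[A-Za-z0-9]{1,10}", prefix):
        raise ValueError("prefix must be 1-10 letters/digits")
    if not voice_sample_url.startswith(("http://", "https://")):
        raise ValueError("voice_sample_url must be a public http(s) URL")

    request: Dict[str, Any] = {
        "model": "voice-enrollment",  # fixed value
        "target_model": target_model,
        "prefix": prefix,
        "voice_sample_url": voice_sample_url,
    }
    if language_hints:
        request["language_hints"] = language_hints[:1]  # only the first item is used

    extras = {
        "max_prompt_audio_length": max_prompt_audio_length,
        "enable_preprocess": enable_preprocess,
    }
    for name, value in extras.items():
        if value is None:
            continue
        if target_model not in EXTRA_OPTION_MODELS:
            raise ValueError(f"{name} is not supported by {target_model}")
        request[name] = value
    return request
```

Trimming `language_hints` to its first item mirrors the documented behavior, so the saved request shows what will actually take effect.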

Response

  • voice_id (string): use this as the voice parameter in later TTS calls
  • request_id (string)
  • usage.count (number, optional)
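
Reading the normalized response might look like the sketch below. `CloneResult` and `parse_clone_response` are hypothetical names, and the sample values in the usage note are placeholders rather than real API output.

```python
from typing import Any, Dict, NamedTuple, Optional


class CloneResult(NamedTuple):
    voice_id: str                  # pass this as `voice` in later TTS calls
    request_id: str
    usage_count: Optional[float]   # usage.count is optional in the response


def parse_clone_response(response: Dict[str, Any]) -> CloneResult:
    """Extract the documented fields from a normalized response dict."""
    voice_id = response.get("voice_id")
    if not isinstance(voice_id, str) or not voice_id:
        raise ValueError("response does not contain a voice_id")
    usage = response.get("usage") or {}
    return CloneResult(
        voice_id=voice_id,
        request_id=str(response.get("request_id", "")),
        usage_count=usage.get("count"),
    )
```

For example, `parse_clone_response({"voice_id": "placeholder-id", "request_id": "placeholder-req"})` yields a result whose `usage_count` is `None`, since `usage.count` was absent.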

Operational guidance

  • For Chinese dialect reference audio, keep language_hints=["zh"]; control dialect style later in synthesis via text or instruct.
  • For cosyvoice-v3.5-plus, supported language_hints include zh, en, fr, de, ja, ko, ru, pt, th, id, vi.
  • Avoid frequent enrollment calls; each call creates a new custom voice and consumes quota.
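
Because each enrollment call creates a new voice and consumes quota, one option is to remember the returned voice_id locally and reuse it. The sketch below is an illustration, not something the skill provides: `enroll_once`, the cache file, and the `enroll` callable (whatever function actually performs the API call) are all hypothetical.

```python
import json
from pathlib import Path
from typing import Callable, Dict


def enroll_once(request: Dict[str, str],
                enroll: Callable[[Dict[str, str]], str],
                cache_path: Path) -> str:
    """Return a cached voice_id for this request, enrolling only on a cache miss.

    `enroll` is a caller-supplied function that performs the real
    enrollment call and returns the new voice_id.
    """
    key = "|".join([request["target_model"], request["prefix"], request["voice_sample_url"]])
    cache: Dict[str, str] = {}
    if cache_path.exists():
        cache = json.loads(cache_path.read_text(encoding="utf-8"))
    if key not in cache:
        cache[key] = enroll(request)  # the only place quota is consumed
        cache_path.parent.mkdir(parents=True, exist_ok=True)
        cache_path.write_text(json.dumps(cache, indent=2), encoding="utf-8")
    return cache[key]
```

The cache key includes `target_model` because a voice enrolled for one model cannot be used for synthesis with another.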

Local helper script

Prepare a normalized request JSON:

python skills/ai/audio/aliyun-cosyvoice-voice-clone/scripts/prepare_cosyvoice_clone_request.py \
  --target-model cosyvoice-v3.5-plus \
  --prefix myvoice \
  --voice-sample-url https://example.com/voice.wav \
  --language-hint zh

Validation

mkdir -p output/aliyun-cosyvoice-voice-clone
# Write the marker file only if every script compiles; py_compile exits non-zero on any failure.
python3 -m py_compile skills/ai/audio/aliyun-cosyvoice-voice-clone/scripts/*.py \
  && echo "py_compile_ok" > output/aliyun-cosyvoice-voice-clone/validate.txt

Pass criteria: command exits 0 and output/aliyun-cosyvoice-voice-clone/validate.txt is generated.

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/aliyun-cosyvoice-voice-clone/.
  • Include target_model, prefix, and sample URL in the evidence file.
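
An evidence file with the three required fields could be written as sketched below. The file name `evidence.json`, the JSON layout, and the `write_evidence` helper are assumptions for illustration; the skill only specifies the output directory and the fields to include.

```python
import json
from pathlib import Path
from typing import Any, Dict

OUTPUT_DIR = Path("output/aliyun-cosyvoice-voice-clone")


def write_evidence(request: Dict[str, Any],
                   response_summary: Dict[str, Any],
                   output_dir: Path = OUTPUT_DIR,
                   filename: str = "evidence.json") -> Path:
    """Save target_model, prefix, sample URL, and a response summary as JSON."""
    output_dir.mkdir(parents=True, exist_ok=True)
    evidence = {
        "target_model": request["target_model"],
        "prefix": request["prefix"],
        "voice_sample_url": request["voice_sample_url"],
        "response": response_summary,  # never include the API key here
    }
    path = output_dir / filename
    path.write_text(json.dumps(evidence, indent=2, ensure_ascii=False) + "\n",
                    encoding="utf-8")
    return path
```

Since the sample URL ends up on disk, treat the output directory as potentially sensitive if the URL identifies a person.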

References

  • references/api_reference.md
  • references/sources.md
Usage Guidance
This skill is generally coherent for preparing Alibaba Cloud CosyVoice enrollment requests, but before installing you should:
  1. Note that the SKILL.md requires a DASHSCOPE_API_KEY or ~/.alibabacloud/credentials entry even though the metadata lists no required env vars; treat that as a manual prerequisite.
  2. Ensure the API key you provide has the minimal permissions needed for voice enrollment, and rotate or remove it when it is no longer needed.
  3. Avoid passing private or sensitive audio via public URLs (the workflow expects a public URL and the script writes the URL to an output file).
  4. Inspect the small helper script (already included) to confirm it only writes local JSON and does not perform network calls or exfiltration.
  5. Verify that the endpoint URLs (dashscope* hosts) match Alibaba Cloud documentation you trust.
  6. Run the provided validation steps in a safe environment to see what files are written under output/aliyun-cosyvoice-voice-clone/ and verify they contain only expected artifacts.
Capability Analysis
Type: OpenClaw Skill
Name: aliyun-cosyvoice-voice-clone
Version: 1.0.0
The skill bundle provides a legitimate interface for interacting with the Alibaba Cloud CosyVoice voice cloning API. The Python script (prepare_cosyvoice_clone_request.py) is a simple utility for generating JSON request payloads, and the SKILL.md instructions correctly guide the agent through the API's requirements and regional constraints without any signs of malicious intent, data exfiltration, or prompt injection.
Capability Assessment
Purpose & Capability
The name, description, SKILL.md, reference docs, and helper script all consistently implement preparing/enrolling CosyVoice cloned voices against Alibaba Cloud Model Studio. However, the package metadata lists no required environment variables while the SKILL.md explicitly requires a DASHSCOPE_API_KEY or ~/.alibabacloud/credentials entry — an inconsistency in declared requirements.
Instruction Scope
Runtime instructions are narrowly scoped to creating enrollment request JSON, validating Python files, and saving outputs. They instruct the user/agent to set an API key and to provide a public audio URL. The guidance to save evidence (including the sample URL and prefix) is expected for auditing but could expose sensitive sample URLs or identifiers if not handled carefully.
Install Mechanism
There is no install spec; this is an instruction-only skill with a small helper script. No downloads, package installs, or archive extraction are performed by the skill itself.
Credentials
Requesting an Alibaba Cloud API key (DASHSCOPE_API_KEY or ~/.alibabacloud/credentials) is proportionate for calling the CosyVoice enrollment endpoint. The concern is that the skill's declared metadata lists no required env vars or primary credential while the SKILL.md requires them — this mismatch can mislead users about what secrets the skill needs.
Persistence & Privilege
The skill does not request permanent/always-on inclusion and does not modify other skills or agent configuration. Autonomous invocation is allowed by default (platform behavior) but the skill metadata does not elevate privileges (always:false).
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install aliyun-cosyvoice-voice-clone
  3. After installation, invoke the skill by name or use /aliyun-cosyvoice-voice-clone
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of aliyun-cosyvoice-voice-clone skill for Alibaba Cloud Model Studio CosyVoice.
  • Supports creating cloned voices using CosyVoice enrollment from public reference audio.
  • Compatible with models: cosyvoice-v3.5-plus, cosyvoice-v3.5-flash, cosyvoice-v3-plus, cosyvoice-v3-flash, and cosyvoice-v2.
  • Allows specifying language hints, audio sample URL, and additional enrollment options.
  • Returns a reusable voice_id for future TTS synthesis.
  • Includes usage guidance, API endpoint details, and helper script for preparing enrollment requests.
Metadata
Slug aliyun-cosyvoice-voice-clone
Version 1.0.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Aliyun Cosyvoice Voice Clone?

Use when creating cloned voices with Alibaba Cloud Model Studio CosyVoice customization models, especially cosyvoice-v3.5-plus or cosyvoice-v3.5-flash, from... It is an AI Agent Skill for Claude Code / OpenClaw, with 108 downloads so far.

How do I install Aliyun Cosyvoice Voice Clone?

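Run "/install aliyun-cosyvoice-voice-clone" in the OpenClaw or Claude Code chat to install it in one step. Using the skill afterward still requires the prerequisites listed in the SKILL.md: a DASHSCOPE_API_KEY (or a dashscope_api_key entry in ~/.alibabacloud/credentials) and a public audio URL.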

Is Aliyun Cosyvoice Voice Clone free?

Yes, Aliyun Cosyvoice Voice Clone is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Aliyun Cosyvoice Voice Clone support?

Aliyun Cosyvoice Voice Clone is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Aliyun Cosyvoice Voice Clone?

It is built and maintained by cinience (@cinience); the current version is v1.0.0.
