
Aliyun Qwen Tts

by cinience · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
107 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install aliyun-qwen-tts
Description
Use when generating human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). Use when converting text...
README (SKILL.md)

Category: provider

Model Studio Qwen TTS

Validation

mkdir -p output/aliyun-qwen-tts
python -m py_compile skills/ai/audio/aliyun-qwen-tts/scripts/generate_tts.py && echo "py_compile_ok" > output/aliyun-qwen-tts/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-tts/validate.txt is generated.
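
The same check can be driven from Python. This is a minimal sketch, not part of the skill: validate_script is a hypothetical helper name, and the default output directory is taken from the commands above.

```python
import py_compile
from pathlib import Path


def validate_script(script_path, out_dir="output/aliyun-qwen-tts"):
    """Byte-compile script_path and write validate.txt on success.

    Returns True when compilation succeeds, False otherwise.
    (Hypothetical helper; mirrors the shell commands above.)
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)  # same effect as `mkdir -p`
    try:
        # doraise=True turns syntax errors into PyCompileError
        py_compile.compile(str(script_path), doraise=True)
    except (py_compile.PyCompileError, OSError):
        return False
    (out / "validate.txt").write_text("py_compile_ok\n")
    return True
```

Calling validate_script("skills/ai/audio/aliyun-qwen-tts/scripts/generate_tts.py") from the repository root should match the pass criteria: a True result and a generated validate.txt.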

Output And Evidence

  • Save generated audio links, sample audio files, and request payloads to output/aliyun-qwen-tts/.
  • Keep one validation log per execution.

Critical model names

Use one of the recommended models:

  • qwen3-tts-flash
  • qwen3-tts-instruct-flash
  • qwen3-tts-instruct-flash-2026-01-26

Prerequisites

  • Install SDK (recommended in a venv to avoid PEP 668 limits):
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials (env takes precedence).
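
The precedence rule above (environment first, credentials file second) could look roughly like the sketch below. It assumes the credentials file is INI-formatted with the key stored as dashscope_api_key under a [default] section; resolve_api_key and its parameters are hypothetical names, and the skill's own script may resolve the key differently.

```python
import configparser
import os
from pathlib import Path


def resolve_api_key(environ=None, credentials_path=None, profile="default"):
    """Return the DashScope API key, or None if none is configured.

    Hypothetical helper: env var first, then the credentials file.
    """
    environ = os.environ if environ is None else environ
    key = environ.get("DASHSCOPE_API_KEY")
    if key:
        return key  # environment variable takes precedence

    path = Path(credentials_path or Path.home() / ".alibabacloud" / "credentials")
    # Assumption: INI layout. interpolation=None keeps '%' in keys literal.
    parser = configparser.ConfigParser(interpolation=None)
    parser.read(path)  # yields no sections if the file is missing
    if parser.has_option(profile, "dashscope_api_key"):
        return parser.get(profile, "dashscope_api_key").strip() or None
    return None
```

Returning None instead of raising lets the caller decide how to report a missing key before any request is sent.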

Normalized interface (tts.generate)

Request

  • text (string, required)
  • voice (string, required)
  • language_type (string, optional; default Auto)
  • instruction (string, optional; recommended for instruct models)
  • stream (bool, optional; default false)

Response

  • audio_url (string, when stream=false)
  • audio_base64_pcm (string, when stream=true)
  • sample_rate (int, 24000)
  • format (string, wav or pcm depending on mode)
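
One way to model the fields listed above is a pair of dataclasses. This is an illustrative sketch only: TtsRequest, TtsResponse, and to_call_kwargs are hypothetical names, and pairing format "wav" with the non-streaming mode is an assumption (the interface only says the format depends on mode).

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TtsRequest:
    text: str
    voice: str
    language_type: str = "Auto"
    instruction: Optional[str] = None  # recommended for instruct models
    stream: bool = False

    def __post_init__(self):
        # text and voice are the two required fields
        if not self.text or not self.voice:
            raise ValueError("text and voice are required")


@dataclass
class TtsResponse:
    audio_url: Optional[str] = None         # set when stream=False
    audio_base64_pcm: Optional[str] = None  # set when stream=True
    sample_rate: int = 24000
    format: str = "wav"  # assumption: "wav" for URL mode, "pcm" for streaming


def to_call_kwargs(req, model):
    """Build keyword arguments in the shape the quick start passes to the SDK."""
    kwargs = {
        "model": model,
        "text": req.text,
        "voice": req.voice,
        "language_type": req.language_type,
        "stream": req.stream,
    }
    if req.instruction:  # omit the field entirely when no style is requested
        kwargs["instruction"] = req.instruction
    return kwargs
```

Validating in __post_init__ rejects an empty text or voice before a billable request is made.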

Quick start (Python + DashScope SDK)

import os
import dashscope

# Prefer env var for auth: export DASHSCOPE_API_KEY=...
# Or use ~/.alibabacloud/credentials with dashscope_api_key under [default].
# Beijing region; for Singapore use: https://dashscope-intl.aliyuncs.com/api/v1
dashscope.base_http_api_url = "https://dashscope.aliyuncs.com/api/v1"

text = "Hello, this is a short voice line."
response = dashscope.MultiModalConversation.call(
    model="qwen3-tts-instruct-flash",
    api_key=os.getenv("DASHSCOPE_API_KEY"),
    text=text,
    voice="Cherry",
    language_type="English",
    instruction="Warm and calm tone, slightly slower pace.",
    stream=False,
)

audio_url = response.output.audio.url
print(audio_url)

Streaming notes

  • stream=True returns Base64-encoded PCM chunks at 24 kHz.
  • Decode each chunk, then play it or append it to a PCM buffer.
  • finish_reason == "stop" in the response marks the end of the stream.
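
The decode-and-concatenate step can be sketched with the standard library alone. The sample rate comes from the notes above; 16-bit mono samples are an assumption not stated here, and pcm_chunks_to_wav is a hypothetical helper that takes the Base64 strings already pulled out of the stream.

```python
import base64
import io
import wave


def pcm_chunks_to_wav(b64_chunks, sample_rate=24000, channels=1, sample_width=2):
    """Decode Base64 PCM chunks and wrap them in a WAV container (bytes).

    Assumption: 16-bit mono PCM (sample_width=2, channels=1).
    """
    # Skip empty chunks, decode the rest, and join them in arrival order.
    pcm = b"".join(base64.b64decode(c) for c in b64_chunks if c)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(channels)
        wav.setsampwidth(sample_width)
        wav.setframerate(sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()
```

Writing the returned bytes to a .wav file gives a playable result, provided the assumed sample width and channel count match what the service actually sends.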

Operational guidance

  • Keep requests concise; split long text into multiple calls if you hit size or timeout errors.
  • Use language_type consistent with the text to improve pronunciation.
  • Use instruction only when you need explicit style/tone control.
  • Cache by (text, voice, language_type) to avoid repeat costs.
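
Two of the points above, splitting long text and caching by (text, voice, language_type), can be sketched as follows. The 300-character limit is a placeholder rather than a documented API limit, the splitter only recognizes sentence ends followed by whitespace, and both function names are hypothetical.

```python
import hashlib
import json
import re


def cache_key(text, voice, language_type="Auto"):
    """Stable key for the (text, voice, language_type) triple."""
    payload = json.dumps([text, voice, language_type], ensure_ascii=False)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


def split_text(text, max_chars=300):
    """Split text into pieces of at most max_chars, preferring sentence ends."""
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    pieces, current = [], ""
    for sentence in sentences:
        if not sentence:
            continue
        # Hard-wrap any single sentence that exceeds the limit on its own.
        while len(sentence) > max_chars:
            if current:
                pieces.append(current)
                current = ""
            pieces.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate  # sentence still fits in the current piece
        else:
            pieces.append(current)
            current = sentence
    if current:
        pieces.append(current)
    return pieces
```

Each piece can be sent as its own call and looked up in the cache first. If the model or instruction varies between calls, those values would probably need to be part of the key as well, which goes beyond what the guidance above states.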

Output location

  • Default output: output/aliyun-qwen-tts/audio/
  • Override base dir with OUTPUT_DIR.
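
A rough sketch of how the output path might be resolved and a returned audio URL saved into it. It assumes OUTPUT_DIR replaces only the leading output component of the default path; the actual script may interpret the variable differently. resolve_audio_dir and save_audio are hypothetical names.

```python
import os
import shutil
import urllib.request
from pathlib import Path


def resolve_audio_dir(environ=None):
    """Return the audio directory, honoring OUTPUT_DIR when it is set.

    Assumption: OUTPUT_DIR stands in for the leading "output" component.
    """
    environ = os.environ if environ is None else environ
    base = environ.get("OUTPUT_DIR") or "output"
    return Path(base) / "aliyun-qwen-tts" / "audio"


def save_audio(audio_url, filename, environ=None):
    """Download audio_url into the resolved audio directory."""
    target_dir = resolve_audio_dir(environ)
    target_dir.mkdir(parents=True, exist_ok=True)
    target = target_dir / filename
    # Stream the response body to disk instead of holding it in memory.
    with urllib.request.urlopen(audio_url) as resp, open(target, "wb") as fh:
        shutil.copyfileobj(resp, fh)
    return target
```

With OUTPUT_DIR unset, save_audio(audio_url, "line-001.wav") would write to output/aliyun-qwen-tts/audio/line-001.wav.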

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

References

  • references/api_reference.md for parameter mapping and streaming example.

  • Realtime mode is provided by skills/ai/audio/aliyun-qwen-tts-realtime/.

  • Voice cloning/design are provided by skills/ai/audio/aliyun-qwen-tts-voice-clone/ and skills/ai/audio/aliyun-qwen-tts-voice-design/.

  • Source list: references/sources.md

Usage Guidance
Before installing or running this skill:
  1. Expect to provide a DASHSCOPE_API_KEY. The registry metadata omits this, so confirm you are comfortable supplying that secret.
  2. The script will try to read .env files and ~/.alibabacloud/credentials; avoid keeping high-privilege or unrelated secrets in those files, or run the skill in a sandbox.
  3. If you must install the 'dashscope' Python package, install it in an isolated venv and verify the package source (PyPI or vendor site).
  4. Prefer creating a dedicated DashScope API key with minimal scope for TTS usage, and rotate or revoke it if you stop using the skill.
  5. If you need higher assurance, review the included generate_tts.py yourself, or run the skill in a controlled environment (no sensitive credentials present) to observe its network calls and output.
Capability Analysis
Type: OpenClaw Skill
Name: aliyun-qwen-tts
Version: 1.0.0
The skill is a legitimate integration for Alibaba Cloud's Qwen TTS service. It uses the official DashScope SDK and follows standard practices for credential management, including reading from environment variables and the '~/.alibabacloud/credentials' file. The core logic in 'scripts/generate_tts.py' is focused on generating and downloading audio files as described in 'SKILL.md', with no evidence of malicious intent, data exfiltration, or harmful prompt injection.
Capability Assessment
Purpose & Capability
Name/description, SKILL.md, and the Python script all align: this is a DashScope (Alibaba) Qwen TTS client. However, the registry metadata lists no required environment variables or primary credential, while the instructions and script require DASHSCOPE_API_KEY (or credentials from ~/.alibabacloud/credentials). Users should be aware of this inconsistency.
Instruction Scope
Runtime instructions and the script stay within TTS functionality (compose request, call DashScope API, download audio, save outputs). The script intentionally reads .env files in cwd and in the repo root and will read ~/.alibabacloud/credentials (and honor ALIBABA_CLOUD_PROFILE / ALICLOUD_PROFILE). These file accesses are expected for fetching the API key but are broader than the declared registry requirements.
Install Mechanism
No install spec is embedded (instruction-only skill). SKILL.md recommends installing the 'dashscope' Python SDK via pip in a venv. This is standard, but installing third-party packages carries normal supply-chain risk; the package source should be verified.
Credentials
The skill requires an API key (DASHSCOPE_API_KEY) and reads ~/.alibabacloud/credentials and .env files, but the registry metadata declares no required env vars or primary credential. It also uses OUTPUT_DIR and honors ALIBABA_CLOUD_PROFILE / ALICLOUD_PROFILE implicitly. Requesting access to local credential files is proportionate to calling the DashScope API, but the missing declaration is a notable inconsistency.
Persistence & Privilege
The skill is not marked 'always: true', does not modify other skills or system-wide config, and relies on explicitly provided credentials. Autonomous invocation is allowed by default but not combined with other high-risk flags here.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install aliyun-qwen-tts
  3. After installation, invoke the skill by name or use /aliyun-qwen-tts
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of aliyun-qwen-tts.
  • Provides text-to-speech conversion using DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash).
  • Supports audio generation for short videos, news, and related use cases.
  • Outlines request/response fields and setup steps for the DashScope SDK.
  • Includes both standard and streaming TTS options.
  • Offers operational tips, output conventions, and validation instructions.
Metadata
Slug: aliyun-qwen-tts
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is Aliyun Qwen Tts?

Aliyun Qwen Tts is an AI Agent Skill for Claude Code / OpenClaw that generates human-like speech audio with Model Studio DashScope Qwen TTS models (qwen3-tts-flash, qwen3-tts-instruct-flash). It has 107 downloads so far.

How do I install Aliyun Qwen Tts?

Run "/install aliyun-qwen-tts" in the OpenClaw or Claude Code chat to install it in one step. To run the skill you also need the dashscope Python SDK and a DASHSCOPE_API_KEY, as described under Prerequisites.

Is Aliyun Qwen Tts free?

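The skill itself is free: it is licensed under MIT-0, so you can download, install, and use it at no cost. Requests to the DashScope API are made with your own DASHSCOPE_API_KEY and may incur charges, which is why the operational guidance recommends caching to avoid repeat costs.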

Which platforms does Aliyun Qwen Tts support?

Aliyun Qwen Tts is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Aliyun Qwen Tts?

It is built and maintained by cinience (@cinience); the current version is v1.0.0.
