← Back to Skills Marketplace

Alicloud Ai Audio Tts Realtime

Name: Alicloud Ai Audio Tts Realtime
Author: cinience

by cinience · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

797

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install alicloud-ai-audio-tts-realtime

Description

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi...

README (SKILL.md)

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

qwen3-tts-flash-realtime
qwen3-tts-instruct-flash-realtime
qwen3-tts-instruct-flash-realtime-2026-01-22
qwen3-tts-vd-realtime-2026-01-15
qwen3-tts-vc-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.realtime)

Request

text (string, required)
voice (string, required)
instruction (string, optional)
sample_rate (int, optional)

Response

audio_base64_pcm_chunks (array\x3Cstring>)
sample_rate (int)
finish_reason (string)

Operational guidance

Use websocket or streaming endpoint for realtime mode.
Keep each utterance short for lower latency.
For instruction models, keep instruction explicit and concise.
Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fallback to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

Default output: output/ai-audio-tts-realtime/audio/
Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/alicloud-ai-audio-tts-realtime
for f in skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-realtime/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-realtime/validate.txt is generated.

Output And Evidence

Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-realtime/.
Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
Run one minimal read-only query first to verify connectivity and permissions.
Execute the target operation with explicit parameters and bounded scope.
Verify results and save output/evidence files.

References

references/sources.md

Usage Guidance

This skill generally does what it says (realtime TTS using Alibaba's dashscope SDK), but take these precautions before installing: - Expect to supply DASHSCOPE_API_KEY (or add dashscope_api_key to ~/.alibabacloud/credentials). The skill metadata should have declared this but didn't — treat that as a red flag. - The bundled script will auto-load .env files from the current working directory and the repo root. If you have sensitive secrets in .env, run the probe in a clean/sandboxed environment or remove secrets first. - The script downloads audio from URLs returned by the service. Ensure you trust the dashscope endpoint (default base URL is dashscope.aliyuncs.com) and consider running in a network-restricted environment if you want to limit unexpected external fetches. - Review the script locally before running (it is short and readable). If you proceed, run in a virtualenv, set OUTPUT_DIR to an isolated path, and verify the DASHSCOPE_API_KEY scope/permissions are minimal. - If you maintain the registry entry, update the metadata to declare DASHSCOPE_API_KEY in requires.env and document the .env auto-loading behavior to remove the mismatch.

Capability Analysis

Type: OpenClaw Skill Name: alicloud-ai-audio-tts-realtime Version: 1.0.1 The skill bundle provides a legitimate interface for Alibaba Cloud's Qwen Realtime TTS service. The script `scripts/realtime_tts_demo.py` correctly implements authentication by checking environment variables and the standard Alibaba Cloud credentials file (~/.alibabacloud/credentials). All network activity is directed toward the official Alibaba Cloud API (dashscope.aliyuncs.com), and the code lacks any indicators of malicious intent, such as data exfiltration or unauthorized command execution.

Capability Assessment

ℹ Purpose & Capability

The name/description and the included Python probe script align with realtime TTS via Alibaba Cloud's dashscope SDK and the listed Qwen models — that capability is coherent. However, the package metadata declares no required environment variables while the runtime script requires DASHSCOPE_API_KEY (or credentials file), which is an inconsistency.

⚠ Instruction Scope

SKILL.md gives reasonable runtime guidance, but the included script will automatically load .env files from the current working dir and the repository root and will read ~/.alibabacloud/credentials to populate DASHSCOPE_API_KEY; this behavior means the skill could pull unrelated secrets from your environment. The script also downloads audio from URLs returned by the service (audio_url) — normally expected, but it will fetch arbitrary URLs provided in responses, which enlarges the network trust surface.

✓ Install Mechanism

No install spec (instruction-only) and only a pip dependency (dashscope) are required. There is no opaque remote download/install mechanism in the skill bundle itself.

⚠ Credentials

The runtime requires DASHSCOPE_API_KEY (or credentials entry) though the registry metadata lists no required env vars; this mismatch is misleading. Additionally, the script's auto-loading of .env files can expose unrelated environment secrets to the skill if present.

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills or system-wide configs, and is user-invocable. It writes outputs to a user-visible output directory only.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install alicloud-ai-audio-tts-realtime
After installation, invoke the skill by name or use /alicloud-ai-audio-tts-realtime
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

batch publish from alicloud-skills on 2026-03-11

v1.0.0

Initial release of Alibaba Cloud Model Studio Qwen TTS Realtime integration. - Provides low-latency, real-time speech synthesis using specific Qwen TTS models. - Supports instruction-controlled and standard real-time TTS. - Includes SDK setup, environment requirements, and compatibility probe/demo scripts. - Documents both required and optional API parameters for normalized requests and responses. - Offers operational guidance and output management options.

Metadata

Slug alicloud-ai-audio-tts-realtime

Version 1.0.1

License MIT-0

All-time Installs 2

Active Installs 2

Total Versions 2

Frequently Asked Questions

What is Alicloud Ai Audio Tts Realtime?

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi... It is an AI Agent Skill for Claude Code / OpenClaw, with 797 downloads so far.

How do I install Alicloud Ai Audio Tts Realtime?

Run "/install alicloud-ai-audio-tts-realtime" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Alicloud Ai Audio Tts Realtime free?

Yes, Alicloud Ai Audio Tts Realtime is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Alicloud Ai Audio Tts Realtime support?

Alicloud Ai Audio Tts Realtime is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Alicloud Ai Audio Tts Realtime?

It is built and maintained by cinience (@cinience); the current version is v1.0.1.

More Skills