← 返回 Skills 市场

Alicloud Ai Audio Tts Realtime

Name: Alicloud Ai Audio Tts Realtime
Author: cinience

作者 cinience · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ⚠ suspicious

797

总下载

当前安装

版本数

在 OpenClaw 中安装

/install alicloud-ai-audio-tts-realtime

功能描述

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi...

使用说明 (SKILL.md)

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

qwen3-tts-flash-realtime
qwen3-tts-instruct-flash-realtime
qwen3-tts-instruct-flash-realtime-2026-01-22
qwen3-tts-vd-realtime-2026-01-15
qwen3-tts-vc-realtime-2026-01-15

Prerequisites

Install SDK in a virtual environment:

python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope

Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.

Normalized interface (tts.realtime)

Request

text (string, required)
voice (string, required)
instruction (string, optional)
sample_rate (int, optional)

Response

audio_base64_pcm_chunks (array\x3Cstring>)
sample_rate (int)
finish_reason (string)

Operational guidance

Use websocket or streaming endpoint for realtime mode.
Keep each utterance short for lower latency.
For instruction models, keep instruction explicit and concise.
Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fallback to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

Default output: output/ai-audio-tts-realtime/audio/
Override base dir with OUTPUT_DIR.

Validation

mkdir -p output/alicloud-ai-audio-tts-realtime
for f in skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f"
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-realtime/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-realtime/validate.txt is generated.

Output And Evidence

Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-realtime/.
Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
Run one minimal read-only query first to verify connectivity and permissions.
Execute the target operation with explicit parameters and bounded scope.
Verify results and save output/evidence files.

References

references/sources.md

安全使用建议

This skill generally does what it says (realtime TTS using Alibaba's dashscope SDK), but take these precautions before installing: - Expect to supply DASHSCOPE_API_KEY (or add dashscope_api_key to ~/.alibabacloud/credentials). The skill metadata should have declared this but didn't — treat that as a red flag. - The bundled script will auto-load .env files from the current working directory and the repo root. If you have sensitive secrets in .env, run the probe in a clean/sandboxed environment or remove secrets first. - The script downloads audio from URLs returned by the service. Ensure you trust the dashscope endpoint (default base URL is dashscope.aliyuncs.com) and consider running in a network-restricted environment if you want to limit unexpected external fetches. - Review the script locally before running (it is short and readable). If you proceed, run in a virtualenv, set OUTPUT_DIR to an isolated path, and verify the DASHSCOPE_API_KEY scope/permissions are minimal. - If you maintain the registry entry, update the metadata to declare DASHSCOPE_API_KEY in requires.env and document the .env auto-loading behavior to remove the mismatch.

功能分析

Type: OpenClaw Skill Name: alicloud-ai-audio-tts-realtime Version: 1.0.1 The skill bundle provides a legitimate interface for Alibaba Cloud's Qwen Realtime TTS service. The script `scripts/realtime_tts_demo.py` correctly implements authentication by checking environment variables and the standard Alibaba Cloud credentials file (~/.alibabacloud/credentials). All network activity is directed toward the official Alibaba Cloud API (dashscope.aliyuncs.com), and the code lacks any indicators of malicious intent, such as data exfiltration or unauthorized command execution.

能力评估

ℹ Purpose & Capability

The name/description and the included Python probe script align with realtime TTS via Alibaba Cloud's dashscope SDK and the listed Qwen models — that capability is coherent. However, the package metadata declares no required environment variables while the runtime script requires DASHSCOPE_API_KEY (or credentials file), which is an inconsistency.

⚠ Instruction Scope

SKILL.md gives reasonable runtime guidance, but the included script will automatically load .env files from the current working dir and the repository root and will read ~/.alibabacloud/credentials to populate DASHSCOPE_API_KEY; this behavior means the skill could pull unrelated secrets from your environment. The script also downloads audio from URLs returned by the service (audio_url) — normally expected, but it will fetch arbitrary URLs provided in responses, which enlarges the network trust surface.

✓ Install Mechanism

No install spec (instruction-only) and only a pip dependency (dashscope) are required. There is no opaque remote download/install mechanism in the skill bundle itself.

⚠ Credentials

The runtime requires DASHSCOPE_API_KEY (or credentials entry) though the registry metadata lists no required env vars; this mismatch is misleading. Additionally, the script's auto-loading of .env files can expose unrelated environment secrets to the skill if present.

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills or system-wide configs, and is user-invocable. It writes outputs to a user-visible output directory only.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install alicloud-ai-audio-tts-realtime
安装完成后，直接呼叫该 Skill 的名称或使用 /alicloud-ai-audio-tts-realtime 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.1

batch publish from alicloud-skills on 2026-03-11

v1.0.0

Initial release of Alibaba Cloud Model Studio Qwen TTS Realtime integration. - Provides low-latency, real-time speech synthesis using specific Qwen TTS models. - Supports instruction-controlled and standard real-time TTS. - Includes SDK setup, environment requirements, and compatibility probe/demo scripts. - Documents both required and optional API parameters for normalized requests and responses. - Offers operational guidance and output management options.

元数据

Slug alicloud-ai-audio-tts-realtime

版本 1.0.1

许可证 MIT-0

累计安装 2

当前安装数 2

历史版本数 2

常见问题