Alicloud Ai Audio Tts Realtime

Author: cinience · GitHub · v1.0.1 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 797
Favorites: 0
Current installs: 2
Versions: 2
Install in OpenClaw
/install alicloud-ai-audio-tts-realtime
Description
Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi...
Usage instructions (SKILL.md)

Category: provider

Model Studio Qwen TTS Realtime

Use realtime TTS models for low-latency streaming speech output.

Critical model names

Use one of these exact model strings:

  • qwen3-tts-flash-realtime
  • qwen3-tts-instruct-flash-realtime
  • qwen3-tts-instruct-flash-realtime-2026-01-22
  • qwen3-tts-vd-realtime-2026-01-15
  • qwen3-tts-vc-realtime-2026-01-15
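
Because the model strings must match exactly, a caller may want to reject typos before any request is sent. A minimal sketch, assuming a hypothetical helper named `require_realtime_model` (not part of the skill or the dashscope SDK):

```python
# Exact model strings copied from the list above.
REALTIME_MODELS = frozenset({
    "qwen3-tts-flash-realtime",
    "qwen3-tts-instruct-flash-realtime",
    "qwen3-tts-instruct-flash-realtime-2026-01-22",
    "qwen3-tts-vd-realtime-2026-01-15",
    "qwen3-tts-vc-realtime-2026-01-15",
})


def require_realtime_model(name: str) -> str:
    """Return the name unchanged if it is a listed model, else raise ValueError."""
    if name not in REALTIME_MODELS:
        raise ValueError(f"unknown realtime TTS model: {name!r}")
    return name
```

The check is deliberately strict: no prefix matching and no case folding, so a near-miss such as `qwen3-tts-flash` fails early instead of at the API.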

Prerequisites

  • Install SDK in a virtual environment:
python3 -m venv .venv
. .venv/bin/activate
python -m pip install dashscope
  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
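
The key lookup order described above (environment first, then the credentials file) could be sketched as follows. This is an illustration, not the bundled script's code: the function name `resolve_api_key` is hypothetical, and the credentials file is assumed to be INI-formatted with a `[default]` section.

```python
import configparser
import os
from pathlib import Path
from typing import Optional


def resolve_api_key(credentials_path: Optional[Path] = None) -> Optional[str]:
    """Return the API key from the environment, else from the credentials file."""
    key = os.environ.get("DASHSCOPE_API_KEY")
    if key:
        return key
    path = credentials_path or Path.home() / ".alibabacloud" / "credentials"
    if not path.is_file():
        return None
    parser = configparser.ConfigParser()
    try:
        parser.read(path, encoding="utf-8")
    except configparser.Error:
        # Not a parseable INI file; treat it as "no key found".
        return None
    # Assumption: the key lives under a [default] section.
    return parser.get("default", "dashscope_api_key", fallback=None)
```

Returning `None` rather than raising lets the caller decide whether a missing key is fatal, for example only in strict mode.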

Normalized interface (tts.realtime)

Request

  • text (string, required)
  • voice (string, required)
  • instruction (string, optional)
  • sample_rate (int, optional)

Response

  • audio_base64_pcm_chunks (array<string>)
  • sample_rate (int)
  • finish_reason (string)
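
The normalized request and response shapes above can be expressed as plain dataclasses, with a helper that turns the base64 PCM chunks into a playable WAV. This is a sketch under stated assumptions: the class and function names are hypothetical, and the PCM is assumed to be 16-bit mono, which the interface above does not specify.

```python
import base64
import io
import wave
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class TtsRealtimeRequest:
    text: str
    voice: str
    instruction: Optional[str] = None
    sample_rate: Optional[int] = None


@dataclass
class TtsRealtimeResponse:
    audio_base64_pcm_chunks: List[str]
    sample_rate: int
    finish_reason: str


def response_to_wav_bytes(resp: TtsRealtimeResponse) -> bytes:
    """Decode the chunks in order and wrap the raw PCM in a WAV container."""
    pcm = b"".join(base64.b64decode(c) for c in resp.audio_base64_pcm_chunks)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)  # assumption: mono
        wav.setsampwidth(2)  # assumption: 16-bit samples
        wav.setframerate(resp.sample_rate)
        wav.writeframes(pcm)
    return buf.getvalue()
```

Each chunk is decoded separately before concatenation, since base64 strings with their own padding cannot safely be joined as text first.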

Operational guidance

  • Use a WebSocket or streaming endpoint for realtime mode.
  • Keep each utterance short for lower latency.
  • For instruction models, keep the instruction explicit and concise.
  • Some SDK/runtime combinations may reject realtime model calls over MultiModalConversation; use the probe script below to verify compatibility.
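
One way to keep utterances short is to split long input at sentence boundaries before synthesis. A minimal sketch, assuming a hypothetical helper `split_utterances` and an arbitrary 120-character budget (the skill does not prescribe a limit):

```python
import re
from typing import List

# Latin punctuation must be followed by whitespace (so "3.14" stays intact);
# CJK sentence punctuation splits even without a following space.
_SENTENCE_BREAK = re.compile(r"(?<=[.!?])\s+|(?<=[。!?])")


def split_utterances(text: str, max_chars: int = 120) -> List[str]:
    """Split text into sentences, then pack sentences into chunks up to max_chars."""
    sentences = [s.strip() for s in _SENTENCE_BREAK.split(text) if s.strip()]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}".strip()
        if current and len(candidate) > max_chars:
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    return chunks
```

A single sentence longer than `max_chars` is kept whole rather than cut mid-sentence, since a mid-sentence cut tends to produce unnatural prosody.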

Local demo script

Use the probe script to verify realtime compatibility in your current SDK/runtime, and optionally fall back to a non-realtime model for immediate output:

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "This is a realtime speech demo." \
  --fallback \
  --output output/ai-audio-tts-realtime/audio/fallback-demo.wav

Strict mode (for CI / gating):

.venv/bin/python skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/realtime_tts_demo.py \
  --text "realtime health check" \
  --strict

Output location

  • Default output: output/ai-audio-tts-realtime/audio/
  • Override base dir with OUTPUT_DIR.
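
The override could be resolved along these lines. This is only a sketch: it assumes OUTPUT_DIR replaces the leading `output` component and that the `ai-audio-tts-realtime/audio` suffix is kept, which the text above does not confirm; the function name is hypothetical.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def audio_output_dir(env: Optional[Mapping[str, str]] = None) -> Path:
    """Return the audio output directory, honoring OUTPUT_DIR when it is set."""
    env = os.environ if env is None else env
    # Assumption: OUTPUT_DIR replaces only the leading "output" component.
    base = Path(env.get("OUTPUT_DIR") or "output")
    return base / "ai-audio-tts-realtime" / "audio"
```

Passing the environment as a parameter keeps the function easy to test without mutating `os.environ`.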

Validation

mkdir -p output/alicloud-ai-audio-tts-realtime
for f in skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts/*.py; do
  python3 -m py_compile "$f" || exit 1
done
echo "py_compile_ok" > output/alicloud-ai-audio-tts-realtime/validate.txt

Pass criteria: command exits 0 and output/alicloud-ai-audio-tts-realtime/validate.txt is generated.
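
The same validation can be run from Python, which makes the failure path explicit: the marker file is written only if every script compiles. A sketch with a hypothetical function name, using the standard-library `py_compile` module:

```python
import py_compile
from pathlib import Path
from typing import List


def validate_scripts(scripts_dir: Path, marker: Path) -> List[Path]:
    """Byte-compile every *.py file in scripts_dir, then write the marker file."""
    compiled: List[Path] = []
    for script in sorted(scripts_dir.glob("*.py")):
        # doraise=True turns a compile failure into py_compile.PyCompileError.
        py_compile.compile(str(script), doraise=True)
        compiled.append(script)
    marker.parent.mkdir(parents=True, exist_ok=True)
    marker.write_text("py_compile_ok\n", encoding="utf-8")
    return compiled
```

For the paths used in this section, the call would be `validate_scripts(Path("skills/ai/audio/alicloud-ai-audio-tts-realtime/scripts"), Path("output/alicloud-ai-audio-tts-realtime/validate.txt"))`.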

Output And Evidence

  • Save artifacts, command outputs, and API response summaries under output/alicloud-ai-audio-tts-realtime/.
  • Include key parameters (region/resource id/time range) in evidence files for reproducibility.

Workflow

  1. Confirm user intent, region, identifiers, and whether the operation is read-only or mutating.
  2. Run one minimal read-only query first to verify connectivity and permissions.
  3. Execute the target operation with explicit parameters and bounded scope.
  4. Verify results and save output/evidence files.

References

  • references/sources.md
Security recommendations
This skill generally does what it says (realtime TTS using Alibaba's dashscope SDK), but take these precautions before installing:

  • Expect to supply DASHSCOPE_API_KEY (or add dashscope_api_key to ~/.alibabacloud/credentials). The skill metadata should have declared this but didn't; treat that as a red flag.
  • The bundled script will auto-load .env files from the current working directory and the repo root. If you have sensitive secrets in .env, run the probe in a clean/sandboxed environment or remove secrets first.
  • The script downloads audio from URLs returned by the service. Ensure you trust the dashscope endpoint (default base URL is dashscope.aliyuncs.com) and consider running in a network-restricted environment if you want to limit unexpected external fetches.
  • Review the script locally before running (it is short and readable). If you proceed, run in a virtualenv, set OUTPUT_DIR to an isolated path, and verify the DASHSCOPE_API_KEY scope/permissions are minimal.
  • If you maintain the registry entry, update the metadata to declare DASHSCOPE_API_KEY in requires.env and document the .env auto-loading behavior to remove the mismatch.
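
To narrow the network trust surface mentioned above, a caller could check each returned audio URL against a host allowlist before fetching it. A minimal sketch: the helper name is hypothetical, and the `aliyuncs.com` suffix is an assumption drawn from the default base URL, so adjust it to the hosts your deployment actually returns.

```python
from urllib.parse import urlparse

# Assumption: only HTTPS hosts under aliyuncs.com are trusted.
TRUSTED_SUFFIXES = ("aliyuncs.com",)


def is_trusted_audio_url(url: str) -> bool:
    """Return True only for HTTPS URLs whose host is a trusted domain or subdomain."""
    parsed = urlparse(url)
    host = (parsed.hostname or "").lower()
    if parsed.scheme != "https" or not host:
        return False
    return any(host == s or host.endswith("." + s) for s in TRUSTED_SUFFIXES)
```

Matching on `"." + suffix` rather than a bare `endswith(suffix)` prevents look-alike hosts such as `evilaliyuncs.com` from passing.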
Functional analysis
Type: OpenClaw Skill
Name: alicloud-ai-audio-tts-realtime
Version: 1.0.1

The skill bundle provides a legitimate interface for Alibaba Cloud's Qwen Realtime TTS service. The script `scripts/realtime_tts_demo.py` correctly implements authentication by checking environment variables and the standard Alibaba Cloud credentials file (~/.alibabacloud/credentials). All network activity is directed toward the official Alibaba Cloud API (dashscope.aliyuncs.com), and the code lacks any indicators of malicious intent, such as data exfiltration or unauthorized command execution.
Capability assessment
Purpose & Capability
The name/description and the included Python probe script align with realtime TTS via Alibaba Cloud's dashscope SDK and the listed Qwen models — that capability is coherent. However, the package metadata declares no required environment variables while the runtime script requires DASHSCOPE_API_KEY (or credentials file), which is an inconsistency.
Instruction Scope
SKILL.md gives reasonable runtime guidance, but the included script will automatically load .env files from the current working dir and the repository root and will read ~/.alibabacloud/credentials to populate DASHSCOPE_API_KEY; this behavior means the skill could pull unrelated secrets from your environment. The script also downloads audio from URLs returned by the service (audio_url) — normally expected, but it will fetch arbitrary URLs provided in responses, which enlarges the network trust surface.
Install Mechanism
No install spec (instruction-only) and only a pip dependency (dashscope) are required. There is no opaque remote download/install mechanism in the skill bundle itself.
Credentials
The runtime requires DASHSCOPE_API_KEY (or credentials entry) though the registry metadata lists no required env vars; this mismatch is misleading. Additionally, the script's auto-loading of .env files can expose unrelated environment secrets to the skill if present.
Persistence & Privilege
The skill does not request always:true, does not modify other skills or system-wide configs, and is user-invocable. It writes outputs to a user-visible output directory only.
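How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install alicloud-ai-audio-tts-realtime
  3. Once installed, invoke the Skill by name or trigger it with /alicloud-ai-audio-tts-realtime.
  4. Provide the inputs described in the Skill's parameter documentation to get structured output.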
Version history
v1.0.1
batch publish from alicloud-skills on 2026-03-11
v1.0.0
Initial release of Alibaba Cloud Model Studio Qwen TTS Realtime integration.
  • Provides low-latency, real-time speech synthesis using specific Qwen TTS models.
  • Supports instruction-controlled and standard real-time TTS.
  • Includes SDK setup, environment requirements, and compatibility probe/demo scripts.
  • Documents both required and optional API parameters for normalized requests and responses.
  • Offers operational guidance and output management options.
Metadata
Slug alicloud-ai-audio-tts-realtime
Version 1.0.1
License MIT-0
Total installs 2
Current installs 2
Versions 2
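FAQ

What is Alicloud Ai Audio Tts Realtime?

Real-time speech synthesis with Alibaba Cloud Model Studio Qwen TTS Realtime models. Use when low-latency interactive speech is required, including instructi... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 797 total downloads to date.

How do I install Alicloud Ai Audio Tts Realtime?

Run the command "/install alicloud-ai-audio-tts-realtime" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra configuration, but the skill still requires DASHSCOPE_API_KEY at runtime (see Prerequisites).

Is Alicloud Ai Audio Tts Realtime free?

Yes. Alicloud Ai Audio Tts Realtime is released under the MIT-0 license and can be freely downloaded, installed, and used.

Which platforms does Alicloud Ai Audio Tts Realtime support?

Alicloud Ai Audio Tts Realtime is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Alicloud Ai Audio Tts Realtime?

It is developed and maintained by cinience (@cinience); the current version is v1.0.1.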
