← Back to Skills Marketplace
cinience

Aliyun Qwen Asr Realtime

by cinience · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
101
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install aliyun-qwen-asr-realtime
Description
Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li...
README (SKILL.md)

Category: provider

Model Studio Qwen ASR Realtime

Validation

mkdir -p output/aliyun-qwen-asr-realtime
python -m py_compile skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py && echo "py_compile_ok" > output/aliyun-qwen-asr-realtime/validate.txt

Pass criteria: command exits 0 and output/aliyun-qwen-asr-realtime/validate.txt is generated.

Output And Evidence

  • Save session payloads and response samples under output/aliyun-qwen-asr-realtime/.

Critical model names

Use one of these exact model strings:

  • qwen3-asr-flash-realtime
  • qwen3-asr-flash-realtime-2026-02-10

Use cases

  • Realtime subtitles and captions
  • Voice-agent duplex input
  • Streaming speech-to-text in browser or terminal clients

Prerequisites

  • Set DASHSCOPE_API_KEY in your environment, or add dashscope_api_key to ~/.alibabacloud/credentials.
  • Realtime sessions generally require WebSocket or streaming session handling in the client.

Normalized interface (asr.realtime)

Request

  • model (string, optional): default qwen3-asr-flash-realtime
  • language_hints (array\x3Cstring>, optional)
  • format (string, optional): e.g. pcm, wav
  • sample_rate (int, optional): e.g. 16000
  • chunk_ms (int, optional): frame size in milliseconds

Response

  • text (string): recognized transcript fragment
  • is_final (bool): finalization marker
  • usage (object, optional)

Quick start

Generate a request template:

python skills/ai/audio/aliyun-qwen-asr-realtime/scripts/prepare_realtime_asr_request.py \
  --output output/aliyun-qwen-asr-realtime/request.json

Operational guidance

  • Prefer 16kHz mono PCM unless your client stack requires another format.
  • Keep chunks small enough for responsive partial results.
  • If you only have recorded files, use skills/ai/audio/aliyun-qwen-asr/ instead.

References

  • references/sources.md
Usage Guidance
This skill appears to implement what it claims (a small helper and documentation for Alibaba Qwen ASR realtime), but the package metadata omits the credential and config-path requirements declared in SKILL.md. Before installing or providing credentials: 1) Verify the upstream source and trustworthiness (homepage/source unknown). 2) Confirm that DASHSCOPE_API_KEY is the correct and expected credential for your Alibaba account and not a misnamed secret; prefer using a dedicated API key with limited scope. 3) Prefer setting the API key in a process environment variable rather than adding secrets to ~/.alibabacloud/credentials unless you trust the code and know what will read that file. 4) Ask the publisher to update the skill metadata to explicitly declare required env vars (primaryEnv) and config paths so you can make an informed decision. 5) Monitor outputs written to output/aliyun-qwen-asr-realtime and avoid placing sensitive audio or transcripts there until you confirm behavior.
Capability Analysis
Type: OpenClaw Skill Name: aliyun-qwen-asr-realtime Version: 1.0.0 The skill bundle is a straightforward utility for generating request templates for Alibaba Cloud's Qwen ASR Realtime service. The Python script `scripts/prepare_realtime_asr_request.py` performs simple JSON serialization to a local file without any network activity, credential access, or suspicious execution logic. All instructions in `SKILL.md` are aligned with the stated purpose of configuring speech-to-text parameters.
Capability Assessment
Purpose & Capability
The skill's stated purpose (Qwen ASR realtime on Alibaba Cloud) aligns with the included script and docs. However, the SKILL.md requires an Alibaba credential (DASHSCOPE_API_KEY or dashscope_api_key in ~/.alibabacloud/credentials) while the registry metadata lists no required environment variables or config paths — this mismatch is unexpected and unexplained.
Instruction Scope
Runtime instructions are narrowly scoped: generate a sample JSON payload, validate the Python script, and save session payloads under an output directory. They do instruct users to provide a DASHSCOPE_API_KEY or place credentials in ~/.alibabacloud/credentials — access to that home config file is implied but not declared in metadata.
Install Mechanism
No install spec is present and included code is a small local Python helper; nothing is downloaded or installed. This is low-risk from an install perspective.
Credentials
The SKILL.md requires DASHSCOPE_API_KEY (or adding dashscope_api_key to ~/.alibabacloud/credentials), which is appropriate for an Alibaba Cloud integration. But the skill metadata does not declare that environment variable or the config path; required credentials should be declared explicitly (primaryEnv missing). The credential naming (DASHSCOPE_*) is unusual compared with standard Alibaba keys — verify it is legitimate.
Persistence & Privilege
The skill does not request permanent or elevated platform privileges (always:false). It does not attempt to modify other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install aliyun-qwen-asr-realtime
  3. After installation, invoke the skill by name or use /aliyun-qwen-asr-realtime
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release supporting Alibaba Cloud Qwen ASR Realtime speech recognition. - Provides low-latency realtime speech recognition with Qwen ASR Realtime models. - Supports use cases such as live captions, streaming microphone input, and duplex voice agents. - Exposes a normalized asr.realtime interface for request and response handling. - Includes setup, model selection, and operational usage guidance. - Requires a valid DASHSCOPE_API_KEY for operation.
Metadata
Slug aliyun-qwen-asr-realtime
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Aliyun Qwen Asr Realtime?

Use when low-latency realtime speech recognition is needed with Alibaba Cloud Model Studio Qwen ASR Realtime models, including streaming microphone input, li... It is an AI Agent Skill for Claude Code / OpenClaw, with 101 downloads so far.

How do I install Aliyun Qwen Asr Realtime?

Run "/install aliyun-qwen-asr-realtime" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Aliyun Qwen Asr Realtime free?

Yes, Aliyun Qwen Asr Realtime is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Aliyun Qwen Asr Realtime support?

Aliyun Qwen Asr Realtime is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Aliyun Qwen Asr Realtime?

It is built and maintained by cinience (@cinience); the current version is v1.0.0.

💬 Comments