← Back to Skills Marketplace
Qwen ASR (C-based Offline)
by
rightister
· GitHub ↗
· v1.0.0
· MIT-0
267
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install rightister-qwen-asr
Description
Offline Chinese and mixed Chinese-English speech-to-text recognition in pure C without Python or FFmpeg dependencies, suitable for edge devices.
Usage Guidance
Key points before installing/using:
- The packaging is inconsistent: SKILL.md says 'no FFmpeg' but run.sh calls ffmpeg. Expect to need system tools: git, make, a C toolchain, BLAS (OpenBLAS/MKL), and ffmpeg.
- The skill does not auto-download models: you must run the upstream download_model.sh manually (this requires internet and TTY). After that, inference can be offline. Verify the model source and license before downloading (~1.7–4.5GB each).
- The runner compiles and executes code from ~/.openclaw/workspace/qwen-asr. Only proceed if you trust the upstream repository (https://github.com/antirez/qwen-asr) — inspect its code and the download script (download_model.sh) yourself.
- Run builds in a sandbox/container or non-privileged account if possible, and do not run as root. Check disk space and memory requirements before use.
- Because metadata omitted required binaries and contradicted the README, treat this as a poorly packaged integration rather than clearly malicious — but verify upstream sources and inspect scripts before running them.
Capability Analysis
Type: OpenClaw Skill
Name: rightister-qwen-asr
Version: 1.0.0
The skill is a legitimate wrapper for the qwen-asr C implementation by antirez, providing offline Chinese speech-to-text capabilities. The execution script (scripts/run.sh) handles audio preprocessing via FFmpeg and manages model inference safely, requiring manual user intervention for repository cloning and model downloads rather than performing automated remote execution. While the script contains a minor functional bug in temporary directory creation (mktemp usage), it lacks any indicators of malicious intent, data exfiltration, or prompt injection.
Capability Assessment
Purpose & Capability
The skill's stated purpose (offline C ASR suitable for edge) is plausible, but the package metadata lists no required binaries or env vars while the SKILL.md and run.sh clearly rely on external artifacts: a cloned repository (~~/.openclaw/workspace/qwen-asr), a compiled qwen_asr binary (make blas), and model files (~1.7–4.5GB). The SKILL.md also inconsistently claims 'no FFmpeg dependencies' while the runner invokes ffmpeg for preprocessing. These mismatches are disproportionate to the simple description and suggest sloppy or inconsistent packaging.
Instruction Scope
The included run.sh performs several runtime actions outside a narrow 'just run local binary' scope: it expects a repo in $HOME/.openclaw/workspace (or instructs the user to git clone it), may invoke make to compile code on the host, calls ffmpeg to transcode audio, and requires the user to run a model download script (which needs network and TTY). The script does not exfiltrate credentials or data, but it does rely on network access (for model download) and mutates the user's workspace. The SKILL.md's claim of fully offline inference is only true after a one-time download step, which is not automated.
Install Mechanism
There is no formal install spec (instruction-only), which minimizes automated risk, but the runner expects the user to clone and build upstream code and to have system tools (git, make, C compiler, ffmpeg, BLAS) available. The model download is left to the user (manual git/./download_model.sh), reducing silent remote downloads, but building unknown code on-device is a real risk if you don't trust the upstream repository.
Credentials
The skill requests no environment variables or credentials, and the run script does not attempt to read secret env vars or other unrelated config files. The lack of credential requests is appropriate for an offline ASR skill; the only external requirement is the model download (no credentials shown).
Persistence & Privilege
The skill is not force-enabled (always:false) and does not modify system-wide configs. It operates on files under $HOME/.openclaw/workspace and temporary WAV files; it does not request elevated privileges or persist credentials. Autonomous invocation is allowed by platform default but is not combined with additional concerning privileges here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install rightister-qwen-asr - After installation, invoke the skill by name or use
/rightister-qwen-asr - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of rightister-qwen-asr.
- Provides offline Chinese (and bilingual) speech-to-text using the qwen3-asr-0.6b model with a pure C implementation.
- No Python, GIL, or external FFmpeg dependencies; suitable for edge deployment.
- Supports major platforms: macOS (Accelerate), Linux (OpenBLAS/MKL).
- Audio preprocessing auto-converts files to 16kHz mono WAV; supports `.ogg`, `.mp3`, `.wav`.
- Model selection (0.6B/1.7B) and thread count are configurable.
- Purely offline inference after model download; no internet needed.
- Outputs plain Chinese or mixed Chinese-English transcription.
Metadata
Frequently Asked Questions
What is Qwen ASR (C-based Offline)?
Offline Chinese and mixed Chinese-English speech-to-text recognition in pure C without Python or FFmpeg dependencies, suitable for edge devices. It is an AI Agent Skill for Claude Code / OpenClaw, with 267 downloads so far.
How do I install Qwen ASR (C-based Offline)?
Run "/install rightister-qwen-asr" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Qwen ASR (C-based Offline) free?
Yes, Qwen ASR (C-based Offline) is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Qwen ASR (C-based Offline) support?
Qwen ASR (C-based Offline) is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Qwen ASR (C-based Offline)?
It is built and maintained by rightister (@rightister); the current version is v1.0.0.
More Skills