← 返回 Skills 市场
Qwen ASR (C-based Offline)
作者
rightister
· GitHub ↗
· v1.0.0
· MIT-0
267
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install rightister-qwen-asr
功能描述
Offline Chinese and mixed Chinese-English speech-to-text recognition in pure C without Python or FFmpeg dependencies, suitable for edge devices.
安全使用建议
Key points before installing/using:
- The packaging is inconsistent: SKILL.md says 'no FFmpeg' but run.sh calls ffmpeg. Expect to need system tools: git, make, a C toolchain, BLAS (OpenBLAS/MKL), and ffmpeg.
- The skill does not auto-download models: you must run the upstream download_model.sh manually (this requires internet and TTY). After that, inference can be offline. Verify the model source and license before downloading (~1.7–4.5GB each).
- The runner compiles and executes code from ~/.openclaw/workspace/qwen-asr. Only proceed if you trust the upstream repository (https://github.com/antirez/qwen-asr) — inspect its code and the download script (download_model.sh) yourself.
- Run builds in a sandbox/container or non-privileged account if possible, and do not run as root. Check disk space and memory requirements before use.
- Because metadata omitted required binaries and contradicted the README, treat this as a poorly packaged integration rather than clearly malicious — but verify upstream sources and inspect scripts before running them.
功能分析
Type: OpenClaw Skill
Name: rightister-qwen-asr
Version: 1.0.0
The skill is a legitimate wrapper for the qwen-asr C implementation by antirez, providing offline Chinese speech-to-text capabilities. The execution script (scripts/run.sh) handles audio preprocessing via FFmpeg and manages model inference safely, requiring manual user intervention for repository cloning and model downloads rather than performing automated remote execution. While the script contains a minor functional bug in temporary directory creation (mktemp usage), it lacks any indicators of malicious intent, data exfiltration, or prompt injection.
能力评估
Purpose & Capability
The skill's stated purpose (offline C ASR suitable for edge) is plausible, but the package metadata lists no required binaries or env vars while the SKILL.md and run.sh clearly rely on external artifacts: a cloned repository (~~/.openclaw/workspace/qwen-asr), a compiled qwen_asr binary (make blas), and model files (~1.7–4.5GB). The SKILL.md also inconsistently claims 'no FFmpeg dependencies' while the runner invokes ffmpeg for preprocessing. These mismatches are disproportionate to the simple description and suggest sloppy or inconsistent packaging.
Instruction Scope
The included run.sh performs several runtime actions outside a narrow 'just run local binary' scope: it expects a repo in $HOME/.openclaw/workspace (or instructs the user to git clone it), may invoke make to compile code on the host, calls ffmpeg to transcode audio, and requires the user to run a model download script (which needs network and TTY). The script does not exfiltrate credentials or data, but it does rely on network access (for model download) and mutates the user's workspace. The SKILL.md's claim of fully offline inference is only true after a one-time download step, which is not automated.
Install Mechanism
There is no formal install spec (instruction-only), which minimizes automated risk, but the runner expects the user to clone and build upstream code and to have system tools (git, make, C compiler, ffmpeg, BLAS) available. The model download is left to the user (manual git/./download_model.sh), reducing silent remote downloads, but building unknown code on-device is a real risk if you don't trust the upstream repository.
Credentials
The skill requests no environment variables or credentials, and the run script does not attempt to read secret env vars or other unrelated config files. The lack of credential requests is appropriate for an offline ASR skill; the only external requirement is the model download (no credentials shown).
Persistence & Privilege
The skill is not force-enabled (always:false) and does not modify system-wide configs. It operates on files under $HOME/.openclaw/workspace and temporary WAV files; it does not request elevated privileges or persist credentials. Autonomous invocation is allowed by platform default but is not combined with additional concerning privileges here.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install rightister-qwen-asr - 安装完成后,直接呼叫该 Skill 的名称或使用
/rightister-qwen-asr触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of rightister-qwen-asr.
- Provides offline Chinese (and bilingual) speech-to-text using the qwen3-asr-0.6b model with a pure C implementation.
- No Python, GIL, or external FFmpeg dependencies; suitable for edge deployment.
- Supports major platforms: macOS (Accelerate), Linux (OpenBLAS/MKL).
- Audio preprocessing auto-converts files to 16kHz mono WAV; supports `.ogg`, `.mp3`, `.wav`.
- Model selection (0.6B/1.7B) and thread count are configurable.
- Purely offline inference after model download; no internet needed.
- Outputs plain Chinese or mixed Chinese-English transcription.
元数据
常见问题
Qwen ASR (C-based Offline) 是什么?
Offline Chinese and mixed Chinese-English speech-to-text recognition in pure C without Python or FFmpeg dependencies, suitable for edge devices. 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 267 次。
如何安装 Qwen ASR (C-based Offline)?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install rightister-qwen-asr」即可一键安装,无需额外配置。
Qwen ASR (C-based Offline) 是免费的吗?
是的,Qwen ASR (C-based Offline) 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Qwen ASR (C-based Offline) 支持哪些平台?
Qwen ASR (C-based Offline) 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Qwen ASR (C-based Offline)?
由 rightister(@rightister)开发并维护,当前版本 v1.0.0。
推荐 Skills