← 返回 Skills 市场
huanglizhuo

Qwen ASR

作者 lizhuo · GitHub ↗ · v0.1.2 · MIT-0
darwinlinux ⚠ suspicious
292
总下载
0
收藏
0
当前安装
3
版本数
在 OpenClaw 中安装
/install qwen-asr-local
功能描述
Local speech-to-text using Qwen3-ASR (CPU-only, no API key, no cloud). Use when: (1) a voice message or audio file needs transcription, (2) user asks to tran...
安全使用建议
This skill appears consistent with its stated purpose. Before installing: (1) review the install script and the GitHub release it downloads from and only run it if you trust that repo (prebuilt native binaries execute with your user privileges); (2) expect ~1.5 GB model download and network access to GitHub/HuggingFace; (3) ffmpeg is required for non‑WAV inputs; (4) install writes to ~/.local/bin and ~/.openclaw/tools/qwen-asr — you may need to add ~/.local/bin to your PATH; (5) if you need fully air-gapped/local operation, verify the model is cached locally or that the model download will not require a HuggingFace token. Otherwise the skill is internally coherent and proportionate.
功能分析
Type: OpenClaw Skill Name: qwen-asr-local Version: 0.1.2 The skill provides local speech-to-text by downloading a pre-compiled binary from a third-party GitHub repository (huanglizhuo/QwenASR) and a 1.5GB model from HuggingFace. While these actions are aligned with the stated purpose of the tool, downloading and executing unverified binaries from external sources is a high-risk capability. The scripts (install.sh and transcribe.sh) are functionally sound and do not show signs of intentional malice, data exfiltration, or prompt injection.
能力评估
Purpose & Capability
Name/description (local Qwen3-ASR CPU transcription) match the declared requirement (qwen-asr binary) and the included scripts. No unrelated environment variables, credentials, or unusual binaries are requested.
Instruction Scope
SKILL.md and transcribe.sh only describe running the qwen-asr binary (locally) and converting audio via ffmpeg when needed. The install script downloads a release and the model and writes them under the user's home; scripts do not read unrelated system files or exfiltrate data. The only optional env var is QWEN_ASR_MODEL_DIR to override the model path.
Install Mechanism
Install script fetches a prebuilt release from the project's GitHub Releases and extracts it to ~/.local/bin; model download is performed by qwen-asr (presumably fetching from HuggingFace). Using GitHub Releases and the model download command is expected for this purpose; no obscure/shortened URLs or third-party personal servers are used.
Credentials
No secrets or extra environment variables are required. The only environment interaction is an optional QWEN_ASR_MODEL_DIR and use of PATH/ffmpeg. Note: model download requires network access and some HuggingFace-hosted models may require authentication in other contexts, but no credential is requested by this skill.
Persistence & Privilege
Skill is not forcible (always:false) and does not modify other skills or system-wide agent settings. It writes binaries/models into the user's home directories (standard for local tools) but does not request elevated privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install qwen-asr-local
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /qwen-asr-local 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v0.1.2
- Added source code and original implementation links for QwenASR and qwen-asr in documentation. - No changes to functionality or usage.
v0.1.1
- Added support for automatic audio format conversion via ffmpeg when using the transcribe.sh script. - Now accepts a wider range of audio formats (wav, mp3, m4a, ogg, flac, opus, webm, aac, etc.) for transcription. - Clarified that direct qwen-asr command works with WAV files only. - Updated usage instructions to reflect these changes.
v0.1.0
Initial release of qwen-asr-local: local, offline speech-to-text using Qwen3-ASR. - Runs entirely on CPU with no API key or cloud required. - Supports transcription of audio files, voice messages, and input from stdin. - Provides segmented and real-time streaming transcription modes. - Compatible with macOS and Linux only. - Simple installation via provided script; downloads pre-built binary and model files.
元数据
Slug qwen-asr-local
版本 0.1.2
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 3
常见问题

Qwen ASR 是什么?

Local speech-to-text using Qwen3-ASR (CPU-only, no API key, no cloud). Use when: (1) a voice message or audio file needs transcription, (2) user asks to tran... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 292 次。

如何安装 Qwen ASR?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install qwen-asr-local」即可一键安装,无需额外配置。

Qwen ASR 是免费的吗?

是的,Qwen ASR 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Qwen ASR 支持哪些平台?

Qwen ASR 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(darwin, linux)。

谁开发了 Qwen ASR?

由 lizhuo(@huanglizhuo)开发并维护,当前版本 v0.1.2。

💬 留言讨论