← 返回 Skills 市场

Qwen ASR

Name: Qwen ASR
Author: huanglizhuo

作者 lizhuo · GitHub ↗ · v0.1.2 · MIT-0

darwinlinux ⚠ suspicious

292

总下载

当前安装

版本数

在 OpenClaw 中安装

/install qwen-asr-local

功能描述

Local speech-to-text using Qwen3-ASR (CPU-only, no API key, no cloud). Use when: (1) a voice message or audio file needs transcription, (2) user asks to tran...

安全使用建议

This skill appears consistent with its stated purpose. Before installing: (1) review the install script and the GitHub release it downloads from and only run it if you trust that repo (prebuilt native binaries execute with your user privileges); (2) expect ~1.5 GB model download and network access to GitHub/HuggingFace; (3) ffmpeg is required for non‑WAV inputs; (4) install writes to ~/.local/bin and ~/.openclaw/tools/qwen-asr — you may need to add ~/.local/bin to your PATH; (5) if you need fully air-gapped/local operation, verify the model is cached locally or that the model download will not require a HuggingFace token. Otherwise the skill is internally coherent and proportionate.

功能分析

Type: OpenClaw Skill Name: qwen-asr-local Version: 0.1.2 The skill provides local speech-to-text by downloading a pre-compiled binary from a third-party GitHub repository (huanglizhuo/QwenASR) and a 1.5GB model from HuggingFace. While these actions are aligned with the stated purpose of the tool, downloading and executing unverified binaries from external sources is a high-risk capability. The scripts (install.sh and transcribe.sh) are functionally sound and do not show signs of intentional malice, data exfiltration, or prompt injection.

能力评估

✓ Purpose & Capability

Name/description (local Qwen3-ASR CPU transcription) match the declared requirement (qwen-asr binary) and the included scripts. No unrelated environment variables, credentials, or unusual binaries are requested.

✓ Instruction Scope

SKILL.md and transcribe.sh only describe running the qwen-asr binary (locally) and converting audio via ffmpeg when needed. The install script downloads a release and the model and writes them under the user's home; scripts do not read unrelated system files or exfiltrate data. The only optional env var is QWEN_ASR_MODEL_DIR to override the model path.

✓ Install Mechanism

Install script fetches a prebuilt release from the project's GitHub Releases and extracts it to ~/.local/bin; model download is performed by qwen-asr (presumably fetching from HuggingFace). Using GitHub Releases and the model download command is expected for this purpose; no obscure/shortened URLs or third-party personal servers are used.

✓ Credentials

No secrets or extra environment variables are required. The only environment interaction is an optional QWEN_ASR_MODEL_DIR and use of PATH/ffmpeg. Note: model download requires network access and some HuggingFace-hosted models may require authentication in other contexts, but no credential is requested by this skill.

✓ Persistence & Privilege

Skill is not forcible (always:false) and does not modify other skills or system-wide agent settings. It writes binaries/models into the user's home directories (standard for local tools) but does not request elevated privileges.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install qwen-asr-local
安装完成后，直接呼叫该 Skill 的名称或使用 /qwen-asr-local 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v0.1.2

- Added source code and original implementation links for QwenASR and qwen-asr in documentation. - No changes to functionality or usage.

v0.1.1

- Added support for automatic audio format conversion via ffmpeg when using the transcribe.sh script. - Now accepts a wider range of audio formats (wav, mp3, m4a, ogg, flac, opus, webm, aac, etc.) for transcription. - Clarified that direct qwen-asr command works with WAV files only. - Updated usage instructions to reflect these changes.

v0.1.0

Initial release of qwen-asr-local: local, offline speech-to-text using Qwen3-ASR. - Runs entirely on CPU with no API key or cloud required. - Supports transcription of audio files, voice messages, and input from stdin. - Provides segmented and real-time streaming transcription modes. - Compatible with macOS and Linux only. - Simple installation via provided script; downloads pre-built binary and model files.

元数据

Slug qwen-asr-local

版本 0.1.2

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 3

常见问题

Qwen ASR 是什么？

Local speech-to-text using Qwen3-ASR (CPU-only, no API key, no cloud). Use when: (1) a voice message or audio file needs transcription, (2) user asks to tran... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 292 次。

如何安装 Qwen ASR？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install qwen-asr-local」即可一键安装，无需额外配置。

Qwen ASR 是免费的吗？

是的，Qwen ASR 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Qwen ASR 支持哪些平台？

Qwen ASR 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux）。

谁开发了 Qwen ASR？

由 lizhuo（@huanglizhuo）开发并维护，当前版本 v0.1.2。