← 返回 Skills 市场
222
总下载
0
收藏
2
当前安装
1
版本数
在 OpenClaw 中安装
/install fun-asr
功能描述
阿里云百炼 FunASR 录音文件识别,使用阿里云 DashScope API 进行语音转文字。当用户需要转录音频文件时触发。
使用说明 (SKILL.md)
Fun-ASR
Fun-ASR 是通义实验室百聆团队推出的端到端语音识别大模型,是基于数千万小时真实语音数据训练而成,具备强大的上下文理解能力与行业适应性。
激活条件
| 触发场景 | 说明 |
|---|---|
| 用户发送音频文件 | .wav / .mp3 / .m4a / .flac / .ogg 等格式 |
| 用户要求转录 | "转写音频"、"语音转文字" |
| 音频文件处理 | 需要提取音频中的文字内容 |
配置
设置环境变量:
export DASHSCOPE_API_KEY="sk-xxx"
使用方法
安装依赖
pip install dashscope librosa
转写音频文件
python scripts/cli.py audio.wav
版本:1.0.0 创建于:2026-03-16
安全使用建议
This skill appears to be a small client for Aliyun DashScope ASR and will send any audio you run through it to DashScope's service (wss://dashscope.aliyuncs.com). Before installing or running: 1) Treat DASHSCOPE_API_KEY as a secret — the registry metadata failed to declare it, so do not paste keys into unfamiliar installs. 2) Only use a minimally scoped or ephemeral API key and avoid transcribing sensitive audio. 3) The package includes .claude/settings.local.json permitting WebFetch to bailian.console.aliyun.com (an unrelated Aliyun console domain) — consider removing or reviewing that file if you don't want the skill to request web access. 4) Verify the source/owner since source/homepage are unknown; inspect the dashscope Python package and network endpoints you will be talking to. 5) If you plan to install, run in an isolated environment (virtualenv/container) and confirm the DASHSCOPE API key usage is acceptable.
功能分析
Type: OpenClaw Skill
Name: fun-asr
Version: 1.0.0
The skill bundle is a legitimate implementation for transcribing audio files using the Alibaba DashScope FunASR API. The Python script (scripts/cli.py) correctly handles audio processing via librosa and communicates with the official Alibaba WebSocket endpoint (dashscope.aliyuncs.com) without any signs of data exfiltration, malicious execution, or prompt injection.
能力评估
Purpose & Capability
The code and SKILL.md implement an Aliyun DashScope-based ASR client (reads an audio file and sends PCM to a DashScope websocket). That matches the skill description. However registry metadata lists no required environment variables while both SKILL.md and the script require DASHSCOPE_API_KEY, an omission that is incoherent.
Instruction Scope
Runtime instructions are limited to installing two Python packages, setting DASHSCOPE_API_KEY, and running scripts/cli.py on an audio file. The script only reads the provided audio file and sends audio frames to DashScope; it does not attempt to read unrelated files, other credentials, or system state.
Install Mechanism
There is no install spec in the registry (instruction-only), and the SKILL.md instructs pip installing dashscope and librosa. No downloads from arbitrary URLs or archive extraction are present. The only code file is a small CLI script that uses those libraries.
Credentials
The script and SKILL.md require DASHSCOPE_API_KEY (used as dashscope.api_key), but the skill's declared required env vars list is empty — this mismatch could mislead users about secret requirements. Also the included .claude/settings.local.json grants WebFetch to bailian.console.aliyun.com which is unrelated to the runtime websocket endpoint (wss://dashscope.aliyuncs.com) and is not explained.
Persistence & Privilege
The skill is not marked always:true and does not request persistent system-wide privileges. It does not modify other skills' configs. Including a settings.local.json is benign but its allowed WebFetch domain is unexpected.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install fun-asr - 安装完成后,直接呼叫该 Skill 的名称或使用
/fun-asr触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release of fun-asr skill:
- Integrates Aliyun DashScope API for automatic speech recognition of audio files.
- Supports various audio formats including .wav, .mp3, .m4a, .flac, and .ogg.
- Automatically activates when users request audio transcription or send audio files.
- Provides setup instructions, including API key configuration and Python dependency installation.
- Includes command-line usage example for transcribing audio files.
元数据
常见问题
Fun-ASR speech recognition 是什么?
阿里云百炼 FunASR 录音文件识别,使用阿里云 DashScope API 进行语音转文字。当用户需要转录音频文件时触发。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 222 次。
如何安装 Fun-ASR speech recognition?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install fun-asr」即可一键安装,无需额外配置。
Fun-ASR speech recognition 是免费的吗?
是的,Fun-ASR speech recognition 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Fun-ASR speech recognition 支持哪些平台?
Fun-ASR speech recognition 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Fun-ASR speech recognition?
由 彭震东(@pengzhendong)开发并维护,当前版本 v1.0.0。
推荐 Skills