← 返回 Skills 市场
Fun-ASR-File
作者
chenggongdu
· GitHub ↗
· v1.1.0
· MIT-0
123
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install fun-asr-file
功能描述
阿里云百炼 FunASR 本地音频文件识别(非流式),使用阿里云 DashScope API 进行语音转文字。针对本地音频文件优化,支持自动格式转换,适合批量文件转写场景。
使用说明 (SKILL.md)
Fun-ASR-File
Fun-ASR 是通义实验室百聆团队推出的端到端语音识别大模型,是基于数千万小时真实语音数据训练而成,具备强大的上下文理解能力与行业适应性。
激活条件
| 触发场景 | 说明 |
|---|---|
| 用户发送音频文件 | .wav / .mp3 / .m4a / .flac / .ogg 等格式 |
| 用户要求转录 | "转写音频"、"语音转文字" |
| 音频文件处理 | 需要提取音频中的文字内容 |
配置
设置环境变量:
export DASHSCOPE_API_KEY="sk-xxx"
使用方法
安装依赖
pip install dashscope
# 如需自动格式转换,请安装 FFmpeg
转写音频文件
python scripts/cli.py audio.wav
# 支持 .wav / .mp3 / .m4a / .flac / .ogg 等格式
# 会自动转换为 API 要求的格式(16kHz, 单声道, pcm_s16le)
版本:1.1.0 创建于:2026-03-16 更新:2026-04-09 - 改为非流式调用,优化本地文件处理
安全使用建议
Before installing or using this skill:
- Expect to provide an Aliyun DashScope API key (DASHSCOPE_API_KEY). The registry metadata omitted this; verify the publisher and update metadata before trusting the skill.
- Treat the API key as a secret. Use a least-privilege key and avoid pasting long-lived account keys unless you control the account and understand DashScope billing/auth.
- The skill requires the dashscope Python package and FFmpeg. Install packages from trusted sources and verify dashscope's provenance (PyPI project page, source repo) before pip installing.
- NOTES.md mentions integration with a 'tikhub-douyin-media-links' skill that would download media — if you plan to chain skills, be aware that downloaded content and the chain may transmit data to external services.
- If you need higher assurance, request that the publisher correct the registry metadata to declare DASHSCOPE_API_KEY as a required credential and provide a homepage or source repo for review; run the skill in a sandboxed environment first.
功能分析
Type: OpenClaw Skill
Name: fun-asr-file
Version: 1.1.0
The skill is a legitimate implementation for audio-to-text transcription using Alibaba Cloud's DashScope API. The core logic in `scripts/cli.py` uses the official `dashscope` SDK to process local files and correctly handles API responses without any evidence of data exfiltration, unauthorized network calls, or malicious command execution.
能力评估
Purpose & Capability
The name/description (local ASR via FunASR / DashScope) match the code and SKILL.md: the script calls the DashScope ASR API to transcribe local files and recommends FFmpeg for preprocessing. Functionality requested (local file transcription) is coherent with the implementation.
Instruction Scope
SKILL.md and scripts/cli.py limit actions to reading a user-supplied audio file, preprocessing instructions (FFmpeg), and sending the file to DashScope; they do not attempt to read unrelated system files. NOTES.md documents optional interaction with another skill (tikhub-douyin-media-links) for fetching media links — that implies possible cross-skill network download workflows if combined, but the skill itself does not autonomously perform such downloads.
Install Mechanism
This is instruction-only (no install spec). It requires the dashscope Python SDK and recommends FFmpeg; both are typical for this task. No suspicious external download URLs or archive extraction are present. The lack of an install spec means dependencies would be installed manually by the operator, which reduces automatic risk but requires the user to trust the dashscope package source.
Credentials
The SKILL.md and scripts/cli.py require DASHSCOPE_API_KEY (dashscope.api_key = os.environ.get('DASHSCOPE_API_KEY')), but the skill metadata declared no required environment variables or primary credential. That inconsistency is concerning: the runtime needs an API key but the registry entry does not advertise it. No other unrelated credentials are requested.
Persistence & Privilege
always:false and the code does not modify agent/system configuration or other skills. The skill runs only when invoked and does not request persistent elevated privileges.
如何使用
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install fun-asr-file - 安装完成后,直接呼叫该 Skill 的名称或使用
/fun-asr-file触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.1.0
改为非流式调用,优化本地音频文件处理,支持自动格式转换
元数据
常见问题
Fun-ASR-File 是什么?
阿里云百炼 FunASR 本地音频文件识别(非流式),使用阿里云 DashScope API 进行语音转文字。针对本地音频文件优化,支持自动格式转换,适合批量文件转写场景。 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 123 次。
如何安装 Fun-ASR-File?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install fun-asr-file」即可一键安装,无需额外配置。
Fun-ASR-File 是免费的吗?
是的,Fun-ASR-File 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Fun-ASR-File 支持哪些平台?
Fun-ASR-File 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Fun-ASR-File?
由 chenggongdu(@chenggongdu)开发并维护,当前版本 v1.1.0。
推荐 Skills