← Back to Skills Marketplace
虾转音频
by
luis1213899
· GitHub ↗
· v1.3.1
· MIT-0
122
Downloads
1
Stars
1
Active Installs
15
Versions
Install in OpenClaw
/install xia-zhuan-audio
Description
🎵 音视频格式转换与处理工具箱。基于 FFmpeg + Whisper AI,支持:格式转换、视频提取音频、合并、分割、压缩、查看信息、音频转文字。
Usage Guidance
This skill appears to do what it claims: FFmpeg-based audio operations plus Whisper transcription. Before installing, ensure you have FFmpeg and Python available and be prepared for first-run downloads of Whisper models (can be hundreds of MBs to multiple GBs) from HuggingFace or a mirror you configure via HF_ENDPOINT. Note the registry metadata omitted required binaries/env — that mismatch is likely a packaging oversight; if you need strict inventory or auditing, ask the author to update the skill metadata. If you have sensitive audio, remember transcription produces local text files; run the skill in a sandbox or test environment if you want to validate behavior before using it on production data.
Capability Assessment
Purpose & Capability
Name/description state FFmpeg + Whisper and the bundled code (audio-forge.js, menu.js, transcribe.py) implements those features. However the registry metadata at the top of the report lists no required binaries or env vars while SKILL.md and the code clearly require FFmpeg, Python, and (optionally) HF_ENDPOINT and XZA_* environment variables. This is likely a packaging/metadata omission rather than malicious misalignment, but it is an inconsistency to be aware of.
Instruction Scope
SKILL.md and the code limit actions to running ffmpeg, invoking Python/transcribe.py, reading/writing local files provided by the user, and (expected) downloading Whisper models from the HuggingFace endpoint. The instructions do not ask to read unrelated system files or exfiltrate data to unknown endpoints.
Install Mechanism
There is no automated install spec; this is instruction + code bundle. The transcribe step depends on the faster-whisper Python package and will cause the Whisper model(s) to be downloaded from the HF_ENDPOINT on first run. Model downloads can be large (MBs–GBs) and are normal for this functionality; downloads come from HuggingFace by default (SKILL.md and transcribe.py default HF_ENDPOINT to https://huggingface.co).
Credentials
The skill does not request access to secrets or unrelated credentials. Declared environment variables (XZA_FFMPEG, XZA_FFPROBE, XZA_SCRIPTDIR, XZA_MODELDIR, HF_ENDPOINT) are reasonable for locating binaries, scripts, and controlling model download source. No other env-vars or credentials are accessed in the code.
Persistence & Privilege
The skill is not marked always:true and does not request elevated or persistent platform privileges. It does not modify other skills or system-wide agent settings. Autonomous invocation remains enabled by default (normal for skills) but is not combined with other concerning factors.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install xia-zhuan-audio - After installation, invoke the skill by name or use
/xia-zhuan-audio - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.3.1
修复:添加 homepage 字段指向 GitHub 源码仓库,增强技能可信度
v1.3.0
增强:转录支持json完整数据输出、视频文件直接输入、--device设备选择、--keep-audio保留临时音频、字幕时间戳毫秒精度、片段级置信度/words信息
v1.2.0
移除:um.exe解密功能及相关代码;保留纯FFmpeg转换能力;消除版权风险
v1.1.1
更新作者名为 GitHub 账号 @luis12123899
v1.1.0
优化:特殊格式功能描述去敏感化;移除明文法律声明;改为平台兼容转换描述
v1.0.9
优化:SKILL.md参考audio-toolkit格式;添加license/MIT、emoji、bins声明;精简描述;补全作者/创建日期
v1.0.8
修复:menu.js移除shell:true解决空格路径截断;run.bat改用%~dp0可移植路径;package.json版本更新
v1.0.7
修复:runFFmpeg/getDuration/cmdInfo全部改用spawn/execFile数组传参,消除shell注入风险
v1.0.6
修复:在metadata.requires.env中声明所有环境变量,消除审计不一致
v1.0.5
修复:Whisper模型默认改为官方HuggingFace源;hf-mirror.com改为用户可选配置
v1.0.4
修复:um.exe添加存在性检查并附带来源说明;Whisper模型下载添加安全说明;SKILL.md详细披露依赖来源
v1.0.3
修复:彻底移除所有硬编码路径;menu.js正确引用audio-forge.js;HF_ENDPOINT改为setdefault
v1.0.2
修复:HF_ENDPOINT改为条件设置;所有硬编码路径已移除;um.exe已确认包含
v1.0.1
修复:移除硬编码路径,改用环境变量配置;添加法律声明;完善文档
v1.0.0
音视频格式转换、提取、合并、分割、压缩、解密、音频转文字(Whisper AI)
Metadata
Frequently Asked Questions
What is 虾转音频?
🎵 音视频格式转换与处理工具箱。基于 FFmpeg + Whisper AI,支持:格式转换、视频提取音频、合并、分割、压缩、查看信息、音频转文字。 It is an AI Agent Skill for Claude Code / OpenClaw, with 122 downloads so far.
How do I install 虾转音频?
Run "/install xia-zhuan-audio" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is 虾转音频 free?
Yes, 虾转音频 is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does 虾转音频 support?
虾转音频 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created 虾转音频?
It is built and maintained by luis1213899 (@luis1213899); the current version is v1.3.1.
More Skills