← Back to Skills Marketplace
video-subtitle-skill
by
Wan Shuaibing
· GitHub ↗
· v1.0.0
· MIT-0
313
Downloads
0
Stars
3
Active Installs
1
Versions
Install in OpenClaw
/install video-subtitle-skill
Description
为视频/音频自动生成字幕,支持多语言识别、翻译、说话人分离、字幕烧入视频
Usage Guidance
This skill largely does what it claims (extract audio, call SenseAudio, write SRT/VTT/TXT, optionally burn subtitles), but there are important mismatches you should address before use: 1) The package metadata does not declare the required SENSEAUDIO_API_KEY or ffmpeg/ffprobe even though the script needs them — treat that as an omission, not a feature. 2) The SKILL.md echoes the API key to stdout which can leak the secret to logs or other observers; remove that echo or avoid running it. 3) Review the included scripts (scripts/video_subtitle.py) for any code paths that post data to endpoints you don't expect (the script appears to call only https://api.senseaudio.cn). 4) Run the skill in an isolated environment (or with an API key with limited quota) until you confirm behavior. 5) If you plan to install, ensure ffmpeg/ffprobe are present and supply the SENSEAUDIO_API_KEY securely (do not paste it into chat or run the provided echo). If you want higher confidence, ask the author to update metadata to declare the env var/binaries and to remove the secret-echoing line.
Capability Analysis
Type: OpenClaw Skill
Name: video-subtitle-skill
Version: 1.0.0
The skill bundle is a legitimate tool for generating video subtitles via the SenseAudio ASR API. The core logic in `scripts/video_subtitle.py` uses `ffmpeg` for media processing and the `requests` library to interact with the specified API endpoint (api.senseaudio.cn). No evidence of data exfiltration, malicious execution, or harmful prompt injection was found; the instructions in `SKILL.md` correctly guide the agent to perform ASR tasks and summarize results locally.
Capability Assessment
Purpose & Capability
The skill's description and code use the SenseAudio ASR API and therefore require an API key and ffmpeg/ffprobe, but the registry metadata declares no required environment variables or required binaries. That omission is inconsistent: a subtitle generator that calls an external ASR certainly needs an API key and system media tools.
Instruction Scope
Runtime instructions tell the agent to run the included Python script (expected) and to read the generated text file for summarization (expected). However the SKILL.md first step prints the API key with echo (echo "SENSEAUDIO_API_KEY=$SENSEAUDIO_API_KEY") which can expose the secret in logs/outputs — this is an unnecessary secret leak. Instructions otherwise stay within the stated purpose and reference only the input file and generated outputs.
Install Mechanism
There is no install spec (instruction-only), which minimizes installer risk. However the skill ships an executable script (scripts/video_subtitle.py) but does not declare any installation steps for Python dependencies beyond documenting 'requests' and system deps in README/USAGE — this is a minor inconsistency (code is present but no install automation).
Credentials
The code requires a single external credential (SENSEAUDIO_API_KEY) and expects ffmpeg/ffprobe on PATH, but the registry metadata lists no required env vars or binaries. Requiring an API key is proportionate to the purpose, but failing to declare it in metadata is a security/operational gap. Additionally, the SKILL.md's practice of echoing the API key risks exposing it; this is disproportionate and unnecessary.
Persistence & Privilege
The skill does not request always:true, does not declare edits to other skill configs, and has no special persistence or elevated privileges. It runs as a one-off script and outputs local files.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install video-subtitle-skill - After installation, invoke the skill by name or use
/video-subtitle-skill - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
video-subtitle-skill v1.0.0
- 首次发布:基于 SenseAudio ASR API 的自动视频/音频字幕生成工具
- 支持多语言识别、字幕翻译、说话人分离、字幕烧入视频
- 多种字幕输出格式:SRT / VTT / TXT / JSON
- 提供详细使用文档及常见参数说明
- 使用环境 API 密钥验证,明确指引获取方法
Metadata
Frequently Asked Questions
What is video-subtitle-skill?
为视频/音频自动生成字幕,支持多语言识别、翻译、说话人分离、字幕烧入视频. It is an AI Agent Skill for Claude Code / OpenClaw, with 313 downloads so far.
How do I install video-subtitle-skill?
Run "/install video-subtitle-skill" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is video-subtitle-skill free?
Yes, video-subtitle-skill is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does video-subtitle-skill support?
video-subtitle-skill is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created video-subtitle-skill?
It is built and maintained by Wan Shuaibing (@qwerty0205); the current version is v1.0.0.
More Skills