← Back to Skills Marketplace
812
Downloads
2
Stars
0
Active Installs
2
Versions
Install in OpenClaw
/install video-captions
Description
Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.
Usage Guidance
This skill appears coherent for captioning tasks, but review these before using: 1) It expects ffmpeg and a Whisper CLI on PATH — ensure you trust those binaries and their versions. 2) The docs recommend pip installs (openai-whisper, whisper-timestamped, stable-ts, etc.); install packages from trusted sources and consider a virtualenv or container. 3) Local processing is the default (best for privacy). Only supply cloud API keys (AssemblyAI / Deepgram) if you consent to sending video/audio to those services. 4) Example ffmpeg burn-in commands depend on fonts and may require installing or mapping fonts on your system. 5) If you plan to let an autonomous agent run this skill, be aware it could process any video files the agent is given — limit scope and run in an isolated environment if you have sensitive content.
Capability Analysis
Type: OpenClaw Skill
Name: video-captions
Version: 1.0.1
The skill bundle is designed for legitimate video captioning tasks, utilizing tools like `ffmpeg` and `whisper`. It explicitly states that processing is local by default and cloud APIs are opt-in, requiring user-provided keys, which mitigates concerns about hidden data exfiltration. However, the extensive use of `ffmpeg` and `whisper` commands across `SKILL.md`, `engines.md`, `formats.md`, `platforms.md`, and `styling.md` introduces a significant shell injection vulnerability risk. If the AI agent constructs these commands using unsanitized user input (e.g., for filenames or style parameters), a malicious user could execute arbitrary commands on the host system. While there is no evidence of intentional malicious behavior within the skill's instructions or code, this critical vulnerability potential warrants a 'suspicious' classification.
Capability Assessment
Purpose & Capability
Name/description match the actual instructions: transcription, timing, styling, and burn-in. Required binaries (ffmpeg, whisper) are appropriate. Optional cloud API keys (AssemblyAI, Deepgram) are declared in metadata and are relevant to higher-accuracy cloud engines.
Instruction Scope
SKILL.md contains concrete shell commands (whisper, whisper_timestamped, ffmpeg, curl for Deepgram) that operate on user video files and outputs captions/subtitled video. It does not instruct the agent to read unrelated system files or exfiltrate unrelated secrets. It does recommend installing Python packages (pip), which is expected for local engines but should be performed by the user in a trusted environment.
Install Mechanism
There is no install spec (instruction-only), which is low-risk. However engines.md shows example pip install commands for third-party Python packages—these are not auto-run by the skill but are a user action that can pull arbitrary code. Users should vet package sources and install in isolated environments if concerned.
Credentials
The skill does not require any environment variables by default. The metadata lists optional API keys (ASSEMBLYAI_API_KEY, DEEPGRAM_API_KEY) that are directly relevant to the cloud transcription options shown. No unrelated credentials or config paths are requested.
Persistence & Privilege
Skill is not always-enabled and does not request elevated or persistent system configuration. disable-model-invocation is false (normal), meaning the agent can call the skill autonomously — this is expected for skills and is not excessive here.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install video-captions - After installation, invoke the skill by name or use
/video-captions - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.1
Declared optional cloud API env vars in metadata to clarify that cloud engines require user-provided keys
v1.0.0
Initial release
Metadata
Frequently Asked Questions
What is Video Captions?
Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in. It is an AI Agent Skill for Claude Code / OpenClaw, with 812 downloads so far.
How do I install Video Captions?
Run "/install video-captions" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Video Captions free?
Yes, Video Captions is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Video Captions support?
Video Captions is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin).
Who created Video Captions?
It is built and maintained by Iván (@ivangdavila); the current version is v1.0.1.
More Skills