← Back to Skills Marketplace

Video Captions

Name: Video Captions
Author: ivangdavila

by Iván · GitHub ↗ · v1.0.1

linuxdarwin ⚠ suspicious

812

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install video-captions

Description

Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in.

Usage Guidance

This skill appears coherent for captioning tasks, but review these before using: 1) It expects ffmpeg and a Whisper CLI on PATH — ensure you trust those binaries and their versions. 2) The docs recommend pip installs (openai-whisper, whisper-timestamped, stable-ts, etc.); install packages from trusted sources and consider a virtualenv or container. 3) Local processing is the default (best for privacy). Only supply cloud API keys (AssemblyAI / Deepgram) if you consent to sending video/audio to those services. 4) Example ffmpeg burn-in commands depend on fonts and may require installing or mapping fonts on your system. 5) If you plan to let an autonomous agent run this skill, be aware it could process any video files the agent is given — limit scope and run in an isolated environment if you have sensitive content.

Capability Analysis

Type: OpenClaw Skill Name: video-captions Version: 1.0.1 The skill bundle is designed for legitimate video captioning tasks, utilizing tools like `ffmpeg` and `whisper`. It explicitly states that processing is local by default and cloud APIs are opt-in, requiring user-provided keys, which mitigates concerns about hidden data exfiltration. However, the extensive use of `ffmpeg` and `whisper` commands across `SKILL.md`, `engines.md`, `formats.md`, `platforms.md`, and `styling.md` introduces a significant shell injection vulnerability risk. If the AI agent constructs these commands using unsanitized user input (e.g., for filenames or style parameters), a malicious user could execute arbitrary commands on the host system. While there is no evidence of intentional malicious behavior within the skill's instructions or code, this critical vulnerability potential warrants a 'suspicious' classification.

Capability Assessment

✓ Purpose & Capability

Name/description match the actual instructions: transcription, timing, styling, and burn-in. Required binaries (ffmpeg, whisper) are appropriate. Optional cloud API keys (AssemblyAI, Deepgram) are declared in metadata and are relevant to higher-accuracy cloud engines.

✓ Instruction Scope

SKILL.md contains concrete shell commands (whisper, whisper_timestamped, ffmpeg, curl for Deepgram) that operate on user video files and outputs captions/subtitled video. It does not instruct the agent to read unrelated system files or exfiltrate unrelated secrets. It does recommend installing Python packages (pip), which is expected for local engines but should be performed by the user in a trusted environment.

ℹ Install Mechanism

There is no install spec (instruction-only), which is low-risk. However engines.md shows example pip install commands for third-party Python packages—these are not auto-run by the skill but are a user action that can pull arbitrary code. Users should vet package sources and install in isolated environments if concerned.

✓ Credentials

The skill does not require any environment variables by default. The metadata lists optional API keys (ASSEMBLYAI_API_KEY, DEEPGRAM_API_KEY) that are directly relevant to the cloud transcription options shown. No unrelated credentials or config paths are requested.

✓ Persistence & Privilege

Skill is not always-enabled and does not request elevated or persistent system configuration. disable-model-invocation is false (normal), meaning the agent can call the skill autonomously — this is expected for skills and is not excessive here.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install video-captions
After installation, invoke the skill by name or use /video-captions
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.1

Declared optional cloud API env vars in metadata to clarify that cloud engines require user-provided keys

v1.0.0

Initial release

Metadata

Slug video-captions

Version 1.0.1

License —

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is Video Captions?

Generate professional captions and subtitles with multi-engine transcription, word-level timing, styling presets, and burn-in. It is an AI Agent Skill for Claude Code / OpenClaw, with 812 downloads so far.

How do I install Video Captions?

Run "/install video-captions" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Video Captions free?

Yes, Video Captions is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Video Captions support?

Video Captions is cross-platform and runs anywhere OpenClaw / Claude Code is available (linux, darwin).

Who created Video Captions?

It is built and maintained by Iván (@ivangdavila); the current version is v1.0.1.

More Skills