← Back to Skills Marketplace
transcribe-video
by
Pengfei Ni
· GitHub ↗
· v1.0.0
· MIT-0
77
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install transcribe-video
Description
Extract transcript or subtitles from a local video file. Use this skill whenever the user asks to transcribe a video, extract speech-to-text, get subtitles,...
Usage Guidance
This skill appears to do what it says (extract embedded subtitles or send audio to OpenAI for transcription), but the package metadata omits two important requirements: (1) you must have ffmpeg/ffprobe installed on the host, and (2) you must create ~/.transcribe_video.env containing OPENAI_API_KEY (and optional OPENAI_API_BASE/TRANSCRIBE_MODEL). Before installing/using: (a) review the included scripts (scripts/transcribe.py) yourself and ensure you trust the code and the unknown publisher, (b) be aware that audio will be uploaded to an external API (privacy risk), (c) if you don’t want network transcription, use only embedded subtitles or adopt an offline STT tool, and (d) ask the publisher to update the manifest to declare required binaries and the primary credential so the skill's metadata accurately reflects its needs. If you proceed, run it in a controlled environment (or without network access) until you are comfortable.
Capability Analysis
Type: OpenClaw Skill
Name: transcribe-video
Version: 1.0.0
The skill contains shell injection vulnerabilities in `SKILL.md`, where user-provided file paths are directly interpolated into bash commands (ffprobe and ffmpeg) using simple double quotes, which does not prevent command substitution (e.g., $(payload)). Additionally, the skill requires the user to store sensitive API keys in a plaintext configuration file (`~/.transcribe_video.env`). While the Python script `scripts/transcribe.py` uses safer subprocess handling, the instructions provided to the AI agent for pre-processing steps introduce significant execution risks.
Capability Tags
Capability Assessment
Purpose & Capability
The skill claims to transcribe local video files which aligns with the included script and instructions. However, the registry metadata declares no required binaries or env vars while the SKILL.md and scripts require ffmpeg/ffprobe and an OpenAI API key file (~/.transcribe_video.env). The absence of these declared requirements in the manifest is an incoherence: a legitimate transcribe skill should list ffmpeg/ffprobe and the API key as required.
Instruction Scope
The SKILL.md stays largely within the stated purpose: it checks for embedded subtitles, extracts audio, and calls an external transcription API. It instructs the agent to read ~/.transcribe_video.env for credentials (explicitly stated) and to run ffprobe/ffmpeg and a local Python script. There is no unrelated file collection, but the instructions do send audio off-host to the OpenAI API, which is expected for API-based transcription but is a privacy consideration the user should be aware of.
Install Mechanism
This is an instruction-only skill with no install spec; the included Python script and instructions rely on user-installed tools and Python packages. No downloads from arbitrary URLs or hidden installers are present.
Credentials
The code expects credentials (OPENAI_API_KEY and optional OPENAI_API_BASE / TRANSCRIBE_MODEL / AZURE_API_VERSION) loaded from ~/.transcribe_video.env, but the skill metadata declares no required env or primary credential. Requesting an API key (a sensitive secret) is proportionate to doing cloud transcription, but the manifest should explicitly declare it. The skill will transmit audio to an external service (OpenAI or Azure OpenAI) — this is necessary for API transcription but important to surface to users.
Persistence & Privilege
The skill does not request persistent or global privileges (always:false). It does not modify other skills or system-wide configuration. Its runtime behavior is limited to reading a dedicated env file in the user's home, invoking ffmpeg, and calling an external API.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install transcribe-video - After installation, invoke the skill by name or use
/transcribe-video - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Initial release: transcribe-video skill extracts transcripts or subtitles from local video files.
- Automatically checks for embedded subtitles first for fast, accurate results; uses speech-to-text API if none are found.
- Supports popular video formats: mp4, mkv, mov, avi, webm, and others ffmpeg can read.
- Guides user through dependencies for API transcription if required.
- Outputs a text transcript file in the same folder as the video, and shows a preview with basic info.
Metadata
Frequently Asked Questions
What is transcribe-video?
Extract transcript or subtitles from a local video file. Use this skill whenever the user asks to transcribe a video, extract speech-to-text, get subtitles,... It is an AI Agent Skill for Claude Code / OpenClaw, with 77 downloads so far.
How do I install transcribe-video?
Run "/install transcribe-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is transcribe-video free?
Yes, transcribe-video is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does transcribe-video support?
transcribe-video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created transcribe-video?
It is built and maintained by Pengfei Ni (@feiskyer); the current version is v1.0.0.
More Skills