← Back to Skills Marketplace

transcribe-video

Name: transcribe-video
Author: feiskyer

by Pengfei Ni · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install transcribe-video

Description

Extract transcript or subtitles from a local video file. Use this skill whenever the user asks to transcribe a video, extract speech-to-text, get subtitles,...

Usage Guidance

This skill appears to do what it says (extract embedded subtitles or send audio to OpenAI for transcription), but the package metadata omits two important requirements: (1) you must have ffmpeg/ffprobe installed on the host, and (2) you must create ~/.transcribe_video.env containing OPENAI_API_KEY (and optional OPENAI_API_BASE/TRANSCRIBE_MODEL). Before installing/using: (a) review the included scripts (scripts/transcribe.py) yourself and ensure you trust the code and the unknown publisher, (b) be aware that audio will be uploaded to an external API (privacy risk), (c) if you don’t want network transcription, use only embedded subtitles or adopt an offline STT tool, and (d) ask the publisher to update the manifest to declare required binaries and the primary credential so the skill's metadata accurately reflects its needs. If you proceed, run it in a controlled environment (or without network access) until you are comfortable.

Capability Analysis

Type: OpenClaw Skill Name: transcribe-video Version: 1.0.0 The skill contains shell injection vulnerabilities in `SKILL.md`, where user-provided file paths are directly interpolated into bash commands (ffprobe and ffmpeg) using simple double quotes, which does not prevent command substitution (e.g., $(payload)). Additionally, the skill requires the user to store sensitive API keys in a plaintext configuration file (`~/.transcribe_video.env`). While the Python script `scripts/transcribe.py` uses safer subprocess handling, the instructions provided to the AI agent for pre-processing steps introduce significant execution risks.

Capability Tags

requires-sensitive-credentials

Capability Assessment

⚠ Purpose & Capability

The skill claims to transcribe local video files which aligns with the included script and instructions. However, the registry metadata declares no required binaries or env vars while the SKILL.md and scripts require ffmpeg/ffprobe and an OpenAI API key file (~/.transcribe_video.env). The absence of these declared requirements in the manifest is an incoherence: a legitimate transcribe skill should list ffmpeg/ffprobe and the API key as required.

ℹ Instruction Scope

The SKILL.md stays largely within the stated purpose: it checks for embedded subtitles, extracts audio, and calls an external transcription API. It instructs the agent to read ~/.transcribe_video.env for credentials (explicitly stated) and to run ffprobe/ffmpeg and a local Python script. There is no unrelated file collection, but the instructions do send audio off-host to the OpenAI API, which is expected for API-based transcription but is a privacy consideration the user should be aware of.

✓ Install Mechanism

This is an instruction-only skill with no install spec; the included Python script and instructions rely on user-installed tools and Python packages. No downloads from arbitrary URLs or hidden installers are present.

⚠ Credentials

The code expects credentials (OPENAI_API_KEY and optional OPENAI_API_BASE / TRANSCRIBE_MODEL / AZURE_API_VERSION) loaded from ~/.transcribe_video.env, but the skill metadata declares no required env or primary credential. Requesting an API key (a sensitive secret) is proportionate to doing cloud transcription, but the manifest should explicitly declare it. The skill will transmit audio to an external service (OpenAI or Azure OpenAI) — this is necessary for API transcription but important to surface to users.

✓ Persistence & Privilege

The skill does not request persistent or global privileges (always:false). It does not modify other skills or system-wide configuration. Its runtime behavior is limited to reading a dedicated env file in the user's home, invoking ffmpeg, and calling an external API.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install transcribe-video
After installation, invoke the skill by name or use /transcribe-video
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Initial release: transcribe-video skill extracts transcripts or subtitles from local video files. - Automatically checks for embedded subtitles first for fast, accurate results; uses speech-to-text API if none are found. - Supports popular video formats: mp4, mkv, mov, avi, webm, and others ffmpeg can read. - Guides user through dependencies for API transcription if required. - Outputs a text transcript file in the same folder as the video, and shows a preview with basic info.

Metadata

Slug transcribe-video

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is transcribe-video?

Extract transcript or subtitles from a local video file. Use this skill whenever the user asks to transcribe a video, extract speech-to-text, get subtitles,... It is an AI Agent Skill for Claude Code / OpenClaw, with 77 downloads so far.

How do I install transcribe-video?

Run "/install transcribe-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is transcribe-video free?

Yes, transcribe-video is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does transcribe-video support?

transcribe-video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created transcribe-video?

It is built and maintained by Pengfei Ni (@feiskyer); the current version is v1.0.0.

More Skills