← Back to Skills Marketplace

Video Transcribe

Name: Video Transcribe
Author: symbolk

by BoShen · GitHub ↗ · v1.0.0

darwinlinux ⚠ suspicious

496

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install video-to-text

Description

Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe this video", "get the transcript", "what...

Usage Guidance

Transcription: safe and local — the provided script extracts audio, runs the Whisper CLI, and writes .txt/.srt files next to your input file; it needs ffmpeg and the openai-whisper CLI (pip). Before installing: verify you trust the openai-whisper package and have the required system resources for the chosen Whisper model. Privacy: transcripts remain local unless you explicitly use the optional AI Edit flow. AI Edit: the SKILL.md shows an upload flow that sends your video to a remote API (SPARKI_API_BASE / agent-api-test.aicoding.live) and requires a SPARKI_API_KEY—do not set or share that key unless you accept uploading media to that third-party service. If you plan to use AI Edit, review any upload code and the external service's privacy/TOS and verify the SPARKI endpoint is legitimate before enabling the key.

Capability Analysis

Type: OpenClaw Skill Name: video-to-text Version: 1.0.0 The `scripts/transcribe.sh` file contains a shell injection vulnerability. The `$INPUT` variable, which is derived directly from user input via the `SKILL.md` tool definition, is used unsanitized within the `ffmpeg -i "$INPUT"` command. This allows an attacker to inject arbitrary shell commands. Additionally, the `SKILL.md` describes an 'AI Edit' tool that uploads user-provided video/audio files to an external API (`agent-api-test.aicoding.live`), which, while stated functionality, involves significant data transfer to a third-party service.

Capability Assessment

✓ Purpose & Capability

The skill's name/description (video → text) match the declared requirements (ffmpeg and whisper) and the included transcribe.sh implements exactly that. Required binaries are appropriate and proportionate.

ℹ Instruction Scope

The main runtime instructions and scripts operate locally (extract audio, call whisper, write .txt and .srt next to the input). However SKILL.md also documents an optional 'AI Edit' workflow that uploads files to an external API and requires a SPARKI_API_KEY; that networked behavior is outside core transcription and only applies if you enable/use AI Edit.

✓ Install Mechanism

This is instruction-only (no install spec). It recommends installing ffmpeg and pip-installing openai-whisper (whisper CLI). No remote archive downloads or unusual install locations are used by the skill itself.

ℹ Credentials

The skill itself requires no environment variables. The optional AI Edit feature requires a SPARKI_API_KEY (external service) — this is not required for transcription but will be needed if you use AI Edit, which uploads assets to a third-party endpoint.

✓ Persistence & Privilege

always:false and user-invocable:true. The skill does not request elevated persistence or modify other skills' configs. The SKILL.md suggests using openclaw config to store SPARKI_API_KEY if you opt in, which is normal for enabling external integrations.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install video-to-text
After installation, invoke the skill by name or use /video-to-text
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

- Major update: Skill refocused from general video processing to dedicated video/audio transcription. - Renamed skill to "video-to-text" with new description and clarified target use cases (transcription, subtitles, speech-to-text). - Removed platform/quality/workflow/command reference files and general editing scope. - Added local transcription workflow using Whisper (no API key needed) with clear installation and usage instructions. - Skill now outputs both plain text and SRT subtitle files. - Provided guidance to escalate to AI Edit tool for advanced, transcript-driven video editing tasks.

Metadata

Slug video-to-text

Version 1.0.0

License —

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Video Transcribe?

Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe this video", "get the transcript", "what... It is an AI Agent Skill for Claude Code / OpenClaw, with 496 downloads so far.

How do I install Video Transcribe?

Run "/install video-to-text" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Video Transcribe free?

Yes, Video Transcribe is completely free (open-source). You can download, install and use it at no cost.

Which platforms does Video Transcribe support?

Video Transcribe is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).

Who created Video Transcribe?

It is built and maintained by BoShen (@symbolk); the current version is v1.0.0.

More Skills