← Back to Skills Marketplace
496
Downloads
2
Stars
1
Active Installs
1
Versions
Install in OpenClaw
/install video-to-text
Description
Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe this video", "get the transcript", "what...
Usage Guidance
Transcription: safe and local — the provided script extracts audio, runs the Whisper CLI, and writes .txt/.srt files next to your input file; it needs ffmpeg and the openai-whisper CLI (pip). Before installing: verify you trust the openai-whisper package and have the required system resources for the chosen Whisper model. Privacy: transcripts remain local unless you explicitly use the optional AI Edit flow. AI Edit: the SKILL.md shows an upload flow that sends your video to a remote API (SPARKI_API_BASE / agent-api-test.aicoding.live) and requires a SPARKI_API_KEY—do not set or share that key unless you accept uploading media to that third-party service. If you plan to use AI Edit, review any upload code and the external service's privacy/TOS and verify the SPARKI endpoint is legitimate before enabling the key.
Capability Analysis
Type: OpenClaw Skill
Name: video-to-text
Version: 1.0.0
The `scripts/transcribe.sh` file contains a shell injection vulnerability. The `$INPUT` variable, which is derived directly from user input via the `SKILL.md` tool definition, is used unsanitized within the `ffmpeg -i "$INPUT"` command. This allows an attacker to inject arbitrary shell commands. Additionally, the `SKILL.md` describes an 'AI Edit' tool that uploads user-provided video/audio files to an external API (`agent-api-test.aicoding.live`), which, while stated functionality, involves significant data transfer to a third-party service.
Capability Assessment
Purpose & Capability
The skill's name/description (video → text) match the declared requirements (ffmpeg and whisper) and the included transcribe.sh implements exactly that. Required binaries are appropriate and proportionate.
Instruction Scope
The main runtime instructions and scripts operate locally (extract audio, call whisper, write .txt and .srt next to the input). However SKILL.md also documents an optional 'AI Edit' workflow that uploads files to an external API and requires a SPARKI_API_KEY; that networked behavior is outside core transcription and only applies if you enable/use AI Edit.
Install Mechanism
This is instruction-only (no install spec). It recommends installing ffmpeg and pip-installing openai-whisper (whisper CLI). No remote archive downloads or unusual install locations are used by the skill itself.
Credentials
The skill itself requires no environment variables. The optional AI Edit feature requires a SPARKI_API_KEY (external service) — this is not required for transcription but will be needed if you use AI Edit, which uploads assets to a third-party endpoint.
Persistence & Privilege
always:false and user-invocable:true. The skill does not request elevated persistence or modify other skills' configs. The SKILL.md suggests using openclaw config to store SPARKI_API_KEY if you opt in, which is normal for enabling external integrations.
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install video-to-text - After installation, invoke the skill by name or use
/video-to-text - Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Major update: Skill refocused from general video processing to dedicated video/audio transcription.
- Renamed skill to "video-to-text" with new description and clarified target use cases (transcription, subtitles, speech-to-text).
- Removed platform/quality/workflow/command reference files and general editing scope.
- Added local transcription workflow using Whisper (no API key needed) with clear installation and usage instructions.
- Skill now outputs both plain text and SRT subtitle files.
- Provided guidance to escalate to AI Edit tool for advanced, transcript-driven video editing tasks.
Metadata
Frequently Asked Questions
What is Video Transcribe?
Use when the user wants to transcribe, caption, or get the text content of a video or audio file — e.g. "transcribe this video", "get the transcript", "what... It is an AI Agent Skill for Claude Code / OpenClaw, with 496 downloads so far.
How do I install Video Transcribe?
Run "/install video-to-text" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Video Transcribe free?
Yes, Video Transcribe is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Video Transcribe support?
Video Transcribe is cross-platform and runs anywhere OpenClaw / Claude Code is available (darwin, linux).
Who created Video Transcribe?
It is built and maintained by BoShen (@symbolk); the current version is v1.0.0.
More Skills