
video2txt-视频理解字幕提取 (video understanding and subtitle extraction)

Name: video2txt-视频理解字幕提取
Author: chentx1243

by Chentx1243 · GitHub ↗ · v1.0.1 · MIT-0

cross-platform ✓ Security Clean

Downloads: 142

Install in OpenClaw

/install maple-video2txt

Description

Transcribes local video or audio files into SRT subtitle files and plain-text TXT files.

Usage Guidance

This skill appears to do what it says: transcribe local media into .srt and .txt using faster-whisper and ffmpeg. Before installing or running it:

1) Run pip installs in an isolated venv.
2) Ensure ffmpeg/ffprobe are installed and on PATH.
3) Be aware that the first run will download Whisper model files (network traffic and significant disk use); verify you are comfortable with that.
4) Inspect the SKILL.md and script for any hidden/unexpected characters or modifications (the static scan flagged unicode control characters).
5) If you plan to run it automatically, restrict it to media files you trust; the script will read local files and write model files and outputs to disk.

If anything in the SKILL.md raw text looks suspicious, do not grant it automated/autonomous execution until you confirm the content is clean.
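Steps 1 and 2 above can be checked before running anything. The following is a minimal preflight sketch, not part of the skill itself; the function names `missing_tools` and `in_virtualenv` are hypothetical and chosen only for illustration.

```python
import shutil
import sys


def missing_tools(tools=("ffmpeg", "ffprobe")):
    """Return the names of required executables that are not found on PATH."""
    return [name for name in tools if shutil.which(name) is None]


def in_virtualenv():
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the venv directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    missing = missing_tools()
    if missing:
        print("Missing from PATH:", ", ".join(missing))
    if not in_virtualenv():
        print("Not in a virtual environment; consider creating one before pip install.")
```

Running the file directly prints a warning for each failed check and nothing when both pass.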

Capability Analysis

Type: OpenClaw Skill
Name: maple-video2txt
Version: 1.0.1

The skill bundle is a legitimate tool for transcribing video and audio files using the faster-whisper library. The Python script (video_to_text.py) implements standard media processing logic, including duration detection via ffprobe and text normalization. No evidence of data exfiltration, malicious execution, or prompt injection was found; all behaviors, including model downloading and file system access, are consistent with the stated purpose.
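To illustrate what duration detection via ffprobe typically looks like, here is a hedged sketch. It is not taken from video_to_text.py; the helper names `parse_duration` and `probe_duration` are assumptions, and the actual script may invoke ffprobe differently.

```python
import subprocess


def parse_duration(ffprobe_output):
    """Convert ffprobe's plain-text duration output to seconds, or None."""
    text = ffprobe_output.strip()
    try:
        return float(text)
    except ValueError:
        # Empty output or "N/A" means no duration could be read.
        return None


def probe_duration(path):
    """Ask ffprobe for the container duration of `path`, in seconds."""
    result = subprocess.run(
        [
            "ffprobe", "-v", "error",
            "-show_entries", "format=duration",
            "-of", "default=noprint_wrappers=1:nokey=1",
            path,
        ],
        capture_output=True, text=True, check=True,
    )
    return parse_duration(result.stdout)
```

Passing the arguments as a list (rather than a shell string) keeps file names with spaces or special characters from being interpreted by a shell.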

Capability Assessment

✓ Purpose & Capability

Name/description, required binary (python3), declared dependencies (faster-whisper, av, opencc) and the included Python script all align with a local transcription/subtitle generation tool. The script uses ffprobe/ffmpeg and faster-whisper as expected.
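For readers unfamiliar with this kind of pipeline, the sketch below shows one way faster-whisper segments can be turned into SRT text. It is an illustration under stated assumptions, not the skill's code: `format_timestamp`, `to_srt`, and `transcribe` are hypothetical names, and the model size, device, and compute type are example values.

```python
def format_timestamp(seconds):
    """Format seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments):
    """Render (start, end, text) tuples as SRT cues numbered from 1."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line.
    return "\n".join(blocks)


def transcribe(path, model_size="small", language="zh"):
    """Transcribe with faster-whisper and return (start, end, text) tuples."""
    # Imported lazily so the formatting helpers work without the dependency.
    from faster_whisper import WhisperModel

    # Assumed example settings; the first use of a model downloads its files.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    segments, _info = model.transcribe(path, language=language)
    return [(seg.start, seg.end, seg.text) for seg in segments]
```

A plain TXT output can be produced from the same tuples by joining the text fields with newlines.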

ℹ Instruction Scope

SKILL.md instructs the agent to run the provided Python script and to install dependencies; the instructions stay within the transcription task. Important operational behaviors are called out (model download on first run, use background execution). Note: the script will perform network downloads for Whisper models and will call ffprobe/ffmpeg via subprocess; these are expected but are external network/system interactions the user should be aware of. Also the SKILL.md pre-scan flagged unicode-control-chars (possible prompt-injection attempt) — the visible content looks normal, but a manual check of raw file bytes for hidden control characters is recommended.
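The manual check recommended above can be approximated with a short script. This is a sketch under assumptions: it treats every Unicode control (Cc) and format (Cf) character other than tab, carriage return, and newline as suspicious, which may differ from what the marketplace's scanner flags. The function name `find_hidden_chars` is hypothetical.

```python
import sys
import unicodedata

# Whitespace control characters that are normal in text files.
ALLOWED = {"\n", "\r", "\t"}


def find_hidden_chars(text):
    """Return (line, column, codepoint, name) for each suspicious character."""
    findings = []
    line_no, col = 1, 0
    for ch in text:
        if ch == "\n":
            line_no, col = line_no + 1, 0
            continue
        col += 1
        # Cc = control characters; Cf = format characters such as
        # zero-width spaces, bidirectional overrides, and the BOM.
        if ch not in ALLOWED and unicodedata.category(ch) in ("Cc", "Cf"):
            name = unicodedata.name(ch, "UNNAMED")
            findings.append((line_no, col, f"U+{ord(ch):04X}", name))
    return findings


if __name__ == "__main__":
    # Usage: python check_hidden.py SKILL.md
    with open(sys.argv[1], "rb") as handle:
        raw = handle.read().decode("utf-8", errors="replace")
    for finding in find_hidden_chars(raw):
        print(*finding)
```

Some hits are benign (for example, a zero-width joiner inside an emoji sequence), so the output is a list of places to inspect by hand rather than a verdict.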

✓ Install Mechanism

No custom install spec; standard pip requirements.txt is provided. This is low-risk compared with arbitrary remote archive downloads. The only external runtime download is the Whisper model files (expected for this functionality).

✓ Credentials

The skill requests no environment variables or credentials. It needs access to local files (input media) and will write SRT/TXT output and model files to disk (models directory). These requirements are proportional to the stated goal.

✓ Persistence & Privilege

Skill is not always-on and does not request special platform privileges. It does not declare or appear to modify other skills or global agent settings.

How to Use

1) Make sure OpenClaw is installed (local or Docker).
2) Run the install command in chat: /install maple-video2txt
3) After installation, invoke the skill by name or use /maple-video2txt
4) Provide the required inputs per the skill's parameter spec and get structured output.

Version History

v1.0.1

- Changed skill emoji from 🎬 to video in the metadata.
- Added ffprobe/ffmpeg and Whisper model files as dependencies in the documentation.
- Updated script file location: removed video_to_text.py from root, added scripts/video_to_text.py.
- No functional changes to user-visible features.

v1.0.0

Initial release: revamped and simplified to focus on local video/audio transcription to text and subtitles.
- Transcribes local video or audio files into SRT subtitle and plain TXT files using faster-whisper.
- Supports a variety of video and audio formats; defaults to Chinese (Simplified) recognition.
- Provides detailed CLI usage with customizable parameters for output, language, and model.
- Progress is reported during execution; intended for background runs to avoid popups.
- Dependencies listed in requirements.txt; Node/npm scripts and docx outputs from previous version removed.

Metadata

Slug maple-video2txt

Version 1.0.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 2

Frequently Asked Questions

What is video2txt-视频理解字幕提取?

It transcribes local video or audio files into SRT subtitle files and plain-text TXT files. It is an AI Agent Skill for Claude Code / OpenClaw, with 142 downloads so far.

How do I install video2txt-视频理解字幕提取?

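Run "/install maple-video2txt" in the OpenClaw or Claude Code chat to install the skill. The script's own dependencies (the pip packages in requirements.txt, plus ffmpeg/ffprobe on PATH) still need to be installed separately, and the first run downloads the Whisper model files.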

Is video2txt-视频理解字幕提取 free?

Yes, video2txt-视频理解字幕提取 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does video2txt-视频理解字幕提取 support?

video2txt-视频理解字幕提取 is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created video2txt-视频理解字幕提取?

It is built and maintained by Chentx1243 (@chentx1243); the current version is v1.0.1.
