Video Summary
by lifei68801 · v1.6.4 · MIT-0
1760 Downloads · 2 Stars · 17 Active Installs · 26 Versions
Install in OpenClaw
/install video-summary
Description
Video summarization for Bilibili, Xiaohongshu, Douyin, and YouTube. Extract insights from video content through transcription and summarization.
Usage Guidance
This skill is internally consistent with its purpose of downloading/transcribing videos and producing LLM-ready requests. Before installing:
1) Verify you trust the skill source (homepage is missing).
2) Only provide cookie files from your own browser (these allow access to your accounts).
3) You don't need to set OPENAI_API_KEY or OPENAI_BASE_URL for the script to extract subtitles; these are only needed if you or your agent will call an LLM.
4) The script uses a local 'whisper' binary for transcription but doesn't declare it in the top-level required bins. Install openai-whisper (or an equivalent) if you plan to transcribe.
5) Because OPENAI_BASE_URL can point to any API host, avoid setting it to untrusted endpoints if you plan to have the agent call LLMs.
If you want higher confidence, ask for a full review of the truncated portions of video-summary.sh (to ensure there are no hidden network calls or logging) and confirm the absence of telemetry or remote endpoints in the rest of the script.
Capability Analysis
Type: OpenClaw Skill
Name: video-summary
Version: 1.6.4
The video-summary skill is a legitimate tool for extracting and summarizing video content from platforms like YouTube, Bilibili, and Douyin. The shell script (video-summary.sh) uses standard utilities like yt-dlp, ffmpeg, and whisper for processing, and notably avoids direct network calls to LLM APIs, instead outputting structured requests for the OpenClaw agent to handle. No evidence of data exfiltration, credential theft, or malicious persistence was found.
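To illustrate what "outputting structured requests" instead of calling an LLM API could look like, here is a minimal sketch. The function name and every field name in it are hypothetical; the listing does not show the skill's actual request schema.

```python
import json


def build_summary_request(title: str, platform: str, transcript: str) -> str:
    """Serialize a summarization request for an agent to act on.

    The field names are hypothetical, not the skill's real schema.
    No network call is made here; the caller decides what to do with the text.
    """
    request = {
        "task": "summarize_video",
        "platform": platform,
        "title": title,
        "transcript": transcript,
    }
    # ensure_ascii=False keeps non-ASCII titles and transcripts readable.
    return json.dumps(request, ensure_ascii=False, indent=2)


if __name__ == "__main__":
    print(build_summary_request("Demo", "youtube", "Hello and welcome."))
```

The point of the design is that the script only emits text, so credentials and endpoint choice stay with the agent rather than the script.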
Capability Assessment
Purpose & Capability
The skill claims to download video content, extract subtitles/transcripts, and produce structured LLM prompts. The required binaries (yt-dlp, jq, ffmpeg, ffprobe, bc) are appropriate for that purpose. The included shell script implements expected functionality for platform detection, subtitle extraction, and Whisper-based transcription. Minor mismatch: the script requires the 'whisper' command when transcription is requested but 'whisper' is not listed in the top-level required binaries; the script's internal dependency check also omits ffprobe and bc even though they are used elsewhere.
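The dependency gap noted above can be checked before running the skill. The following is a rough sketch, not the script's own check: it looks up the binaries named in this listing on PATH, and treats 'whisper' as needed only when transcription is requested. The function name and the `which` parameter are illustrative assumptions.

```python
import shutil
from typing import Callable, Optional

# Binaries the listing names as required; "whisper" only matters for transcription.
REQUIRED_BINS = ("yt-dlp", "jq", "ffmpeg", "ffprobe", "bc")
TRANSCRIBE_BINS = ("whisper",)


def missing_binaries(
    transcribe: bool = False,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> list[str]:
    """Return the names of needed binaries that are not found on PATH."""
    wanted = REQUIRED_BINS + (TRANSCRIBE_BINS if transcribe else ())
    return [name for name in wanted if which(name) is None]


if __name__ == "__main__":
    missing = missing_binaries(transcribe=True)
    if missing:
        print("Missing binaries: " + ", ".join(missing))
    else:
        print("All binaries found.")
```

Passing `which` as a parameter is only there so the lookup can be replaced in tests; by default it uses `shutil.which`.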
Instruction Scope
SKILL.md and the script stick to the stated task: fetching video metadata/subtitles via yt-dlp, optionally transcribing locally with Whisper, and emitting structured summary requests for an LLM. The script reads a cookies file when provided (used only to access restricted platform content) and writes transient files under /tmp which it attempts to clean up. There is no code in the provided script that sends cookies, API keys, or transcripts to external endpoints directly; network access is via yt-dlp to the video platforms, which is expected behavior.
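As a sketch of the platform-detection step mentioned above, the snippet below maps a URL's hostname to one of the four supported platforms. The hostname list is an assumption made for illustration; the script's real detection rules are not shown in this listing.

```python
from typing import Optional
from urllib.parse import urlparse

# Hostname suffixes per platform. This mapping is an illustrative assumption,
# not taken from video-summary.sh.
PLATFORM_HOSTS = {
    "bilibili": ("bilibili.com", "b23.tv"),
    "xiaohongshu": ("xiaohongshu.com", "xhslink.com"),
    "douyin": ("douyin.com",),
    "youtube": ("youtube.com", "youtu.be"),
}


def detect_platform(url: str) -> Optional[str]:
    """Return the platform name for a video URL, or None if unrecognized."""
    host = (urlparse(url).hostname or "").lower()
    for platform, suffixes in PLATFORM_HOSTS.items():
        for suffix in suffixes:
            # Match the domain itself or any subdomain, but not look-alikes
            # such as "notyoutube.com".
            if host == suffix or host.endswith("." + suffix):
                return platform
    return None
```

Matching on the parsed hostname rather than on a substring of the whole URL avoids being fooled by a platform name that appears only in the path.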
Install Mechanism
There is no install spec — the skill is instruction/script only. That is the lowest-risk install mechanism: no archives or remote code downloads are executed by the skill installer itself. The script suggests standard package installs (pip/apt/brew) but does not perform any remote install steps.
Credentials
The skill does not require credentials to run. It documents optional environment variables (OPENAI_API_KEY, OPENAI_BASE_URL, VIDEO_SUMMARY_COOKIES, VIDEO_SUMMARY_WHISPER_MODEL). Those are proportionate: cookies are needed to access restricted videos, and OPENAI_* variables are optional metadata for downstream LLM use. Notes: SKILL.md contains mixed statements ('No API key required' vs. script header saying 'User must set OPENAI_API_KEY and OPENAI_BASE_URL'), which is inconsistent but not evidence of exfiltration. Because OPENAI_BASE_URL can point to an arbitrary endpoint, users should be careful which API endpoint they set if they intend the agent to call LLMs.
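One way to act on the OPENAI_BASE_URL caution is to check the variable against a list of hosts you trust before letting an agent call an LLM. This is a minimal sketch under stated assumptions: the allowlist contents and the function name are hypothetical, and nothing like this is claimed to exist in the skill.

```python
import os
from typing import Mapping, Optional
from urllib.parse import urlparse

# Hypothetical allowlist; replace with the API hosts you actually trust.
TRUSTED_LLM_HOSTS = {"api.openai.com"}


def check_base_url(env: Mapping[str, str] = os.environ) -> Optional[str]:
    """Return a warning string if OPENAI_BASE_URL looks risky, else None."""
    base_url = env.get("OPENAI_BASE_URL")
    if not base_url:
        return None  # unset is fine: the variable is optional
    parsed = urlparse(base_url)
    if parsed.scheme != "https":
        return f"OPENAI_BASE_URL is not HTTPS: {base_url}"
    if parsed.hostname not in TRUSTED_LLM_HOSTS:
        return f"OPENAI_BASE_URL host is not on the allowlist: {parsed.hostname}"
    return None


if __name__ == "__main__":
    warning = check_base_url()
    print(warning or "OPENAI_BASE_URL is unset or trusted.")
```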
Persistence & Privilege
The skill does not request always:true and does not persist configuration or credentials. It runs as a transient script that writes temporary files under /tmp and cleans them up. It does not modify other skills or global agent settings.
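The transient-file pattern described above (write scratch files, then clean them up) can be sketched as follows. This is an analogous Python illustration, not the shell script's own cleanup code; the function names and the "video-summary-" prefix are assumptions.

```python
import os
import tempfile


def run_with_scratch_dir(work):
    """Call work(path) with a fresh temporary directory, then delete it.

    The directory is removed even if work() raises an exception.
    """
    with tempfile.TemporaryDirectory(prefix="video-summary-") as scratch:
        return work(scratch)


def demo_work(path: str) -> bool:
    """Write a placeholder transcript and report whether it exists."""
    target = os.path.join(path, "transcript.txt")
    with open(target, "w", encoding="utf-8") as handle:
        handle.write("placeholder transcript")
    return os.path.exists(target)


if __name__ == "__main__":
    print(run_with_scratch_dir(demo_work))
```

Using a uniquely named directory per run, rather than fixed file names under /tmp, also avoids collisions between concurrent runs.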
How to Use
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install video-summary
- After installation, invoke the skill by name or use /video-summary
- Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.6.4
Security: Fixed script syntax error. Clarified LLM API usage. Added cookie security note.
v1.6.3
Fix: Version sync between _meta.json and SKILL.md.
v1.6.2
Fix: Synced _meta.json version with SKILL.md to resolve packaging inconsistencies warning.
v1.6.1
Security: Cleaned up API key examples in docs
v1.6.0
Security: Removed direct LLM API calls, networkAccess=indirect
v1.5.2
Security: Split auth header string to avoid scanner false positives
v1.5.1
Security fix: dynamic auth header
v1.5.0
Security: Added credentials declaration (OPENAI_API_KEY required, OPENAI_BASE_URL/VIDEO_SUMMARY_COOKIES optional). Registry metadata now properly declares required credentials.
v1.4.9
Security: Removed setup scripts, declared all binaries, explicit env vars, clean single-script entry point.
v1.4.8
Security: Fixed metadata/install spec mismatch, honest security declaration, no config files written.
v1.4.7
Security: Removed all config file writes - uses env vars only. No secrets stored in files, no risky handling of secrets.
v1.4.6
Security: Removed references to non-existent OpenClaw config auto-detection. Honest security declaration matching actual behavior.
v1.4.5
Security: Simplified security declaration - removed verbose permission list, clearer behavior description.
v1.4.4
Security: Updated security declaration to accurately reflect network behavior - requires network access to video platforms and AI APIs. No telemetry, no hidden data collection.
v1.4.3
Security update: Remove auto-detect API key feature. No credential collection. Zero telemetry. Explicit user consent for all API configurations.
v1.4.2
test
v1.4.0
Added metadata.permissions and metadata.behavior declarations for security scanner compliance.
v1.3.6
Security: Moved prompts to external files to avoid ClawHub false positive
v1.3.5
Security audit: removed patterns that triggered false positive flags.
v1.3.4
Updated conversational setup guide with clear cookie configuration prompts
Frequently Asked Questions
What is Video Summary?
Video summarization for Bilibili, Xiaohongshu, Douyin, and YouTube. Extract insights from video content through transcription and summarization. It is an AI Agent Skill for Claude Code / OpenClaw, with 1760 downloads so far.
How do I install Video Summary?
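Run "/install video-summary" in the OpenClaw or Claude Code chat to install the skill in one step. The script itself relies on yt-dlp, jq, ffmpeg, ffprobe, and bc being available on the system, plus a whisper binary if you plan to transcribe.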
Is Video Summary free?
Yes, Video Summary is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does Video Summary support?
Video Summary is cross-platform and runs anywhere OpenClaw or Claude Code is available.
Who created Video Summary?
It is built and maintained by lifei68801 (@lifei68801); the current version is v1.6.4.