← Back to Skills Marketplace
mcdowell8023

Video Summarizer

by mcdowelll · GitHub ↗ · v2.0.0 · MIT-0
cross-platform ⚠ suspicious
67
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install wansan-video-summarizer
Description
Multi-platform video transcript extraction and AI-powered summarization (YouTube, Bilibili, extensible). Use when you need to summarize videos, extract trans...
Usage Guidance
This skill appears to implement what it claims, but review and decide before installing: 1) setup.sh will create a venv and pip-install packages—inspect and run it yourself rather than blindly piping to shell. 2) yt-dlp is invoked with --cookies-from-browser chrome: that can cause local browser cookies (sensitive) to be read by yt-dlp; only allow this if you understand and accept that access. 3) The SKILL.md mentions using an 'innertube + Cloudflare proxy' technique to avoid rate limits—ask the author which endpoints/proxies are used or inspect code paths that call innertube to ensure no unexpected network endpoints are contacted. 4) Only provide optional tokens (LLM_API_KEY, GITHUB_TOKEN, OPENCLAW_GATEWAY_TOKEN, POLLINATIONS_API_KEY) if you trust the skill and intend to use those backends. If you want higher assurance, request the full summarize.py logic that performs LLM and network calls (the included file was truncated) and check for any hard-coded remote endpoints or obfuscated network behavior.
Capability Analysis
Type: OpenClaw Skill Name: wansan-video-summarizer Version: 2.0.0 The skill exhibits high-risk behaviors including accessing local browser data and utilizing sensitive authentication tokens. Specifically, 'scripts/summarize.py' invokes 'yt-dlp' with the '--cookies-from-browser chrome' flag to bypass Bilibili's anti-bot measures, which grants the script access to the user's browser history and session data. Additionally, the script contains logic to utilize a 'GITHUB_TOKEN' by spoofing VS Code/Copilot headers to access the GitHub Copilot API as an LLM fallback. While these actions appear aligned with the stated purpose of video summarization and bypassing platform restrictions, the combination of browser data access and sensitive token usage represents a significant security risk without explicit user consent for these specific high-privilege operations.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
Name/description align with included code and dependencies (yt-dlp, ffmpeg, faster-whisper, youtube-transcript-api, innertube). Using yt-dlp + faster-whisper for Bilibili transcription and ffmpeg for frames is coherent. One minor mismatch: the README/SKILL.md emphasize an 'innertube ANDROID client + Cloudflare proxy' technique to avoid YouTube limits but the code excerpt doesn't show any explicit proxy endpoints or configuration—this is ambiguous but could be an implementation detail of the innertube library.
Instruction Scope
Runtime instructions and scripts will: (a) invoke yt-dlp with --cookies-from-browser chrome (this allows access to local browser cookies), (b) run ffmpeg and external subprocesses, (c) create files under /tmp and a skill-local config/settings.json, and (d) (per README) may call external LLM/image endpoints (LLM_API_URL, OPENCLAW_GATEWAY_TOKEN, GITHUB_TOKEN, POLLINATIONS_API_KEY) if provided. Access to browser cookies and unspecified proxy behavior is the primary scope creep risk; these are not required env vars but are implied by the download approach.
Install Mechanism
There is no registry install spec, but a bundled setup.sh installs a Python venv and pip packages (youtube-transcript-api, requests, innertube, faster-whisper) and checks for yt-dlp/ffmpeg. This is a common pattern; it uses pip (PyPI) and Homebrew for yt-dlp if needed—moderate risk but expected. Nothing in setup.sh downloads arbitrary archives from untrusted servers.
Credentials
Requires no credentials to run, but documents many optional environment variables (LLM_API_URL/KEY, OPENCLAW_GATEWAY_TOKEN, GITHUB_TOKEN, POLLINATIONS_API_KEY). Those are plausible for optional LLM/image fallbacks, but supplying broad tokens like GITHUB_TOKEN or gateway tokens increases blast radius. The SKILL.md/README do not require these; they are optional and explained as fallbacks.
Persistence & Privilege
The skill does not request always:true and is user-invocable. setup.sh writes only to its own skill config directory. It does not appear to modify other skills or global agent settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install wansan-video-summarizer
  3. After installation, invoke the skill by name or use /wansan-video-summarizer
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v2.0.0
Renamed from youtube-summarizer; cross-platform install; YouTube + Bilibili support
Metadata
Slug wansan-video-summarizer
Version 2.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Video Summarizer?

Multi-platform video transcript extraction and AI-powered summarization (YouTube, Bilibili, extensible). Use when you need to summarize videos, extract trans... It is an AI Agent Skill for Claude Code / OpenClaw, with 67 downloads so far.

How do I install Video Summarizer?

Run "/install wansan-video-summarizer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Video Summarizer free?

Yes, Video Summarizer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Video Summarizer support?

Video Summarizer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Video Summarizer?

It is built and maintained by mcdowelll (@mcdowell8023); the current version is v2.0.0.

💬 Comments