← Back to Skills Marketplace
huuuwnnn-droid

Video Insight

by huuuwnnn-droid · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
68
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install video-insight
Description
Cross-platform video transcript extraction and optional AI summarization for YouTube and Bilibili. GPU auto-detect. Transcript-first with opt-in LLM summary.
Usage Guidance
What to consider before installing: - Privacy: transcripts are cached permanently under ~/.cache/video-insight. If transcripts may contain sensitive content, clear or relocate the cache and inspect Cache.put/get behavior. - External LLMs: the summarization feature is opt-in, but if you set LLM_API_URL + LLM_API_KEY (or OPENCLAW_GATEWAY_TOKEN), the tool will POST the entire transcript to that endpoint (no truncation). Only set those env vars for endpoints you trust and that have an appropriate privacy policy. - Browser cookies: on download failure the tool retries with yt-dlp --cookies-from-browser chrome. That causes yt-dlp to access your browser cookie store (potentially exposing authenticated content). If you don't want local browser cookies accessed, avoid allowing that fallback or run in an isolated environment. - Installation: setup.sh installs Python packages from PyPI and may modify your Python environment or create a venv. Review the script before running; consider installing in an isolated virtual environment or container. - If you need higher assurance: request the publisher/source, verify package signatures or hashes, or run the tool in an isolated VM/container. If you want the skill but not remote summaries, do not set LLM_API_* or OPENCLAW_GATEWAY_TOKEN and avoid --summarize; manually review cached files and remove them if undesired. Additional info that would change this assessment: an authoritative source/homepage or explicit metadata declaring the env vars the skill expects (and the privacy implications), or removal/documentation of the yt-dlp cookie fallback. With those clarifications this would likely be classified as benign (coherent) rather than suspicious.
Capability Analysis
Type: OpenClaw Skill Name: video-insight Version: 1.0.0 The skill is classified as suspicious due to high-risk capabilities and potential vulnerabilities. Specifically, 'scripts/bilibili.py' attempts to access Chrome browser cookies via 'yt-dlp' to bypass anti-bot measures; while this is a standard feature for video extraction, it is not disclosed in the documentation. Additionally, 'scripts/utils.py' contains a potential Server-Side Request Forgery (SSRF) vulnerability in 'extract_bilibili_id' by following redirects on user-provided URLs without sanitization. These behaviors, while plausibly intended for the stated purpose, represent a significant attack surface and lack of transparency regarding sensitive data access.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
Code and setup align with the stated purpose: yt-dlp, ffmpeg, faster-whisper are used for download, audio extraction, and transcription, and the CLI wraps YouTube/Bilibili handling. However, the code also invokes yt-dlp's --cookies-from-browser fallback (reads browser cookies via yt-dlp) and supports sending full transcripts to arbitrary LLM endpoints — behaviours that are not called out in the top-level metadata and may be surprising to users.
Instruction Scope
SKILL.md says default is transcript-only, but the runtime instructions & code: (1) permanently cache full transcripts to ~/.cache/video-insight, (2) may extract keyframes, (3) on download failure try yt-dlp --cookies-from-browser (reads local browser cookies), and (4) when --summarize is used will post the entire transcript (no truncation) to an external LLM API if LLM envs are set. These are scope-expanding actions (reading browser cookies, persistent local storage, network exfiltration of large transcripts) that are not declared as required in the metadata.
Install Mechanism
No install spec in registry (instruction-only), but a provided setup.sh installs Python deps (yt-dlp, youtube-transcript-api, innertube, requests, faster-whisper) into a venv or system Python. This is a normal approach for such a tool; install uses PyPI (standard). No remote arbitrary binary downloads or obscure URLs in the installer.
Credentials
Registry declares no required env vars, but the code reads many env vars: WHISPER_DEVICE, WHISPER_MODEL, FRAME_TIME_OFFSET, FRAME_INTERVAL, MAX_FRAMES, LLM_API_URL, LLM_API_KEY, LLM_MODEL, OPENCLAW_GATEWAY_TOKEN, etc. In particular, if LLM_API_URL + LLM_API_KEY (or OPENCLAW_GATEWAY_TOKEN) are set, the skill will send full transcripts to that external endpoint — a high-sensitivity action that should be declared and explicitly consented to. Cache storage of transcripts is permanent by default (also not called out as a required configuration item).
Persistence & Privilege
always:false and the skill does not modify other skills. However it writes permanent cached transcript files to ~/.cache/video-insight and creates a venv and cache dir during setup.sh. Temp files are managed and cleaned, but caches are intentionally permanent unless manually removed.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install video-insight
  3. After installation, invoke the skill by name or use /video-insight
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
v1.0.0
Metadata
Slug video-insight
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Video Insight?

Cross-platform video transcript extraction and optional AI summarization for YouTube and Bilibili. GPU auto-detect. Transcript-first with opt-in LLM summary. It is an AI Agent Skill for Claude Code / OpenClaw, with 68 downloads so far.

How do I install Video Insight?

Run "/install video-insight" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Video Insight free?

Yes, Video Insight is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Video Insight support?

Video Insight is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Video Insight?

It is built and maintained by huuuwnnn-droid (@huuuwnnn-droid); the current version is v1.0.0.

💬 Comments