← Back to Skills Marketplace

YouTube AnyCaption Summarizer

Name: YouTube AnyCaption Summarizer
Author: arthurli202602-commits

by arthurli202602-commits · GitHub ↗ · v1.1.4 · MIT-0

cross-platform ⚠ suspicious

202

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install youtube-anycaption-summarizer

Description

Turn YouTube videos into dependable markdown transcripts and polished summaries — even when caption coverage is messy. This skill works with manual closed ca...

Usage Guidance

This package appears to implement what it claims (YouTube transcripts plus an LLM-powered summary with local Whisper fallback), but review these points before installing: - Missing declared dependency: the scripts call the 'openclaw' CLI (openclaw infer/agent) to make LLM calls and to forward progress. The skill metadata does not list 'openclaw' as a required binary. Ensure your environment provides the OpenClaw CLI/gateway if you expect LLM/autosend features to work. - Undeclared environment variables: the code reads session/gateway/env names (YOUTUBE_LAUNCH_SESSION_KEY, OPENCLAW_SESSION_KEY, OPENCLAW_PARENT_SESSION_KEY, OPENCLAW_GATEWAY_PORT, and optional FORWARD_STATE env). If these are present in your environment the skill will use them to post messages into sessions and call the local gateway. Only set those env vars if you intend the skill to have that behavior. - File writes and model download: the installer and scripts will create ~/.openclaw/workspace and may download a Whisper model from huggingface.co into that directory. Confirm you approve creating that folder and downloading the model file (large binary). If you prefer a different location, pass --models-dir when running the scripts. - Network activity: yt-dlp will contact YouTube, and the script will curl the HuggingFace model URL. If you use --cookies or --cookies-from-browser for restricted videos, you will be providing potentially sensitive cookies for yt-dlp to use — only provide cookies you trust and prefer ephemeral handling. - Forwarding behavior: the batch runner can forward progress messages into an OpenClaw session using the openclaw agent and stores a forward-state file under ~/.openclaw/tmp. If you do not want messages forwarded, avoid setting the session-related env vars. Recommendations before installing: inspect the repository yourself (it’s bundled here), ensure you have the expected OpenClaw runtime if you want LLM summarization, verify the HuggingFace model URL is acceptable, and run the workflow in a controlled environment (or with --dry-run / --keep-intermediates) to observe behavior. If you are unsure about session forwarding, unset session-related env vars before running.

Capability Analysis

Type: OpenClaw Skill Name: youtube-anycaption-summarizer Version: 1.1.4 The skill provides a robust YouTube summarization workflow with local Whisper fallback and batch processing, but it includes several high-risk capabilities. Specifically, it supports extracting sensitive browser cookies via yt-dlp (`run_youtube_workflow.py`) and programmatically injecting messages into the active OpenClaw agent session using the `openclaw agent` CLI (`run_youtube_batch_end_to_end.py`). While these features are functionally aligned with the stated goals of accessing restricted videos and providing milestone updates, they represent a significant attack surface for data access and session manipulation. Additionally, the installation script in `SKILL.md` executes a remote download of a binary model from Hugging Face.

Capability Assessment

ℹ Purpose & Capability

Name/description align with the code: scripts use yt-dlp, ffmpeg, Whisper (whisper-cli) and Python to fetch/clean/transcribe YouTube videos and produce summaries. The declared required binaries (yt-dlp, ffmpeg, whisper-cli, python3) match most runtime needs. However, several scripts call the 'openclaw' CLI and rely on an OpenClaw gateway/session integration (for LLM calls and progress forwarding) even though 'openclaw' is not listed as a required binary in the metadata — this is an omitted dependency, not expected from the declared manifest.

⚠ Instruction Scope

SKILL.md and scripts instruct the agent to download a Whisper model from HuggingFace, create ~/.openclaw/workspace, read/write per-video folders and temporary files, call local binaries, and (optionally) forward progress into an OpenClaw session via the openclaw CLI. The instructions reference cookies for restricted videos (user-provided) and call external services (YouTube via yt-dlp, HuggingFace for model download). The problematic items: (1) scripts expect an OpenClaw gateway/CLI and session keys to call the LLM and forward messages, but those env vars/CLI were not declared; (2) forwarding uses openclaw agent to post messages into sessions (requires session keys) which grants the skill the ability to post into your OpenClaw session if such keys are present; (3) the SKILL.md says 'does not touch openclaw.json' but it will create ~/.openclaw/workspace and a forward state file under ~/.openclaw/tmp — this writes to the user's home tree and may persist state.

✓ Install Mechanism

Install steps are brew installs for yt-dlp, ffmpeg, and whisper-cpp (whisper-cli) — these are standard package sources for macOS. The model download uses a direct curl from huggingface.co (a known host). These are expected for a local Whisper fallback. Nothing in the install spec downloads from obscure shorteners or personal IPs.

⚠ Credentials

The registry metadata lists no required env vars, but the code reads and uses multiple environment variables: OPENCLAW_GATEWAY_PORT, YOUTUBE_LAUNCH_SESSION_KEY, OPENCLAW_SESSION_KEY, SESSION_KEY, OPENCLAW_PARENT_SESSION_KEY, and respects FORWARD_STATE_ENV / YOUTUBE_BATCH_FORWARD_STATE. These are optional in code but important: if present they enable forwarding messages and gateway access. The skill will behave differently depending on these env vars, so their omission from the declared requirements is an inconsistency that a user should be aware of.

ℹ Persistence & Privilege

always:false and the skill does not attempt to modify other skills or system-wide configs. It will create and write files under ~/.openclaw/workspace and a forward-state file (default ~/.openclaw/tmp/youtube-anycaption-forward-state.json). It invokes subprocesses (openclaw agent/infer) which can forward progress into an OpenClaw session if session keys exist. This is normal for an OpenClaw-integrated workflow, but combined with undeclared env usage it increases the blast radius if you unintentionally expose a session key.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install youtube-anycaption-summarizer
After installation, invoke the skill by name or use /youtube-anycaption-summarizer
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.1.4

Refresh README, clean packaging artifacts, and publish latest skill improvements with minimal metadata churn.

v1.1.3

Docs safety update: dependency setup now clearly avoids overwriting the workspace folder, OpenClaw config, or an existing ggml-medium model file.

v1.1.2

Docs improvement: add a copy-paste dependency setup block for new users, including Homebrew installs, default Whisper model download, and verification commands.

v1.1.1

Page polish + conversion improvements: fewer, sharper tags; shorter public SKILL page; deep workflow moved to references; added install snippet; added explicit comparison/proof section; preserved full operational guidance in references.

v1.1.0

Listing + SEO polish: add GitHub homepage, runtime/install metadata, stronger hero copy, best-for/examples sections, safer placeholder rendering, standardized completion wording, and improved discoverability tags.

v1.0.0

Initial public release: robust YouTube transcript + summary workflow with CC/auto-caption/Whisper fallback, batch mode, language backfill, detailed implementation-focused summaries, and session-ready completion reporting.

Metadata

Slug youtube-anycaption-summarizer

Version 1.1.4

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 6

Frequently Asked Questions

What is YouTube AnyCaption Summarizer?

Turn YouTube videos into dependable markdown transcripts and polished summaries — even when caption coverage is messy. This skill works with manual closed ca... It is an AI Agent Skill for Claude Code / OpenClaw, with 202 downloads so far.

How do I install YouTube AnyCaption Summarizer?

Run "/install youtube-anycaption-summarizer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is YouTube AnyCaption Summarizer free?

Yes, YouTube AnyCaption Summarizer is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does YouTube AnyCaption Summarizer support?

YouTube AnyCaption Summarizer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created YouTube AnyCaption Summarizer?

It is built and maintained by arthurli202602-commits (@arthurli202602-commits); the current version is v1.1.4.

More Skills