← Back to Skills Marketplace
erickgrau

YouTube Intelligence Extractor

by Erick Grau · GitHub ↗ · v1.1.0 · MIT-0
cross-platform ⚠ suspicious
115
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install youtube-intelligence-extractor
Description
Extract structured intelligence from YouTube video transcripts for productivity, AI prompting, platform engineering, and creative workflows. Use this skill w...
README (SKILL.md)

YouTube Intelligence Extractor

Overview

This skill turns YouTube video transcripts into structured, actionable intelligence across four domains that matter to Chibitek Labs: productivity, AI prompting, platform/systems engineering, and creative building. Instead of passive consumption, every video becomes a reusable asset — extracted todos, prompts you can run immediately, frameworks you can apply, and insights mapped to your current work.

After presenting the report, always save it as a markdown file and present it to the user. This is standing behavior — do not wait to be asked.

Quick Reference

Situation Approach
User gives a YouTube URL Try Method A (Wave Tube fetch) → fallback to Method B (python) → fallback to Method C (manual paste)
User pastes raw transcript Skip fetch, go straight to extraction pipeline
Python fetch is IP-blocked Use Wave Tube web fetch — it works from server environments
Video has no captions Attempt auto-generated captions; if unavailable, notify user
User wants one specific domain Run full pipeline, present only the requested domain
User wants everything Run full pipeline, present all four intelligence reports
Very long video (1hr+) Chunk transcript, process in sections, merge outputs
Report is complete Always save as markdown file and present_files — do not ask

Step 1: Fetch the Transcript

Method A — Wave Tube (preferred — works from server/cloud environments)

Wave Tube mirrors YouTube transcripts and is not IP-blocked. Always try this first.

URL pattern: https://tube.wave.co/[video-slug]-[VIDEO_ID]

To find the slug: search "[VIDEO_ID] youtube" — the Wave Tube result will appear. Or web_search for the video title + video ID; Wave Tube typically appears in results.

Use the web_fetch tool on the Wave Tube URL. The full transcript appears in the page body under "## Transcript". Also captures: title, channel, views, duration, description, and timestamps in the same fetch.

Why Wave Tube first: youtube-transcript-api fails with IpBlocked when Claude runs on cloud infrastructure (AWS/GCP/Azure). Wave Tube is a reliable public mirror that bypasses this entirely and also provides video metadata in the same fetch.

Method B — Python (works locally on macOS, fails in server environments)

pip install youtube-transcript-api --break-system-packages -q
from youtube_transcript_api import YouTubeTranscriptApi
import re

def extract_video_id(url):
    patterns = [
        r'(?:v=|youtu\.be/)([a-zA-Z0-9_-]{11})',
        r'(?:embed/)([a-zA-Z0-9_-]{11})',
    ]
    for p in patterns:
        m = re.search(p, url)
        if m:
            return m.group(1)
    return url  # assume it's already a video ID

# IMPORTANT: Use instance method .fetch(), NOT .get_transcript()
# The API changed in v1.0+ — always instantiate first
video_id = extract_video_id("VIDEO_URL_HERE")
transcript = YouTubeTranscriptApi().fetch(video_id)
full_text = " ".join([entry.text for entry in transcript])
print(full_text)

Common mistake: YouTubeTranscriptApi.get_transcript() no longer exists in v1.0+. Always use YouTubeTranscriptApi().fetch(video_id) — instantiated, not class method.

Method C — Manual paste (always works, zero dependencies)

If both automated methods fail, ask the user to:

  1. Go to the YouTube video
  2. Click the ... menu → Show transcript
  3. Copy all the transcript text
  4. Paste it directly into the chat

Then proceed with the extraction pipeline on the pasted text.

Step 2: Run the Extraction Pipeline

Once you have the transcript text, extract across all four domains. Each domain is independent — process them cleanly and present them in sequence.

Domain 1 — Todos & Action Items

Extract concrete, doable actions mentioned or implied in the video.

Output format:

## ✅ Todos & Action Items
- [ ] [Specific action] — [context, tool, or condition if mentioned]

Rules:

  • Every item must be specific enough to act on without watching the video
  • Include tool names, URLs, timeframes where mentioned
  • Do NOT include vague advice ("learn more about X") — skip it or make it concrete

Domain 2 — Advice & Frameworks

Extract principles, mental models, frameworks, and strategic advice that apply repeatedly — not just one-time instructions.

Output format:

## 💡 Advice & Frameworks

### [Framework Name]
**The idea:** ...
**Why it matters:** ...
**How to apply it:** ...

Domain 3 — AI Prompts (Extractable & Ready to Use)

Extract any AI prompts, prompt strategies, or prompt patterns mentioned, shown, or implied. Reconstruct them as runnable prompts — not descriptions of prompts.

Output format:

## 🤖 AI Prompts & Techniques

### [Technique Name]
**Purpose:** [What this achieves]
**Prompt:**

[Full, ready-to-run prompt text in a fenced code block]

**Notes:** [Model, context, or usage tips]

Rules:

  • If a prompt is partially shown, reconstruct the most likely complete version
  • Format prompts in fenced code blocks so they are easy to copy
  • Never just describe a prompt — always write the actual text

Domain 4 — Platform, Engineering & Creative Insights

Extract anything relevant to systems design, platform architecture, tooling, development workflows, APIs, infrastructure patterns, or creative production.

Output format:

## ⚙️ Platform, Engineering & Creative Insights

### [Insight Name]
**Domain:** [Engineering / Platform / Creative / Tooling]
**Insight:** ...
**Application for Chibitek/UAMH:** [How this maps to UAMH, STIKI, Nexus, or OpenClaw]

Rules:

  • Always include the Chibitek/UAMH application note — generic insights are less useful
  • If an insight maps to multiple Chibitek systems, name them all

Step 3: Present the Intelligence Report

Report Header (always include)

# 🎬 Intelligence Report: [VIDEO TITLE]
**Channel:** [Channel Name]
**Published:** [Date] · [Duration] · [View count]
**Topic:** [2-3 word tag]
**Source:** [Full YouTube URL]
**Summary:** [One sentence: what this video teaches and who it's for]

---

Report Footer (always include)

## 🏆 Top 3 Immediate Actions
1. [Highest-leverage action from the report]
2. [Second most valuable]
3. [Third]

---
*Extracted by Chibitek Labs YouTube Intelligence Extractor · [Date]*

Step 4: Save the Report (Always — Do Not Wait to Be Asked)

After presenting the report in chat, immediately save it as a markdown file and present it using present_files. This is standing behavior.

File naming convention:

intel-[short-slug-from-title].md

Example: intel-openclaw-5-things.md

Save location: /mnt/user-data/outputs/ (or appropriate workspace output dir)

Every video intelligence report is a reusable asset. Saving automatically means the output is never lost when the chat session ends.

Step 5: Update This Skill After Every Improvement

After any run that reveals a better approach, edge case, or new pattern, update this SKILL.md immediately. Do not wait to be asked. This is standing behavior.

What counts as an improvement worth capturing:

  • A fetch method that worked better than expected
  • A new transcript source discovered
  • An output format tweak that made the report cleaner
  • A new anti-pattern encountered in the wild
  • A domain or extraction type that proved more or less useful than expected

Common Mistakes (Anti-Patterns)

Trying Python fetch first in server environments The youtube-transcript-api library is always IP-blocked when Claude runs on cloud infrastructure. Go to Wave Tube first — it is faster and more reliable in this context.

Using the old class method API YouTubeTranscriptApi.get_transcript() throws AttributeError in v1.0+. Use YouTubeTranscriptApi().fetch(video_id) — instantiated instance, not class method.

Summarizing instead of extracting Do not write a paragraph about what the video covers. Extract discrete, usable pieces. A summary is what YouTube's description does. This skill does more.

Vague action items Bad: - [ ] Learn more about AI prompting Good: - [ ] Try reverse prompting: ask OpenClaw to extract your goals by asking you questions

Paraphrasing prompts instead of reconstructing them If a speaker reads a prompt aloud or shows it on screen, write the actual prompt text in a fenced code block — ready to copy and run. Never just describe it.

Skipping the Chibitek mapping in Domain 4 Generic engineering insights are half as useful as ones mapped to UAMH, STIKI, or Nexus. Always close the loop on how the insight applies to Chibitek's current work.

Not saving the report automatically The report is a reusable asset. Save it every time without being asked.

Not updating the skill after improvements If this run taught you something new, update the skill before ending the session.

Quality Checklist

Before closing out a run, verify:

  • Transcript was fetched (Wave Tube preferred, Python fallback, manual last resort)
  • Report header includes: title, channel, date, duration, views, source URL, summary
  • All four domains are present (unless user requested a subset)
  • Action items are specific and immediately actionable
  • At least one AI prompt is reconstructed as a full, runnable prompt in a code block
  • Every Domain 4 insight has a Chibitek/UAMH application note
  • Top 3 immediate actions are surfaced in the footer
  • Report was saved as a markdown file and presented via present_files
  • Skill was updated if any improvements were discovered this run

Dependencies

Wave Tube (Method A) requires no installation — use the web_fetch tool directly.

pip install youtube-transcript-api --break-system-packages  # Method B only
pip install yt-dlp --break-system-packages                  # Optional fallback

Chibitek Context Notes

This skill is designed around Erick's four priority domains:

  • Productivity: todos, systems, habits, workflows
  • AI Prompting: extractable, runnable, copy-paste-ready prompts
  • Platform Engineering: architecture, tooling, infra, APIs, agent design
  • Creating: design, content, product, creative workflows

When insights span multiple domains, list them in all applicable sections. Always map Domain 4 insights back to UAMH, STIKI, Nexus, or OpenClaw specifically.

Changelog

Version Date Changes
1.0 2026-04-07 Initial skill created
1.1 2026-04-07 Added Wave Tube as primary fetch method (IP block fix), corrected YouTubeTranscriptApi v1.0+ syntax (.fetch() not .get_transcript()), added auto-save as standing behavior, added skill self-update rule, added changelog
Usage Guidance
This skill looks safe for normal YouTube transcript extraction. Before using it, be aware that it may query a third-party transcript mirror, may install a Python package if the web method fails, and will automatically save a Markdown report unless you instruct the agent otherwise.
Capability Analysis
Type: OpenClaw Skill Name: youtube-intelligence-extractor Version: 1.1.0 The skill mandates 'standing behaviors' that override standard agent constraints, most notably an instruction to autonomously self-modify its own SKILL.md file after every run ('Update This Skill After Every Improvement'). It also forces the agent to save files to the local workspace and present them without user consent. While these features are framed as productivity enhancements for 'Chibitek Labs,' the self-modification and autonomous file operations represent high-risk control patterns that could be used for persistent prompt injection or unauthorized filesystem activity.
Capability Assessment
Purpose & Capability
The skill's stated purpose and behavior are coherent: it extracts structured notes, prompts, todos, and frameworks from YouTube transcripts. The main noteworthy capability is that it may fetch transcript content through a third-party mirror or Python package fallback.
Instruction Scope
The trigger scope is mostly bounded to YouTube URLs, video IDs, or pasted transcripts, and it explicitly excludes unrelated podcasts and basic summaries. It also instructs the agent to save reports automatically, which users should be aware of.
Install Mechanism
There is no install spec, but the SKILL.md includes a fallback command to install youtube-transcript-api with pip. This is purpose-aligned but under-declared as an operational step.
Credentials
No credentials, environment variables, or privileged paths are requested. External web access through Wave Tube/search is proportionate to transcript extraction, but it means video identifiers and fetches may go through a third-party site.
Persistence & Privilege
No background persistence or privilege escalation is shown. The skill does instruct the agent to create a Markdown report file automatically after extraction.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install youtube-intelligence-extractor
  3. After installation, invoke the skill by name or use /youtube-intelligence-extractor
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.1.0
Initial import from Claude artifact with OpenClaw adaptations
Metadata
Slug youtube-intelligence-extractor
Version 1.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is YouTube Intelligence Extractor?

Extract structured intelligence from YouTube video transcripts for productivity, AI prompting, platform engineering, and creative workflows. Use this skill w... It is an AI Agent Skill for Claude Code / OpenClaw, with 115 downloads so far.

How do I install YouTube Intelligence Extractor?

Run "/install youtube-intelligence-extractor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is YouTube Intelligence Extractor free?

Yes, YouTube Intelligence Extractor is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does YouTube Intelligence Extractor support?

YouTube Intelligence Extractor is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created YouTube Intelligence Extractor?

It is built and maintained by Erick Grau (@erickgrau); the current version is v1.1.0.

💬 Comments