Gemini Video Analyzer
/install gemini-video-analyzer
Gemini Video Analyzer
Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.
Quick Start
# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4
# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"
# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup
Supported Formats
MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.
How It Works
- Video uploads to Google's Files API (temporary, auto-deletes after 48h)
- Gemini processes at 1 frame/sec — understands motion, transitions, audio context
- Model generates response based on your prompt
- Way better than frame extraction for understanding temporal content
Use Cases
| Task | Example Prompt |
|---|---|
| General description | (default — no prompt needed) |
| UI/text extraction | "What text and UI elements are visible?" |
| Tutorial summary | "Summarize the steps shown in this tutorial" |
| Bug report from video | "Describe what went wrong in this screen recording" |
| Meeting notes | "Summarize the key points discussed" |
| Content comparison | Upload 2 videos, ask for differences |
Configuration
Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.
Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.
API Reference
See references/gemini-files-api.md for file upload limits, processing details, and advanced options.
Credits
Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat:
/install gemini-video-analyzer - After installation, invoke the skill by name or use
/gemini-video-analyzer - Provide required inputs per the skill's parameter spec and get structured output
What is Gemini Video Analyzer?
Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe... It is an AI Agent Skill for Claude Code / OpenClaw, with 392 downloads so far.
How do I install Gemini Video Analyzer?
Run "/install gemini-video-analyzer" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.
Is Gemini Video Analyzer free?
Yes, Gemini Video Analyzer is completely free (open-source). You can download, install and use it at no cost.
Which platforms does Gemini Video Analyzer support?
Gemini Video Analyzer is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).
Who created Gemini Video Analyzer?
It is built and maintained by aiwithabidi (@aiwithabidi); the current version is v1.0.0.