Gemini Video Analyzer
/install gemini-video-analyzer
Gemini Video Analyzer
Analyze videos natively using Google Gemini's multimodal API. No frame extraction needed — Gemini processes video at 1 FPS with full motion, audio, and visual understanding.
Quick Start
# Analyze a video with default prompt (full description)
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4
# Ask a specific question
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/analyze.py /path/to/video.mp4 "What text is visible on screen?"
# Manage uploaded files
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py list
GOOGLE_AI_API_KEY=$GOOGLE_AI_API_KEY python3 {baseDir}/scripts/manage_files.py cleanup
Supported Formats
MP4, AVI, MOV, MKV, WebM, FLV, MPEG, MPG, WMV, 3GP — up to 2GB per file.
How It Works
- Video uploads to Google's Files API (temporary, auto-deletes after 48h)
- Gemini processes at 1 frame/sec — understands motion, transitions, audio context
- Model generates response based on your prompt
- Way better than frame extraction for understanding temporal content
Use Cases
| Task | Example Prompt |
|---|---|
| General description | (default — no prompt needed) |
| UI/text extraction | "What text and UI elements are visible?" |
| Tutorial summary | "Summarize the steps shown in this tutorial" |
| Bug report from video | "Describe what went wrong in this screen recording" |
| Meeting notes | "Summarize the key points discussed" |
| Content comparison | Upload 2 videos, ask for differences |
Configuration
Set GOOGLE_AI_API_KEY in your environment or .env file. Get a free key at aistudio.google.com.
Default model: gemini-2.5-flash (fast, cheap, excellent vision). Override with --model gemini-2.5-pro for complex analysis.
API Reference
See references/gemini-files-api.md for file upload limits, processing details, and advanced options.
Credits
Built by M. Abidi · LinkedIn · YouTube · GitHub · Book a Call
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install gemini-video-analyzer - 安装完成后,直接呼叫该 Skill 的名称或使用
/gemini-video-analyzer触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Gemini Video Analyzer 是什么?
Native video analysis using Google Gemini API. Upload and analyze video files — describe scenes, extract text/UI, answer questions about content, transcribe... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 392 次。
如何安装 Gemini Video Analyzer?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install gemini-video-analyzer」即可一键安装,无需额外配置。
Gemini Video Analyzer 是免费的吗?
是的,Gemini Video Analyzer 完全免费(开源免费),可自由下载、安装和使用。
Gemini Video Analyzer 支持哪些平台?
Gemini Video Analyzer 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Gemini Video Analyzer?
由 aiwithabidi(@aiwithabidi)开发并维护,当前版本 v1.0.0。