gemini-video-understanding

by lnj22 · GitHub ↗ · v0.1.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 81
Stars: 0
Active Installs: 0
Versions: 1
Install in OpenClaw
/install pedestrian-traffic-counting-gemini-video-understanding
Description
Analyze videos with Google Gemini API (summaries, Q&A, transcription with timestamps + visual context, scene/timeline detection, video clipping, FPS control,...
Usage Guidance
This skill appears to implement Google Gemini video analysis, and its instructions are internally consistent with that purpose, but metadata gaps and provenance issues make it suspicious. Before installing:
  1. Confirm the author/source and why the registry slug references 'pedestrian-traffic-counting' if your use case is general video analysis.
  2. Require the skill to declare the GEMINI_API_KEY (or other credentials) in its metadata and document the required IAM/billing scope.
  3. Understand that local videos and public YouTube URLs will be sent to Google's API; do not send private or sensitive videos unless you accept that.
  4. Verify billing and token lifetime for the Gemini API key, and confirm that you trust the endpoint.
  5. Prefer a skill with a verifiable homepage/source or maintainer contact.
If these items are resolved and provenance is clear, the skill is likely fine to use; if not, treat it as untrusted.
Capability Analysis
Type: OpenClaw Skill
Name: pedestrian-traffic-counting-gemini-video-understanding
Version: 0.1.0
The skill bundle provides legitimate instructions and Python code examples for using the Google Gemini API to analyze videos. It covers standard operations such as file uploads, YouTube URL processing, and structured data extraction using the official google-genai library, with no evidence of malicious intent, data exfiltration, or prompt injection attacks.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The SKILL.md clearly implements Google Gemini video workflows (uploading files, YouTube URIs, model calls) which fits the skill name/description. However the registry slug ('pedestrian-traffic-counting-...') and the published 'source: unknown / homepage: none' indicate inconsistent provenance and a possible mismatch in intended purpose or origin. Also the code examples require a GEMINI_API_KEY, but the skill metadata declares no required environment variables or primary credential.
Instruction Scope
The runtime instructions are concrete and scoped to video analysis: uploading local files, reading video bytes, using the File API, and pointing to public YouTube URLs. They do not instruct reading unrelated system files or secrets beyond an API key. However the examples explicitly call os.getenv("GEMINI_API_KEY") and read local video files — the former is not declared in the skill metadata, which is an inconsistency to resolve before use.
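As a rough illustration of the kind of workflow this assessment describes, the following is a minimal sketch and not the skill's own SKILL.md code. It assumes the google-genai package is installed. The model name `gemini-2.5-flash`, the helper names `is_youtube_url` and `summarize_video`, and the polling interval are all illustrative assumptions. Local paths go through the File API upload, while YouTube links are passed by URI.

```python
import os
import time
from urllib.parse import urlparse


def is_youtube_url(source: str) -> bool:
    """Return True when `source` has a YouTube hostname (hypothetical helper)."""
    host = (urlparse(source).hostname or "").lower()
    return host in {"youtube.com", "www.youtube.com", "m.youtube.com", "youtu.be"}


def summarize_video(source: str,
                    prompt: str = "Summarize this video.",
                    model: str = "gemini-2.5-flash") -> str:
    """Send a local file or a public YouTube URL to Gemini and return the reply text."""
    # Imported lazily so is_youtube_url works without google-genai installed.
    from google import genai
    from google.genai import types

    # The key is read from the environment, as the assessed examples do.
    client = genai.Client(api_key=os.getenv("GEMINI_API_KEY"))

    if is_youtube_url(source):
        # Public YouTube videos are referenced by URI; nothing is uploaded.
        video_part = types.Part(file_data=types.FileData(file_uri=source))
    else:
        # Local files are uploaded to Google via the File API.
        video_part = client.files.upload(file=source)
        # Wait until server-side processing finishes before using the file.
        while video_part.state and video_part.state.name == "PROCESSING":
            time.sleep(2)
            video_part = client.files.get(name=video_part.name)

    response = client.models.generate_content(
        model=model,
        contents=[video_part, prompt],
    )
    return response.text
```

Either branch transmits video content or a video reference to Google's API, which is the data flow the assessment asks you to accept before use.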
Install Mechanism
No install specification and no code files — this is instruction-only, which is lower risk from an installation perspective. Nothing is downloaded or written to disk by an install step in the skill bundle itself.
Credentials
The SKILL.md requires an API key (os.getenv("GEMINI_API_KEY")) for Google Gemini calls, but the skill metadata lists no required environment variables or primary credential. That omission is an inconsistency: a networked API key is necessary for the stated functionality and should be declared and justified. Also consider the privacy implications: videos (local or YouTube) may contain PII and will be transmitted to Google's API.
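Because the metadata does not declare the credential, a caller may want to check for it explicitly before invoking the skill. The sketch below is one hedged way to do that with the standard library only. `REQUIRED_ENV_VARS` and `missing_credentials` are hypothetical names, and the required list is inferred from this assessment rather than from any declared metadata.

```python
import os
from typing import List, Mapping, Optional

# Assumption: the only credential the skill's examples read is GEMINI_API_KEY.
REQUIRED_ENV_VARS = ("GEMINI_API_KEY",)


def missing_credentials(env: Optional[Mapping[str, str]] = None) -> List[str]:
    """Return the names of required variables that are unset or blank."""
    env = os.environ if env is None else env
    return [name for name in REQUIRED_ENV_VARS if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_credentials()
    if missing:
        raise SystemExit("Missing required environment variables: " + ", ".join(missing))
    print("All required credentials are set.")
```

Passing the environment as a parameter keeps the check easy to exercise without touching the real process environment.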
Persistence & Privilege
The skill is not marked always:true and is user-invocable with normal autonomous invocation allowed. It does not request persistent presence or modify other skills. This is the expected configuration for an instruction-only integration.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install pedestrian-traffic-counting-gemini-video-understanding
  3. After installation, invoke the skill by name or use /pedestrian-traffic-counting-gemini-video-understanding
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.1.0
Bulk publish from all-task-skills-dedup
Metadata
Slug pedestrian-traffic-counting-gemini-video-understanding
Version 0.1.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is gemini-video-understanding?

Analyze videos with Google Gemini API (summaries, Q&A, transcription with timestamps + visual context, scene/timeline detection, video clipping, FPS control,... It is an AI Agent Skill for Claude Code / OpenClaw, with 81 downloads so far.

How do I install gemini-video-understanding?

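Run "/install pedestrian-traffic-counting-gemini-video-understanding" in the OpenClaw or Claude Code chat to install it in one step. The install itself needs no extra setup, but the skill's examples read a GEMINI_API_KEY environment variable, so set one before invoking it.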

Is gemini-video-understanding free?

Yes, the skill itself is free and licensed under MIT-0, so you can download, install, and use it at no cost. The Gemini API calls it makes run under your own GEMINI_API_KEY, so verify that key's billing terms separately.

Which platforms does gemini-video-understanding support?

gemini-video-understanding is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created gemini-video-understanding?

It is built and maintained by lnj22 (@lnj22); the current version is v0.1.0.