← Back to Skills Marketplace
danielwpp

Media Gen Vision Video

by danielwpp · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
106
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install media-gen-vision-video
Description
Generate and analyze images, and generate videos using OpenClaw's preferred Google media workflows. Use when the user asks to create, edit, inspect, compare,...
Usage Guidance
This skill's instructions clearly expect access to Google media models and to save/send media files, but it doesn't declare any credentials or config paths. Before installing, ask the publisher or platform: (1) How are Google/Gemini/Veo credentials supplied (API key, OAuth connector, or built-in platform integration)? (2) Where will generated files be stored and who can access them? (3) Will the skill run autonomously and could it upload user images to external services? If the platform supplies a documented, least-privilege Google connector (or the skill explicitly lists required env vars like GOOGLE_API_KEY/GEMINI_TOKEN and explains storage locations), the mismatch is resolved and the skill is more acceptable. Without that information, treat the skill as suspicious because it asks the agent to do things that normally require credentials and file access but does not declare them. Provide these answers or update the skill metadata (required env vars/config paths) before enabling it in sensitive environments.
Capability Analysis
Type: OpenClaw Skill Name: media-gen-vision-video Version: 1.0.0 The skill bundle contains standard operational instructions for an AI agent to handle image and video generation tasks using Google-native models (Gemini, Veo 3.1). The instructions in SKILL.md are well-aligned with the stated purpose and do not contain any indicators of malicious intent, data exfiltration, or prompt injection attacks.
Capability Assessment
Purpose & Capability
The SKILL.md repeatedly instructs using Google-native models (Nano Banana 2 / Gemini / Veo 3.1) and 'official Gemini API workflow', but the skill declares no required environment variables, primary credential, or config paths to supply Google API credentials. If the skill truly needs direct access to Google media APIs, it should request credentials or a connector; the absence is inconsistent.
Instruction Scope
Runtime instructions require generating, saving, and delivering binary media files (images/videos) and say to 'save the final file with a stable filename' and 'send the generated asset directly into the conversation.' Those are reasonable for the stated purpose but imply file system and attachment APIs. The skill does not specify where to store files, how to obtain user-supplied reference images, or what channels are used to deliver assets — this ambiguity could lead to broader access than expected.
Install Mechanism
Instruction-only skill with no install spec or remote downloads. This is low-risk from an installation perspective because no new code is written to disk by an installer.
Credentials
No env vars or credentials are declared, yet the workflow clearly needs access to Google APIs (which normally require API keys or OAuth tokens). This omission is disproportionate: either the platform must supply a connector implicitly (which should be documented) or the skill is failing to declare needed secrets.
Persistence & Privilege
always is false and there are no install hooks or requests to modify other skills or global settings. The skill does request the ability to save and send files, which is normal for media workflows and does not itself indicate elevated persistent privilege.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install media-gen-vision-video
  3. After installation, invoke the skill by name or use /media-gen-vision-video
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: image generation/editing, multimodal image understanding, and Veo 3.1 video workflows.
Metadata
Slug media-gen-vision-video
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Media Gen Vision Video?

Generate and analyze images, and generate videos using OpenClaw's preferred Google media workflows. Use when the user asks to create, edit, inspect, compare,... It is an AI Agent Skill for Claude Code / OpenClaw, with 106 downloads so far.

How do I install Media Gen Vision Video?

Run "/install media-gen-vision-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Media Gen Vision Video free?

Yes, Media Gen Vision Video is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Media Gen Vision Video support?

Media Gen Vision Video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Media Gen Vision Video?

It is built and maintained by danielwpp (@danielwpp); the current version is v1.0.0.

💬 Comments