← Back to Skills Marketplace

Media Gen Vision Video

Name: Media Gen Vision Video
Author: danielwpp

by danielwpp · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

106

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install media-gen-vision-video

Description

Generate and analyze images, and generate videos using OpenClaw's preferred Google media workflows. Use when the user asks to create, edit, inspect, compare,...

Usage Guidance

This skill's instructions clearly expect access to Google media models and to save/send media files, but it doesn't declare any credentials or config paths. Before installing, ask the publisher or platform: (1) How are Google/Gemini/Veo credentials supplied (API key, OAuth connector, or built-in platform integration)? (2) Where will generated files be stored and who can access them? (3) Will the skill run autonomously and could it upload user images to external services? If the platform supplies a documented, least-privilege Google connector (or the skill explicitly lists required env vars like GOOGLE_API_KEY/GEMINI_TOKEN and explains storage locations), the mismatch is resolved and the skill is more acceptable. Without that information, treat the skill as suspicious because it asks the agent to do things that normally require credentials and file access but does not declare them. Provide these answers or update the skill metadata (required env vars/config paths) before enabling it in sensitive environments.

Capability Analysis

Type: OpenClaw Skill Name: media-gen-vision-video Version: 1.0.0 The skill bundle contains standard operational instructions for an AI agent to handle image and video generation tasks using Google-native models (Gemini, Veo 3.1). The instructions in SKILL.md are well-aligned with the stated purpose and do not contain any indicators of malicious intent, data exfiltration, or prompt injection attacks.

Capability Assessment

⚠ Purpose & Capability

The SKILL.md repeatedly instructs using Google-native models (Nano Banana 2 / Gemini / Veo 3.1) and 'official Gemini API workflow', but the skill declares no required environment variables, primary credential, or config paths to supply Google API credentials. If the skill truly needs direct access to Google media APIs, it should request credentials or a connector; the absence is inconsistent.

ℹ Instruction Scope

Runtime instructions require generating, saving, and delivering binary media files (images/videos) and say to 'save the final file with a stable filename' and 'send the generated asset directly into the conversation.' Those are reasonable for the stated purpose but imply file system and attachment APIs. The skill does not specify where to store files, how to obtain user-supplied reference images, or what channels are used to deliver assets — this ambiguity could lead to broader access than expected.

✓ Install Mechanism

Instruction-only skill with no install spec or remote downloads. This is low-risk from an installation perspective because no new code is written to disk by an installer.

⚠ Credentials

No env vars or credentials are declared, yet the workflow clearly needs access to Google APIs (which normally require API keys or OAuth tokens). This omission is disproportionate: either the platform must supply a connector implicitly (which should be documented) or the skill is failing to declare needed secrets.

✓ Persistence & Privilege

always is false and there are no install hooks or requests to modify other skills or global settings. The skill does request the ability to save and send files, which is normal for media workflows and does not itself indicate elevated persistent privilege.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install media-gen-vision-video
After installation, invoke the skill by name or use /media-gen-vision-video
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release: image generation/editing, multimodal image understanding, and Veo 3.1 video workflows.

Metadata

Slug media-gen-vision-video

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Media Gen Vision Video?

Generate and analyze images, and generate videos using OpenClaw's preferred Google media workflows. Use when the user asks to create, edit, inspect, compare,... It is an AI Agent Skill for Claude Code / OpenClaw, with 106 downloads so far.

How do I install Media Gen Vision Video?

Run "/install media-gen-vision-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Media Gen Vision Video free?

Yes, Media Gen Vision Video is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Media Gen Vision Video support?

Media Gen Vision Video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Media Gen Vision Video?

It is built and maintained by danielwpp (@danielwpp); the current version is v1.0.0.

More Skills