← Back to Skills Marketplace

AI Image & Video Generation

Name: AI Image & Video Generation
Author: bytevirts

by bytevirts · GitHub ↗ · v1.0.3 · MIT-0

cross-platform ✓ Security Clean

109

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install ai-image-video-generation

Description

Generate images and videos using the VibeVideo API. Use when the user asks to create AI images or videos, check generation status, list available models, or...

README (SKILL.md)

VibeVideo Generation Skill

Prerequisites

Environment variable VIBEVIDEO_API_KEY must be set with a valid API key
Get your API key from: Dashboard → Settings → API Keys

API Endpoint

This skill always uses the official VibeVideo API endpoint: https://vibevideo.app

Generate Image

Create an image generation task:

curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "image",
    "scene": "text-to-image",
    "model": "nano-banana-2",
    "prompt": "A cat sitting on a rainbow",
    "options": {
      "aspect_ratio": "1:1",
      "quality": "2K"
    }
  }'

For image-to-image, set "scene": "image-to-image" and add "image_url": "..." in options.

Response:

{ "code": 0, "data": { "id": "task_id", "status": "pending", "taskId": "...", "costCredits": 5 } }

Generate Video

Create a video generation task:

curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "video",
    "scene": "text-to-video",
    "model": "seedance-2-0",
    "prompt": "A dog playing in a park",
    "options": {
      "resolution": "720p",
      "duration": "5s",
      "aspect_ratio": "16:9"
    }
  }'

For image-to-video, set "scene": "image-to-video" and add "image_url": "..." in options. For frames-to-video, add "start_image_url": "..." and "end_image_url": "..." in options.

Query Task Status

Tasks are asynchronous. Poll until status is success, failed, or canceled:

curl -s -X POST https://vibevideo.app/api/ai/query \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{ "taskId": "YOUR_TASK_ID" }'

Response includes status, taskInfo, taskResult, and taskUrls (JSON string of media URLs).

Calculate Cost

Check credit cost before generating:

curl -s -X POST https://vibevideo.app/api/ai/cost \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "seedance-2-0",
    "mediaType": "video",
    "scene": "text-to-video",
    "options": { "resolution": "720p", "duration": "5s" }
  }'

Cancel Task

curl -s -X DELETE https://vibevideo.app/api/ai/tasks/YOUR_TASK_ID \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY"

Workflow

If the user doesn't specify a model, pick the default for the scene (see tables below)
Call the generate endpoint
Poll the query endpoint every 5 seconds until status is terminal (success/failed/canceled)
Parse taskUrls from the response and report the media URL(s) to the user
If code is not 0, handle the error (see Error Handling below)

Image Models

ID	Label	Vendor	Scenes	Qualities
nano-banana-2	Nano Banana 2	Google	text-to-image, image-to-image	1K, 2K, 4K
gpt-image-1-5	GPT Image 1.5	OpenAI	text-to-image, image-to-image	Medium, High
grok-imagine	Grok Imagine	Grok	text-to-image, image-to-image	—
seedream-5-0	Seedream 5.0	ByteDance	text-to-image, image-to-image	Basic, High
qwen-image	Qwen Image	Qwen	text-to-image, image-to-image	—
wan-2-7-image	Wan 2.7 Image	Qwen/Alibaba	text-to-image, image-to-image	1K, 2K
wan-2-7-image-pro	Wan 2.7 Image Pro	Qwen/Alibaba	text-to-image, image-to-image	1K, 2K, 4K

Default for text-to-image: nano-banana-2

Video Models

ID	Label	Vendor	Scenes	Resolutions	Durations
seedance-2-0	Seedance 2.0	ByteDance	text-to-video, image-to-video, frames-to-video, reference-to-video	720p, 1080p	5s, 10s, 15s
seedance-2-0-fast	Seedance 2.0 Fast	ByteDance	text-to-video, image-to-video, frames-to-video, reference-to-video	720p, 1080p	5s, 10s, 15s
seedance-1-5-pro	Seedance 1.5 Pro	ByteDance	text-to-video, image-to-video	480p, 720p, 1080p	4s, 8s, 12s
grok-imagine	Grok Imagine	Grok	text-to-video, image-to-video	480p, 720p	6s, 10s, 15s
kling-2-6	Kling 2.6	Kling	text-to-video, image-to-video	—	5s, 10s
runway	Runway	Runway	text-to-video, image-to-video	720p, 1080p	5s, 10s
veo-3-1	Veo 3.1	Google	text-to-video, image-to-video, frames-to-video, reference-to-video	720p, 1080p, 4k	—
veo-3-1-fast	Veo 3.1 Fast	Google	text-to-video, image-to-video, frames-to-video, reference-to-video	720p, 1080p, 4k	—
seedence-1-0-pro	Seedence 1.0 Pro	ByteDance	text-to-video, image-to-video	480p, 720p, 1080p	5s, 10s
seedence-1-0-pro-fast	Seedence 1.0 Pro Fast	ByteDance	image-to-video	720p, 1080p	5s, 10s
seedence-1-0-lite	Seedence 1.0 Lite	ByteDance	text-to-video, image-to-video	480p, 720p, 1080p	5s, 10s

Default for text-to-video: seedance-2-0

Error Handling

code: -1 with "no auth": User's API key is missing or invalid. Remind them to set VIBEVIDEO_API_KEY.
code: -1002 with "insufficient credits": User needs to purchase credits at VibeVideo dashboard.
code: -1 with "invalid": Wrong model ID, scene, or mediaType. Check against the model tables above.
Task stuck in "processing": Polling timeout. The task may still complete — suggest the user wait and query again later.

API Response Envelope

All endpoints return:

{ "code": 0, "message": "ok", "data": { ... } }

code: 0 means success. Non-zero code means error (check message).

Usage Guidance

This skill appears coherent and limited to calling the VibeVideo API. Before installing: (1) Confirm you trust vibevideo.app and the source of this skill; the skill will be able to use whatever VIBEVIDEO_API_KEY you provide. (2) Keep your API key secret and give the minimal key/permissions if the provider supports scoping; revoke the key if you stop using the skill. (3) Be aware generation may consume paid credits — watch for cost-related error codes and billing. (4) If you are uncomfortable with autonomous agent invocation, disable autonomous use for this skill in your agent settings.

Capability Assessment

✓ Purpose & Capability

Name/description match the runtime instructions: all actions are calls to the VibeVideo API. Required binary (curl) and required env var (VIBEVIDEO_API_KEY) are appropriate for the declared purpose.

✓ Instruction Scope

SKILL.md only instructs making HTTPS requests to https://vibevideo.app, polling task status, parsing taskUrls, checking costs, and cancelling tasks. It does not tell the agent to read unrelated files, other env vars, or send data to unknown endpoints.

✓ Install Mechanism

No install spec and no code files — this is instruction-only and will not write new code to disk. Lower risk because nothing is downloaded or installed by the skill itself.

✓ Credentials

Only one required environment variable (VIBEVIDEO_API_KEY) is declared and used; that is the expected minimal credential for calling the provider's API.

✓ Persistence & Privilege

always:false and no requests to modify other skills or system settings. The skill can be invoked autonomously by the agent (platform default) but it does not request elevated or persistent privileges.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install ai-image-video-generation
After installation, invoke the skill by name or use /ai-image-video-generation
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.3

- No changes detected in this version; all files remain unchanged. - Functionality, documentation, and metadata are identical to the previous release.

v1.0.2

- Now always uses the official VibeVideo API endpoint (https://vibevideo.app); self-hosting option removed. - File _meta.json removed. - Added structured skill metadata with environment variable and requirements. - Minor documentation streamlining in SKILL.md, reflecting the fixed endpoint and updated prerequisites. - No changes to skill usage or endpoints.

v1.0.1

- Added _meta.json file for skill metadata. - No changes to code or documentation content. - Version incremented to 1.0.1.

v1.0.0

Initial public release of the VibeVideo generation skill for image and video creation via the VibeVideo API.

Metadata

Slug ai-image-video-generation

Version 1.0.3

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 4

Frequently Asked Questions

What is AI Image & Video Generation?

Generate images and videos using the VibeVideo API. Use when the user asks to create AI images or videos, check generation status, list available models, or... It is an AI Agent Skill for Claude Code / OpenClaw, with 109 downloads so far.

How do I install AI Image & Video Generation?

Run "/install ai-image-video-generation" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is AI Image & Video Generation free?

Yes, AI Image & Video Generation is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does AI Image & Video Generation support?

AI Image & Video Generation is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created AI Image & Video Generation?

It is built and maintained by bytevirts (@bytevirts); the current version is v1.0.3.

More Skills