← 返回 Skills 市场
bytevirts

AI Image & Video Generation

作者 bytevirts · GitHub ↗ · v1.0.3 · MIT-0
cross-platform ✓ 安全检测通过
109
总下载
0
收藏
0
当前安装
4
版本数
在 OpenClaw 中安装
/install ai-image-video-generation
功能描述
Generate images and videos using the VibeVideo API. Use when the user asks to create AI images or videos, check generation status, list available models, or...
使用说明 (SKILL.md)

VibeVideo Generation Skill

Prerequisites

  • Environment variable VIBEVIDEO_API_KEY must be set with a valid API key
  • Get your API key from: Dashboard → Settings → API Keys

API Endpoint

This skill always uses the official VibeVideo API endpoint: https://vibevideo.app

Generate Image

Create an image generation task:

curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "image",
    "scene": "text-to-image",
    "model": "nano-banana-2",
    "prompt": "A cat sitting on a rainbow",
    "options": {
      "aspect_ratio": "1:1",
      "quality": "2K"
    }
  }'

For image-to-image, set "scene": "image-to-image" and add "image_url": "..." in options.

Response:

{ "code": 0, "data": { "id": "task_id", "status": "pending", "taskId": "...", "costCredits": 5 } }

Generate Video

Create a video generation task:

curl -s -X POST https://vibevideo.app/api/ai/generate \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "mediaType": "video",
    "scene": "text-to-video",
    "model": "seedance-2-0",
    "prompt": "A dog playing in a park",
    "options": {
      "resolution": "720p",
      "duration": "5s",
      "aspect_ratio": "16:9"
    }
  }'

For image-to-video, set "scene": "image-to-video" and add "image_url": "..." in options. For frames-to-video, add "start_image_url": "..." and "end_image_url": "..." in options.

Query Task Status

Tasks are asynchronous. Poll until status is success, failed, or canceled:

curl -s -X POST https://vibevideo.app/api/ai/query \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{ "taskId": "YOUR_TASK_ID" }'

Response includes status, taskInfo, taskResult, and taskUrls (JSON string of media URLs).

Calculate Cost

Check credit cost before generating:

curl -s -X POST https://vibevideo.app/api/ai/cost \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "seedance-2-0",
    "mediaType": "video",
    "scene": "text-to-video",
    "options": { "resolution": "720p", "duration": "5s" }
  }'

Cancel Task

curl -s -X DELETE https://vibevideo.app/api/ai/tasks/YOUR_TASK_ID \
  -H "Authorization: Bearer $VIBEVIDEO_API_KEY"

Workflow

  1. If the user doesn't specify a model, pick the default for the scene (see tables below)
  2. Call the generate endpoint
  3. Poll the query endpoint every 5 seconds until status is terminal (success/failed/canceled)
  4. Parse taskUrls from the response and report the media URL(s) to the user
  5. If code is not 0, handle the error (see Error Handling below)

Image Models

ID Label Vendor Scenes Qualities
nano-banana-2 Nano Banana 2 Google text-to-image, image-to-image 1K, 2K, 4K
gpt-image-1-5 GPT Image 1.5 OpenAI text-to-image, image-to-image Medium, High
grok-imagine Grok Imagine Grok text-to-image, image-to-image
seedream-5-0 Seedream 5.0 ByteDance text-to-image, image-to-image Basic, High
qwen-image Qwen Image Qwen text-to-image, image-to-image
wan-2-7-image Wan 2.7 Image Qwen/Alibaba text-to-image, image-to-image 1K, 2K
wan-2-7-image-pro Wan 2.7 Image Pro Qwen/Alibaba text-to-image, image-to-image 1K, 2K, 4K

Default for text-to-image: nano-banana-2

Video Models

ID Label Vendor Scenes Resolutions Durations
seedance-2-0 Seedance 2.0 ByteDance text-to-video, image-to-video, frames-to-video, reference-to-video 720p, 1080p 5s, 10s, 15s
seedance-2-0-fast Seedance 2.0 Fast ByteDance text-to-video, image-to-video, frames-to-video, reference-to-video 720p, 1080p 5s, 10s, 15s
seedance-1-5-pro Seedance 1.5 Pro ByteDance text-to-video, image-to-video 480p, 720p, 1080p 4s, 8s, 12s
grok-imagine Grok Imagine Grok text-to-video, image-to-video 480p, 720p 6s, 10s, 15s
kling-2-6 Kling 2.6 Kling text-to-video, image-to-video 5s, 10s
runway Runway Runway text-to-video, image-to-video 720p, 1080p 5s, 10s
veo-3-1 Veo 3.1 Google text-to-video, image-to-video, frames-to-video, reference-to-video 720p, 1080p, 4k
veo-3-1-fast Veo 3.1 Fast Google text-to-video, image-to-video, frames-to-video, reference-to-video 720p, 1080p, 4k
seedence-1-0-pro Seedence 1.0 Pro ByteDance text-to-video, image-to-video 480p, 720p, 1080p 5s, 10s
seedence-1-0-pro-fast Seedence 1.0 Pro Fast ByteDance image-to-video 720p, 1080p 5s, 10s
seedence-1-0-lite Seedence 1.0 Lite ByteDance text-to-video, image-to-video 480p, 720p, 1080p 5s, 10s

Default for text-to-video: seedance-2-0

Error Handling

  • code: -1 with "no auth": User's API key is missing or invalid. Remind them to set VIBEVIDEO_API_KEY.
  • code: -1002 with "insufficient credits": User needs to purchase credits at VibeVideo dashboard.
  • code: -1 with "invalid": Wrong model ID, scene, or mediaType. Check against the model tables above.
  • Task stuck in "processing": Polling timeout. The task may still complete — suggest the user wait and query again later.

API Response Envelope

All endpoints return:

{ "code": 0, "message": "ok", "data": { ... } }

code: 0 means success. Non-zero code means error (check message).

安全使用建议
This skill appears coherent and limited to calling the VibeVideo API. Before installing: (1) Confirm you trust vibevideo.app and the source of this skill; the skill will be able to use whatever VIBEVIDEO_API_KEY you provide. (2) Keep your API key secret and give the minimal key/permissions if the provider supports scoping; revoke the key if you stop using the skill. (3) Be aware generation may consume paid credits — watch for cost-related error codes and billing. (4) If you are uncomfortable with autonomous agent invocation, disable autonomous use for this skill in your agent settings.
能力评估
Purpose & Capability
Name/description match the runtime instructions: all actions are calls to the VibeVideo API. Required binary (curl) and required env var (VIBEVIDEO_API_KEY) are appropriate for the declared purpose.
Instruction Scope
SKILL.md only instructs making HTTPS requests to https://vibevideo.app, polling task status, parsing taskUrls, checking costs, and cancelling tasks. It does not tell the agent to read unrelated files, other env vars, or send data to unknown endpoints.
Install Mechanism
No install spec and no code files — this is instruction-only and will not write new code to disk. Lower risk because nothing is downloaded or installed by the skill itself.
Credentials
Only one required environment variable (VIBEVIDEO_API_KEY) is declared and used; that is the expected minimal credential for calling the provider's API.
Persistence & Privilege
always:false and no requests to modify other skills or system settings. The skill can be invoked autonomously by the agent (platform default) but it does not request elevated or persistent privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ai-image-video-generation
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ai-image-video-generation 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.3
- No changes detected in this version; all files remain unchanged. - Functionality, documentation, and metadata are identical to the previous release.
v1.0.2
- Now always uses the official VibeVideo API endpoint (https://vibevideo.app); self-hosting option removed. - File _meta.json removed. - Added structured skill metadata with environment variable and requirements. - Minor documentation streamlining in SKILL.md, reflecting the fixed endpoint and updated prerequisites. - No changes to skill usage or endpoints.
v1.0.1
- Added _meta.json file for skill metadata. - No changes to code or documentation content. - Version incremented to 1.0.1.
v1.0.0
Initial public release of the VibeVideo generation skill for image and video creation via the VibeVideo API.
元数据
Slug ai-image-video-generation
版本 1.0.3
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 4
常见问题

AI Image & Video Generation 是什么?

Generate images and videos using the VibeVideo API. Use when the user asks to create AI images or videos, check generation status, list available models, or... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 109 次。

如何安装 AI Image & Video Generation?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ai-image-video-generation」即可一键安装,无需额外配置。

AI Image & Video Generation 是免费的吗?

是的,AI Image & Video Generation 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

AI Image & Video Generation 支持哪些平台?

AI Image & Video Generation 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 AI Image & Video Generation?

由 bytevirts(@bytevirts)开发并维护,当前版本 v1.0.3。

💬 留言讨论