功能描述

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video ge...

使用说明 (SKILL.md)

IMA Video AI — Video Generator

Name: IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation
Author: dai-shuo

For complete API documentation, security details, all parameters, and Python examples, read SKILL-DETAIL.md. ⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first video generation call. It contains the full API payload structure, error handling tables, and UX protocol that this summary omits. Skipping it or reading only part of it causes parameter errors and poor user experience.

Model ID Reference (CRITICAL)

Use exact model_id from this table. Do NOT infer from friendly names.

Friendly Name	model_id (t2v)	model_id (i2v)	Notes
Wan 2.6	`wan2.6-t2v`	`wan2.6-i2v`	⚠️ -t2v/-i2v suffix
Kling O1	`kling-video-o1`	`kling-video-o1`	⚠️ video- prefix
Kling 2.6	`kling-v2-6`	`kling-v2-6`	⚠️ v prefix
Hailuo 2.3	`MiniMax-Hailuo-2.3`	`MiniMax-Hailuo-2.3`	⚠️ MiniMax- prefix
Hailuo 2.0	`MiniMax-Hailuo-02`	`MiniMax-Hailuo-02`	⚠️ 02 not 2.0
Vidu Q2	`viduq2`	`viduq2-pro`	⚠️ Different for t2v/i2v
Google Veo 3.1	`veo-3.1-generate-preview`	`veo-3.1-generate-preview`	⚠️ -generate-preview suffix
Sora 2 Pro	`sora-2-pro`	`sora-2-pro`	✅ Straightforward
Pixverse V5.5	`pixverse`	`pixverse`	✅ Same as friendly name
SeeDance 1.5 Pro	`doubao-seedance-1.5-pro`	`doubao-seedance-1.5-pro`	⚠️ doubao- prefix

User input aliases: 万/Wan → wan2.6-* · 可灵/Kling → kling-video-o1 · 海螺/Hailuo → MiniMax-Hailuo-2.3 · Veo/Google Veo → veo-3.1-generate-preview

Video Generation Modes

User intent	task_type	When to use
Text only, no image	`text_to_video`	"生成一段…视频" / "text to video"
Image as first frame	`image_to_video`	"把这张图动起来" / "图生视频"
Image as visual reference (not first frame)	`reference_image_to_video`	"参考这张图生成" / "像这张风格"
Two images (start + end)	`first_last_frame_to_video`	"首帧+尾帧" / "从A过渡到B"

Visual Consistency (IMPORTANT)

If user mentions "same character", "series", "multi-shot", or continues from a previous generation:

Do NOT use text_to_video (will produce different-looking results)
Use image_to_video or reference_image_to_video with previous result as reference
Read ima-knowledge-ai/references/visual-consistency.md if available

Pre-Check: Knowledge Base

If ima-knowledge-ai is installed, read before generating:

ima-knowledge-ai/references/video-modes.md — understand mode differences
ima-knowledge-ai/references/visual-consistency.md — if multi-shot or character continuity needed

If not installed: use this SKILL's model table and defaults.

Model Selection Priority

User preference (if explicitly stated) → highest priority
ima-knowledge-ai recommendation (if installed)
Fallback defaults:

Task	Default Model	model_id	Cost
text_to_video	Wan 2.6	`wan2.6-t2v`	25 pts
text_to_video (premium)	Hailuo 2.3	`MiniMax-Hailuo-2.3`	38 pts
text_to_video (budget)	Vidu Q2	`viduq2`	5 pts
image_to_video	Wan 2.6	`wan2.6-i2v`	25 pts
first_last_frame	Kling O1	`kling-video-o1`	48 pts
reference_image	Kling O1	`kling-video-o1`	48 pts

User Input Parsing

Duration: 5秒→5 · 10秒→10 · 15秒→15 · 1分钟→use max (tell user "当前最长15秒") Aspect ratio: 横屏/16:9→16:9 · 竖屏/9:16→9:16 · 方形/1:1→1:1 Resolution: 720P/1080P/4K (if model supports) Budget: 最便宜→Vidu Q2 (5pts) · 最好→Kling O1 or Veo 3.1

Script Usage

# Text to video
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type text_to_video \
  --model-id wan2.6-t2v \
  --prompt "a puppy runs across a sunny meadow" \
  --user-id {user_id} \
  --output-json

# Image to video (accepts URLs and local file paths)
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type image_to_video \
  --model-id wan2.6-i2v \
  --prompt "camera slowly zooms in" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} \
  --output-json

# First-last frame (exactly 2 images required)
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type first_last_frame_to_video \
  --model-id kling-video-o1 \
  --prompt "smooth transition" \
  --input-images first.jpg last.jpg \
  --user-id {user_id} \
  --output-json

Sending Results to User

# ✅ CORRECT: Use remote URL directly for inline video playback
message(action="send", media=video_url, caption="✅ 视频生成成功！\
• 模型：[Name]\
• 耗时：[X]s\
• 积分：[N pts]")

# Then send link for copying
message(action="send", message=f"🔗 视频链接：\
{video_url}")

# ❌ WRONG: Never download to local file (shows as attachment, not playable)

UX Protocol (Brief)

Pre-generation: "🎬 开始生成视频… 模型：[Name]，预计[X~Y]秒，消耗[N]积分"
Progress: Every 30-60s: "⏳ 视频生成中… [P]%" (cap at 95% until API returns success)
Success: Send video via media=video_url + send link text for sharing
Failure: Natural language error + suggest alternative models. Never show technical errors to users. See SKILL-DETAIL.md for full error translation table.

Never say to users: script names, API endpoints, attribute_id, technical parameter names. Only: model name · time · credits · result · status.

Sora 2 Pro Content Policy

Sora has strict OpenAI content safety: ❌ people, celebrities, IP assets. ✅ landscapes, abstract, animals, nature. If rejected, suggest Wan 2.6 or Kling O1.

Environment

Base URL: https://api.imastudio.com Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en

Core Flow

GET /open/v1/product/list?app=ima&platform=web&category=\x3Ctask_type> → get attribute_id, credit, model_version, form_config
[Image tasks only] Upload images or pass local paths to script
POST /open/v1/tasks/create → get task_id
POST /open/v1/tasks/detail → poll every 8s until resource_status==1

MANDATORY: Always query product list first. attribute_id is required — if 0 or missing, task fails.

User Preference Memory

Storage: ~/.openclaw/memory/ima_prefs.json

Save when user explicitly says "用XXX" / "默认用XXX" / "always use XXX"
Clear when user says "用最好的" / "推荐一个" / "自动选择"
Never save auto-selected or fallback models as preferences

Pixverse Special Case (v1.0.7+)

Pixverse V5.5/V5/V4 lack model in form_config. Script auto-infers from model_name (e.g. "Pixverse V5.5" → model: "v5.5"). No manual action needed.

⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating video. This file is a summary — SKILL-DETAIL.md has the complete API reference, error translation table, and UX protocol you need for correct execution.

安全使用建议

This skill appears coherent: it uses a bundled Python script and only needs an IMA API key. Before installing, verify you trust imastudio.com because your IMA_API_KEY will be sent to api.imastudio.com (video API) and imapi.liveme.com (image upload/CDN) when you upload or use local images. Be aware that local image files provided to the skill will be uploaded to IMA's OSS/CDN. The skill stores preferences and logs under ~/.openclaw (rotate/inspect these files if you care about local traces). If you reuse credentials across services, consider creating a dedicated IMA API key with minimal permissions and be prepared to rotate it if needed. Finally, if you want extra assurance, open and review scripts/ima_video_create.py and scripts/ima_logger.py (they are plain Python) and the SKILL-DETAIL.md to confirm behaviour and endpoints before use.

功能分析

Type: OpenClaw Skill Name: ima-ai-video-generator Version: 1.0.6 The IMA AI Video Generator skill is a well-documented tool for generating videos via the IMA Studio API. The core logic in `scripts/ima_video_create.py` handles task creation, status polling, and automatic parameter adjustment (reflection) to improve success rates. While the script sends the `IMA_API_KEY` to two different domains (`api.imastudio.com` and `imapi.liveme.com`), this behavior is explicitly disclosed and explained in `SKILL-DETAIL.md` as part of the service's architecture for task orchestration and media storage. The skill limits its file system access to its own configuration and log directories (`~/.openclaw/`), and no evidence of malicious intent, data exfiltration, or unauthorized execution was found.

能力评估

✓ Purpose & Capability

Name/description match the actual code and runtime requirements: the package bundles a Python script to call IMA Studio APIs and only requires python3 and IMA_API_KEY. No unrelated credentials or binaries are requested.

ℹ Instruction Scope

SKILL.md and SKILL-DETAIL.md explicitly instruct the agent to read the provided SKILL-DETAIL.md before first use and optionally read ima-knowledge-ai reference files if that other skill is installed. The script uploads local images to the provider and sends prompts to provider endpoints; it does not instruct reading arbitrary system files or exfiltrating unrelated data. Users should be aware that local image files will be uploaded (if provided) and that the skill may read a small set of OpenClaw paths (preferences and logs) and the ima-knowledge-ai references when present.

✓ Install Mechanism

No external download/install is declared (script-bundled). requirements.txt lists only 'requests'. There are no obscure URLs for code installation and included code is plain Python source.

✓ Credentials

Only IMA_API_KEY is required (declared as primary). The code uses that key to authenticate to two IMA-owned domains (api.imastudio.com and imapi.liveme.com) as documented in SKILL.md; this is proportionate to image upload and task creation.

ℹ Persistence & Privilege

The skill writes preferences and logs under ~/.openclaw (ima_prefs.json and ~/.openclaw/logs/ima_skills/). always is false and the skill is user-invocable. Writing prefs/logs is reasonable for this skill type, but note logs and prefs live in the user's OpenClaw directory and the skill may read ima-knowledge-ai references if that skill is installed.

版本历史

v1.0.6

Explicitly forbid limit parameter when reading SKILL-DETAIL.md

v1.0.5

Emphasize mandatory SKILL-DETAIL.md read for better agent success rate

v1.0.4

Fill 12K embedding budget completely - eliminate script code leaking into embedding

v1.0.3

Re-publish to trigger embedding index generation

v1.0.2

Fix metadata: primaryEnv, homepage, requires.bins per OpenClaw schema

v1.0.1

Add runtime/packages declarations (python3, requests) to fix suspicious flag

v1.0.0

IMA AI Video Generator — Version 1.0.0 - Initial release: AI-powered video generation skill for text-to-video, image-to-video, reference-image, and first-last-frame modes. - Supports top models such as Wan 2.6, Kling O1/2.6, Hailuo 2.0/2.3, Vidu Q2, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, SeeDance 1.5 Pro. - User-oriented model selection, alias parsing, and preference memory for streamlined experience. - Built-in knowledge base integration for guidance on mode selection and visual consistency. - Customizable video duration, aspect ratio, resolution, and budget options. - Full UX protocol for smooth progress, results delivery, and friendly error messages.

元数据

Slug ima-ai-video-generator

版本 1.0.6

许可证 MIT-0

累计安装 2

当前安装数 1

历史版本数 7

常见问题

IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation 是什么？

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video ge... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 1250 次。

如何安装 IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ima-ai-video-generator」即可一键安装，无需额外配置。

IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation 是免费的吗？

是的，IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation 支持哪些平台？

IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation？

由 Dai Shuo（@dai-shuo）开发并维护，当前版本 v1.0.6。

IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation