
IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation

Author: Dai Shuo · GitHub · v1.0.6 · MIT-0
cross-platform · ✓ Security check passed
Total downloads: 1250 · Favorites: 2 · Current installs: 1 · Versions: 7
Install in OpenClaw
/install ima-ai-video-generator
Description
AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video ge...
Usage instructions (SKILL.md)

IMA Video AI — Video Generator

For complete API documentation, security details, all parameters, and Python examples, read SKILL-DETAIL.md. ⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first video generation call. It contains the full API payload structure, error handling tables, and UX protocol that this summary omits. Skipping it or reading only part of it causes parameter errors and poor user experience.

Model ID Reference (CRITICAL)

Use exact model_id from this table. Do NOT infer from friendly names.

| Friendly Name | model_id (t2v) | model_id (i2v) | Notes |
| --- | --- | --- | --- |
| Wan 2.6 | wan2.6-t2v | wan2.6-i2v | ⚠️ -t2v/-i2v suffix |
| Kling O1 | kling-video-o1 | kling-video-o1 | ⚠️ video- prefix |
| Kling 2.6 | kling-v2-6 | kling-v2-6 | ⚠️ v prefix |
| Hailuo 2.3 | MiniMax-Hailuo-2.3 | MiniMax-Hailuo-2.3 | ⚠️ MiniMax- prefix |
| Hailuo 2.0 | MiniMax-Hailuo-02 | MiniMax-Hailuo-02 | ⚠️ 02 not 2.0 |
| Vidu Q2 | viduq2 | viduq2-pro | ⚠️ Different for t2v/i2v |
| Google Veo 3.1 | veo-3.1-generate-preview | veo-3.1-generate-preview | ⚠️ -generate-preview suffix |
| Sora 2 Pro | sora-2-pro | sora-2-pro | ✅ Straightforward |
| Pixverse V5.5 | pixverse | pixverse | ✅ Same as friendly name |
| SeeDance 1.5 Pro | doubao-seedance-1.5-pro | doubao-seedance-1.5-pro | ⚠️ doubao- prefix |

User input aliases: 万/Wan → wan2.6-* · 可灵/Kling → kling-video-o1 · 海螺/Hailuo → MiniMax-Hailuo-2.3 · Veo/Google Veo → veo-3.1-generate-preview
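
The table and aliases above can be captured in a small lookup so that the exact model_id is never inferred from a friendly name. This is only a sketch: the function name `resolve_model_id` and the `image_input` flag are hypothetical, and choosing the i2v column for any image-based task is an assumption, since the table lists only t2v and i2v columns.

```python
# Hypothetical lookup helper; the IDs are copied from the reference table.
MODEL_IDS = {
    # friendly name: (t2v model_id, i2v model_id)
    "Wan 2.6": ("wan2.6-t2v", "wan2.6-i2v"),
    "Kling O1": ("kling-video-o1", "kling-video-o1"),
    "Kling 2.6": ("kling-v2-6", "kling-v2-6"),
    "Hailuo 2.3": ("MiniMax-Hailuo-2.3", "MiniMax-Hailuo-2.3"),
    "Hailuo 2.0": ("MiniMax-Hailuo-02", "MiniMax-Hailuo-02"),
    "Vidu Q2": ("viduq2", "viduq2-pro"),
    "Google Veo 3.1": ("veo-3.1-generate-preview", "veo-3.1-generate-preview"),
    "Sora 2 Pro": ("sora-2-pro", "sora-2-pro"),
    "Pixverse V5.5": ("pixverse", "pixverse"),
    "SeeDance 1.5 Pro": ("doubao-seedance-1.5-pro", "doubao-seedance-1.5-pro"),
}

# User input aliases from the line above, keyed in lowercase.
ALIASES = {
    "万": "Wan 2.6", "wan": "Wan 2.6",
    "可灵": "Kling O1", "kling": "Kling O1",
    "海螺": "Hailuo 2.3", "hailuo": "Hailuo 2.3",
    "veo": "Google Veo 3.1", "google veo": "Google Veo 3.1",
}


def resolve_model_id(name: str, image_input: bool) -> str:
    """Map a friendly name or alias to the exact model_id (KeyError if unknown)."""
    key = name.strip()
    friendly = ALIASES.get(key.lower(), key)
    t2v_id, i2v_id = MODEL_IDS[friendly]
    # Assumption: any task that takes an image uses the i2v column.
    return i2v_id if image_input else t2v_id
```

For example, `resolve_model_id("Vidu Q2", image_input=True)` picks `viduq2-pro`, the one row where the two columns differ.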

Video Generation Modes

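| User intent | task_type | When to use |
| --- | --- | --- |
| Text only, no image | text_to_video | "生成一段…视频" ("generate a … video") / "text to video" |
| Image as first frame | image_to_video | "把这张图动起来" ("animate this image") / "图生视频" ("image to video") |
| Image as visual reference (not first frame) | reference_image_to_video | "参考这张图生成" ("generate using this image as reference") / "像这张风格" ("in the style of this image") |
| Two images (start + end) | first_last_frame_to_video | "首帧+尾帧" ("first frame + last frame") / "从A过渡到B" ("transition from A to B") |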
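
The mode table can be read as a small decision function. The sketch below is an assumption about how the rows combine: `choose_task_type`, `image_count`, and `as_reference` are hypothetical names, and rejecting more than two non-reference images is a choice made here, not something the table states.

```python
def choose_task_type(image_count: int, as_reference: bool = False) -> str:
    """Pick a task_type from the number of images and how they should be used."""
    if image_count < 0:
        raise ValueError("image_count must be non-negative")
    if image_count == 0:
        return "text_to_video"
    if as_reference:
        # Image guides the look but is not the first frame.
        return "reference_image_to_video"
    if image_count == 1:
        return "image_to_video"
    if image_count == 2:
        # Start + end frames.
        return "first_last_frame_to_video"
    raise ValueError("more than two images is only handled in reference mode in this sketch")
```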

Visual Consistency (IMPORTANT)

If user mentions "same character", "series", "multi-shot", or continues from a previous generation:

  • Do NOT use text_to_video (will produce different-looking results)
  • Use image_to_video or reference_image_to_video with previous result as reference
  • Read ima-knowledge-ai/references/visual-consistency.md if available

Pre-Check: Knowledge Base

If ima-knowledge-ai is installed, read before generating:

  1. ima-knowledge-ai/references/video-modes.md — understand mode differences
  2. ima-knowledge-ai/references/visual-consistency.md — if multi-shot or character continuity needed

If not installed: use this SKILL's model table and defaults.

Model Selection Priority

  1. User preference (if explicitly stated) → highest priority
  2. ima-knowledge-ai recommendation (if installed)
  3. Fallback defaults:
| Task | Default Model | model_id | Cost |
| --- | --- | --- | --- |
| text_to_video | Wan 2.6 | wan2.6-t2v | 25 pts |
| text_to_video (premium) | Hailuo 2.3 | MiniMax-Hailuo-2.3 | 38 pts |
| text_to_video (budget) | Vidu Q2 | viduq2 | 5 pts |
| image_to_video | Wan 2.6 | wan2.6-i2v | 25 pts |
| first_last_frame | Kling O1 | kling-video-o1 | 48 pts |
| reference_image | Kling O1 | kling-video-o1 | 48 pts |
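
The three-level priority can be sketched as a simple fall-through. The function and parameter names are hypothetical, and mapping the table's `first_last_frame` and `reference_image` rows onto the full task_type names is an assumption.

```python
from typing import Optional

# Standard (non-premium, non-budget) fallback defaults from the table.
FALLBACK_DEFAULTS = {
    "text_to_video": "wan2.6-t2v",
    "image_to_video": "wan2.6-i2v",
    "first_last_frame_to_video": "kling-video-o1",
    "reference_image_to_video": "kling-video-o1",
}


def select_model(
    task_type: str,
    user_preference: Optional[str] = None,
    kb_recommendation: Optional[str] = None,
) -> str:
    """Return a model_id: user preference, then knowledge base, then fallback."""
    if user_preference:
        return user_preference
    if kb_recommendation:
        return kb_recommendation
    return FALLBACK_DEFAULTS[task_type]
```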

User Input Parsing

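  • Duration: 5秒→5 · 10秒→10 · 15秒→15 · 1分钟→use the maximum and tell the user "当前最长15秒" ("the current maximum is 15 seconds")
  • Aspect ratio: 横屏/16:9→16:9 · 竖屏/9:16→9:16 · 方形/1:1→1:1 (横屏 = landscape, 竖屏 = portrait, 方形 = square)
  • Resolution: 720P/1080P/4K (if the model supports it)
  • Budget: 最便宜 ("cheapest") → Vidu Q2 (5 pts) · 最好 ("best") → Kling O1 or Veo 3.1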
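
As an illustration of the duration and aspect-ratio rules, here is a minimal parsing sketch. The regular expressions and function names are assumptions; the source lists only 5, 10, and 15 seconds, so whether a given model accepts other values is not covered here.

```python
import re
from typing import Optional, Tuple

MAX_DURATION_S = 15  # from the "当前最长15秒" rule

ASPECT_KEYWORDS = {
    "横屏": "16:9", "16:9": "16:9",
    "竖屏": "9:16", "9:16": "9:16",
    "方形": "1:1", "1:1": "1:1",
}


def parse_duration(text: str) -> Tuple[Optional[int], bool]:
    """Return (seconds, was_capped); seconds is None if no duration is found."""
    minutes = re.search(r"(\d+)\s*分钟", text)
    if minutes:
        seconds = int(minutes.group(1)) * 60
    else:
        secs = re.search(r"(\d+)\s*秒", text)
        if not secs:
            return None, False
        seconds = int(secs.group(1))
    # Requests above the maximum are capped so the caller can notify the user.
    return min(seconds, MAX_DURATION_S), seconds > MAX_DURATION_S


def parse_aspect_ratio(text: str) -> Optional[str]:
    """Return the first matching aspect ratio keyword, or None."""
    for keyword, ratio in ASPECT_KEYWORDS.items():
        if keyword in text:
            return ratio
    return None
```

When `was_capped` is true, the caller would send the "当前最长15秒" notice before generating.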

Script Usage

# Text to video
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type text_to_video \
  --model-id wan2.6-t2v \
  --prompt "a puppy runs across a sunny meadow" \
  --user-id {user_id} \
  --output-json

# Image to video (accepts URLs and local file paths)
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type image_to_video \
  --model-id wan2.6-i2v \
  --prompt "camera slowly zooms in" \
  --input-images https://example.com/photo.jpg \
  --user-id {user_id} \
  --output-json

# First-last frame (exactly 2 images required)
python3 {baseDir}/scripts/ima_video_create.py \
  --api-key $IMA_API_KEY \
  --task-type first_last_frame_to_video \
  --model-id kling-video-o1 \
  --prompt "smooth transition" \
  --input-images first.jpg last.jpg \
  --user-id {user_id} \
  --output-json
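
When the script is invoked from Python rather than a shell, the same flags can be assembled as an argument list. This is a sketch under assumptions: `build_create_command` is a hypothetical helper, the image-count checks reflect only the "exactly 2 images required" comment plus a guess that image tasks need at least one image, and the shape of the script's JSON output is not shown here because this summary does not describe it.

```python
from pathlib import Path
from typing import List, Sequence


def build_create_command(
    base_dir: str,
    api_key: str,
    task_type: str,
    model_id: str,
    prompt: str,
    user_id: str,
    input_images: Sequence[str] = (),
) -> List[str]:
    """Build the argv list for scripts/ima_video_create.py (does not run it)."""
    images = list(input_images)
    if task_type == "first_last_frame_to_video" and len(images) != 2:
        raise ValueError("first_last_frame_to_video requires exactly 2 images")
    if task_type in ("image_to_video", "reference_image_to_video") and not images:
        raise ValueError(f"{task_type} requires at least one image")

    cmd = [
        "python3", str(Path(base_dir) / "scripts" / "ima_video_create.py"),
        "--api-key", api_key,
        "--task-type", task_type,
        "--model-id", model_id,
        "--prompt", prompt,
    ]
    if images:
        cmd += ["--input-images", *images]
    cmd += ["--user-id", user_id, "--output-json"]
    return cmd
```

The returned list can be passed to `subprocess.run(cmd, capture_output=True, text=True)`; passing a list avoids shell quoting problems with prompts that contain spaces or quotes.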

Sending Results to User

# ✅ CORRECT: Use the remote URL directly for inline video playback
# Caption reads: "Video generated successfully! • Model • Time taken • Credits"
message(action="send", media=video_url, caption="✅ 视频生成成功!\n• 模型:[Name]\n• 耗时:[X]s\n• 积分:[N pts]")

# Then send the link as plain text so it can be copied ("Video link:")
message(action="send", message=f"🔗 视频链接:\n{video_url}")

# ❌ WRONG: Never download to a local file (it shows as an attachment, not playable inline)

UX Protocol (Brief)

  1. Pre-generation: "🎬 开始生成视频… 模型:[Name],预计[X~Y]秒,消耗[N]积分" ("Starting video generation… Model: [Name], estimated [X~Y] s, costs [N] credits")
  2. Progress: every 30-60 s, send "⏳ 视频生成中… [P]%" ("Generating video… [P]%"); cap [P] at 95 until the API returns success
  3. Success: send the video via media=video_url, then send the link as text for sharing
  4. Failure: explain the error in natural language and suggest alternative models. Never show technical errors to users. See SKILL-DETAIL.md for the full error translation table.

Never mention to users: script names, API endpoints, attribute_id, or technical parameter names. Mention only: model name · time · credits · result · status.
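
The 95% cap in the progress step can be sketched as follows. How [P] is actually computed is not stated in this summary; estimating it from elapsed time over the expected duration is an assumption, as are the function names.

```python
def progress_percent(elapsed_s: float, expected_s: float, succeeded: bool = False) -> int:
    """Estimate progress, holding at 95 until the API reports success."""
    if succeeded:
        return 100
    if expected_s <= 0:
        return 0
    return min(95, int(100 * elapsed_s / expected_s))


def progress_message(elapsed_s: float, expected_s: float) -> str:
    """Format the user-facing progress line (no technical details)."""
    return f"⏳ 视频生成中… {progress_percent(elapsed_s, expected_s)}%"
```

Holding at 95 avoids showing 100% while the task is still running past its estimate.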

Sora 2 Pro Content Policy

Sora 2 Pro enforces strict OpenAI content-safety rules. ❌ Not allowed: people, celebrities, IP assets. ✅ Allowed: landscapes, abstract, animals, nature. If a prompt is rejected, suggest Wan 2.6 or Kling O1.

Environment

Base URL: https://api.imastudio.com
Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en
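
For direct HTTP calls, the headers listed above could be assembled like this. A minimal sketch: `build_headers` is a hypothetical helper, and the header names are copied exactly as written in this summary, including the underscore in `x_app_language`.

```python
from typing import Dict

BASE_URL = "https://api.imastudio.com"


def build_headers(api_key: str, language: str = "en") -> Dict[str, str]:
    """Return the request headers listed in the Environment section."""
    return {
        "Authorization": f"Bearer {api_key}",
        "x-app-source": "ima_skills",
        "x_app_language": language,
    }
```

In practice the key would come from the environment, for example `build_headers(os.environ["IMA_API_KEY"])`.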

Core Flow

  1. GET /open/v1/product/list?app=ima&platform=web&category=<task_type> → get attribute_id, credit, model_version, form_config
  2. [Image tasks only] Upload images or pass local paths to script
  3. POST /open/v1/tasks/create → get task_id
  4. POST /open/v1/tasks/detail → poll every 8s until resource_status==1

MANDATORY: Always query product list first. attribute_id is required — if 0 or missing, task fails.
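
The attribute_id check and the 8-second polling loop can be sketched without committing to a payload format. Everything about the response shape here is an assumption: `fetch_detail` is a hypothetical callable wrapping POST /open/v1/tasks/detail, the top-level `resource_status` key and the timeout are illustrative, and the real payload structure is documented in SKILL-DETAIL.md.

```python
import time
from typing import Any, Callable, Dict


def require_attribute_id(product: Dict[str, Any]) -> Any:
    """Fail early if the product entry has no usable attribute_id."""
    attribute_id = product.get("attribute_id")
    if not attribute_id:  # covers both 0 and missing
        raise ValueError("attribute_id is 0 or missing; task creation would fail")
    return attribute_id


def poll_until_ready(
    fetch_detail: Callable[[str], Dict[str, Any]],
    task_id: str,
    interval_s: float = 8.0,
    timeout_s: float = 600.0,  # assumed limit, not from the source
    sleep: Callable[[float], None] = time.sleep,
) -> Dict[str, Any]:
    """Call fetch_detail every interval_s until resource_status == 1."""
    waited = 0.0
    while True:
        detail = fetch_detail(task_id)
        if detail.get("resource_status") == 1:
            return detail
        if waited >= timeout_s:
            raise TimeoutError(f"task {task_id} not ready after {waited:.0f}s")
        sleep(interval_s)
        waited += interval_s
```

Injecting `fetch_detail` and `sleep` keeps the loop testable without network access or real waiting.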

User Preference Memory

Storage: ~/.openclaw/memory/ima_prefs.json

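  • Save when the user explicitly says "用XXX" ("use XXX") / "默认用XXX" ("use XXX by default") / "always use XXX"
  • Clear when the user says "用最好的" ("use the best one") / "推荐一个" ("recommend one") / "自动选择" ("choose automatically")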
  • Never save auto-selected or fallback models as preferences
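
The save and clear rules above can be sketched as a small read-modify-write on the preferences file. The JSON schema (a flat map from task_type to model_id), the phrase matching, and all function names are assumptions; the actual file format is not described in this summary.

```python
import json
from pathlib import Path
from typing import Dict, Optional

PREFS_PATH = Path.home() / ".openclaw" / "memory" / "ima_prefs.json"
CLEAR_PHRASES = ("用最好的", "推荐一个", "自动选择")
SAVE_PHRASES = ("默认用", "always use")


def classify_utterance(text: str) -> Optional[str]:
    """Return "clear", "save", or None. Clear is checked first because
    "用最好的" also starts with "用"."""
    lowered = text.strip().lower()
    if any(phrase in lowered for phrase in CLEAR_PHRASES):
        return "clear"
    if lowered.startswith("用") or any(phrase in lowered for phrase in SAVE_PHRASES):
        return "save"
    return None


def load_prefs(path: Path = PREFS_PATH) -> Dict[str, str]:
    try:
        return json.loads(Path(path).read_text(encoding="utf-8"))
    except FileNotFoundError:
        return {}


def apply_utterance(text: str, task_type: str, model_id: Optional[str],
                    path: Path = PREFS_PATH) -> Dict[str, str]:
    """Save or clear the preference for task_type only on an explicit request."""
    prefs = load_prefs(path)
    action = classify_utterance(text)
    if action == "save" and model_id:
        prefs[task_type] = model_id
    elif action == "clear":
        prefs.pop(task_type, None)
    else:
        return prefs  # auto-selected or fallback models are never written
    Path(path).parent.mkdir(parents=True, exist_ok=True)
    Path(path).write_text(json.dumps(prefs, ensure_ascii=False, indent=2), encoding="utf-8")
    return prefs
```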

Pixverse Special Case (v1.0.7+)

Pixverse V5.5/V5/V4 lack model in form_config. Script auto-infers from model_name (e.g. "Pixverse V5.5" → model: "v5.5"). No manual action needed.
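
The auto-inference described above might look like the sketch below. This is not the script's actual code: the regular expression and the function name are assumptions, and only the "Pixverse V5.5" → "v5.5" mapping is confirmed by the text.

```python
import re
from typing import Optional


def infer_pixverse_model(model_name: str) -> Optional[str]:
    """Extract a lowercase version tag such as "v5.5" from a model name."""
    match = re.search(r"\bV(\d+(?:\.\d+)?)\b", model_name, flags=re.IGNORECASE)
    return f"v{match.group(1)}" if match else None
```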


⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating video. This file is a summary — SKILL-DETAIL.md has the complete API reference, error translation table, and UX protocol you need for correct execution.

Security recommendations
This skill appears coherent: it uses a bundled Python script and only needs an IMA API key. Before installing, verify you trust imastudio.com because your IMA_API_KEY will be sent to api.imastudio.com (video API) and imapi.liveme.com (image upload/CDN) when you upload or use local images. Be aware that local image files provided to the skill will be uploaded to IMA's OSS/CDN. The skill stores preferences and logs under ~/.openclaw (rotate/inspect these files if you care about local traces). If you reuse credentials across services, consider creating a dedicated IMA API key with minimal permissions and be prepared to rotate it if needed. Finally, if you want extra assurance, open and review scripts/ima_video_create.py and scripts/ima_logger.py (they are plain Python) and the SKILL-DETAIL.md to confirm behaviour and endpoints before use.
Functional analysis
Type: OpenClaw Skill
Name: ima-ai-video-generator
Version: 1.0.6
The IMA AI Video Generator skill is a well-documented tool for generating videos via the IMA Studio API. The core logic in `scripts/ima_video_create.py` handles task creation, status polling, and automatic parameter adjustment (reflection) to improve success rates. While the script sends the `IMA_API_KEY` to two different domains (`api.imastudio.com` and `imapi.liveme.com`), this behavior is explicitly disclosed and explained in `SKILL-DETAIL.md` as part of the service's architecture for task orchestration and media storage. The skill limits its file system access to its own configuration and log directories (`~/.openclaw/`), and no evidence of malicious intent, data exfiltration, or unauthorized execution was found.
Capability assessment
Purpose & Capability
Name/description match the actual code and runtime requirements: the package bundles a Python script to call IMA Studio APIs and only requires python3 and IMA_API_KEY. No unrelated credentials or binaries are requested.
Instruction Scope
SKILL.md and SKILL-DETAIL.md explicitly instruct the agent to read the provided SKILL-DETAIL.md before first use and optionally read ima-knowledge-ai reference files if that other skill is installed. The script uploads local images to the provider and sends prompts to provider endpoints; it does not instruct reading arbitrary system files or exfiltrating unrelated data. Users should be aware that local image files will be uploaded (if provided) and that the skill may read a small set of OpenClaw paths (preferences and logs) and the ima-knowledge-ai references when present.
Install Mechanism
No external download/install is declared (script-bundled). requirements.txt lists only 'requests'. There are no obscure URLs for code installation and included code is plain Python source.
Credentials
Only IMA_API_KEY is required (declared as primary). The code uses that key to authenticate to two IMA-owned domains (api.imastudio.com and imapi.liveme.com) as documented in SKILL.md; this is proportionate to image upload and task creation.
Persistence & Privilege
The skill writes preferences and logs under ~/.openclaw (ima_prefs.json and ~/.openclaw/logs/ima_skills/). always is false and the skill is user-invocable. Writing prefs/logs is reasonable for this skill type, but note logs and prefs live in the user's OpenClaw directory and the skill may read ima-knowledge-ai references if that skill is installed.
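How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install ima-ai-video-generator
  3. Once installed, invoke the Skill by name or trigger it with /ima-ai-video-generator
  4. Provide the required inputs as described in the Skill's parameter documentation to get structured output
Version history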
v1.0.6
Explicitly forbid limit parameter when reading SKILL-DETAIL.md
v1.0.5
Emphasize mandatory SKILL-DETAIL.md read for better agent success rate
v1.0.4
Fill 12K embedding budget completely - eliminate script code leaking into embedding
v1.0.3
Re-publish to trigger embedding index generation
v1.0.2
Fix metadata: primaryEnv, homepage, requires.bins per OpenClaw schema
v1.0.1
Add runtime/packages declarations (python3, requests) to fix suspicious flag
v1.0.0
IMA AI Video Generator, Version 1.0.0
  • Initial release: AI-powered video generation skill for text-to-video, image-to-video, reference-image, and first-last-frame modes.
  • Supports top models such as Wan 2.6, Kling O1/2.6, Hailuo 2.0/2.3, Vidu Q2, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, SeeDance 1.5 Pro.
  • User-oriented model selection, alias parsing, and preference memory for a streamlined experience.
  • Built-in knowledge base integration for guidance on mode selection and visual consistency.
  • Customizable video duration, aspect ratio, resolution, and budget options.
  • Full UX protocol for smooth progress updates, results delivery, and friendly error messages.
Metadata
Slug: ima-ai-video-generator
Version: 1.0.6
License: MIT-0
Total installs: 2
Current installs: 1
Historical versions: 7
FAQ

What is IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation?

AI video generator with premier models: Wan 2.6, Kling O1/2.6, Google Veo 3.1, Sora 2 Pro, Pixverse V5.5, Hailuo 2.0/2.3, SeeDance 1.5 Pro, Vidu Q2. Video ge... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 1250 total downloads so far.

How do I install IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation?

Run the command "/install ima-ai-video-generator" in the OpenClaw or Claude Code chat box for a one-step install. The install itself needs no extra configuration; to run, the skill requires python3 and an IMA_API_KEY.

Is IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation free?

Yes. The skill is completely free under the MIT-0 license and can be freely downloaded, installed, and used. Generating videos consumes IMA credits per task (for example, 25 pts for Wan 2.6).

Which platforms does IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation support?

It is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who developed IMA AI Video Generator — Short & Promo Video, Text to Video, Image to Video Generation?

It is developed and maintained by Dai Shuo (@dai-shuo); the current version is v1.0.6.
