Description

Get animated motion videos ready to post, without touching a single slider. Upload your still images (JPG, PNG, WEBP, HEIC, up to 200MB), say something like...

README (SKILL.md)

Getting Started

Got still images to work with? Send it over and tell me what you need — I'll take care of the AI motion video creation.

Try saying:

"convert a single product photo or landscape image into a 1080p MP4"
"animate this image with a slow zoom and subtle motion effects"
"turning still photos into animated video clips for social media creators"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer \x3Ctoken>, Content-Type: application/json, and body {"task_name":"project","language":"\x3Cdetected>"}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.

AI Image to Video Motion — Animate Images into Motion Videos

Name: Ai Image To Video Motion
Author: vcarolxhberger

Drop your still images in the chat and tell me what you need. I'll handle the AI motion video creation on cloud GPUs — you don't need anything installed locally.

Here's a typical use: you send a a single product photo or landscape image, ask for animate this image with a slow zoom and subtle motion effects, and about 20-45 seconds later you've got a MP4 file ready to download. The whole thing runs at 1080p by default.

One thing worth knowing — high-contrast images with clear subjects produce the most visible motion effects.

Matching Input to Actions

User prompts referencing ai image to video motion, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

Session — POST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":"\x3Clang>"}. Gives you a session_id.
Chat (SSE) — POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
Upload — POST /api/upload-video/nemo_agent/me/\x3Csid> — multipart file or JSON with URLs.
Credits — GET /api/credits/balance/simple — returns available, frozen, total.
State — GET /api/state/nemo_agent/me/\x3Csid>/latest — current draft and media info.
Export — POST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s for completed status and download URL.

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

Header	Value
`X-Skill-Source`	`ai-image-to-video-motion`
`X-Skill-Version`	frontmatter `version`
`X-Skill-Platform`	auto-detect: `clawhub` / `cursor` / `unknown` from install path

Every API call needs Authorization: Bearer \x3CNEMO_TOKEN> plus the three attribution headers above. If any header is missing, exports return 402.

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says	You do
"click [button]" / "点击"	Execute via API
"open [panel]" / "打开"	Query session state
"drag/drop" / "拖拽"	Send edit via SSE
"preview in timeline"	Show track summary
"Export button" / "导出"	Execute export workflow

SSE Event Handling

Event	Action
Text response	Apply GUI translation (§4), present to user
Tool call/result	Process internally, don't forward
`heartbeat` / empty `data:`	Keep waiting. Every 2 min: "⏳ Still working..."
Stream closes	Process final response

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.

Error Handling

Code	Meaning	Action
0	Success	Continue
1001	Bad/expired token	Re-auth via anonymous-token (tokens expire after 7 days)
1002	Session not found	New session §3.0
2001	No credits	Anonymous: show registration URL with `?bind=\x3Cid>` (get `\x3Cid>` from create-session or state response when needed). Registered: "Top up credits in your account"
4001	Unsupported file	Show supported formats
4002	File too large	Suggest compress/trim
400	Missing X-Client-Id	Generate Client-Id and retry (see §1)
402	Free plan export blocked	Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export."
429	Rate limit (1 token/client/7 days)	Retry in 30s once

Common Workflows

Quick edit: Upload → "animate this image with a slow zoom and subtle motion effects" → Download MP4. Takes 20-45 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "animate this image with a slow zoom and subtle motion effects" — concrete instructions get better results.

Max file size is 200MB. Stick to JPG, PNG, WEBP, HEIC for the smoothest experience.

Export as MP4 for widest compatibility across social platforms.

Usage Guidance

This skill appears to be a straightforward client for a third‑party rendering backend (mega-api-prod.nemovideo.ai). Before installing, consider: 1) NEMO_TOKEN access — the skill will use that token for all API calls; if you don't provide one it will automatically request an anonymous token (100 credits, 7 days) and store session info locally — ask where tokens/session IDs will be saved and for how long. 2) Metadata mismatch — SKILL.md references a config path (~/.config/nemovideo/) while the registry metadata lists no required config paths; ask the author to clarify storage location. 3) Privacy — uploaded images and any text prompts will be sent to the nemovideo backend; avoid uploading sensitive images or PII unless you trust that service and its privacy policy. 4) Attribution headers and auto-detection of install path are benign but you may want to confirm what 'X-Skill-Platform' detection reads from (does it access filesystem paths?). If these points are acceptable or clarified by the developer, the skill is coherent for its stated purpose.

Capability Analysis

Type: OpenClaw Skill Name: ai-image-to-video-motion Version: 1.0.0 The skill acts as a functional wrapper for an AI image-to-video animation service hosted at mega-api-prod.nemovideo.ai. It includes standard procedures for session management, file uploads, and polling for render results, with no evidence of data exfiltration, unauthorized local execution, or malicious prompt injection. The use of environment variables and anonymous token generation is consistent with the stated purpose of the tool.

Capability Assessment

✓ Purpose & Capability

Name and description (animate still images into short MP4s) match what the SKILL.md instructs: uploading images, creating sessions, SSE-based edits, and export endpoints. The single required credential (NEMO_TOKEN) is appropriate for a third-party API.

ℹ Instruction Scope

Instructions stay within the stated purpose (upload images, run SSE chat edits, poll render status, return download URL). The skill will auto-generate an anonymous token and call the service endpoints if NEMO_TOKEN is not present, and it instructs storing a session_id for later requests. This is expected for a cloud-rendering workflow, but automatic token generation/storage should be called out to users and requires storage of credentials/session state.

✓ Install Mechanism

Instruction-only skill with no install spec and no code files — lowest install risk. Nothing is downloaded or written by an installer step described in SKILL.md.

ℹ Credentials

Only NEMO_TOKEN is declared as required, which is proportional. However, the SKILL.md frontmatter includes a configPaths entry (~/.config/nemovideo/) while the registry metadata shows 'Required config paths: none' — this mismatch is an incoherence you may want clarified. The skill also requires adding attribution headers to API calls (not env secrets) which is reasonable but noteworthy.

✓ Persistence & Privilege

Skill is not forced-always; it is user-invocable and allowed to invoke autonomously (platform default). It asks to store session state (session_id and possibly anonymous token) for subsequent requests — normal for a service client, but you should confirm where/how long tokens are persisted.

Version History

v1.0.0

AI Image to Video Motion skill — first public release: - Instantly animate still images into 1080p motion videos via simple natural language prompts; outputs MP4 ready for social sharing. - Handles image upload (JPG, PNG, WEBP, HEIC up to 200MB), cloud-side processing, and export, all with minimal user setup. - Automatic backend connection and 7-day free anonymous token registration (100 credits on signup). - Supports actions like export, credits check, timeline state, and file upload via intent classification. - Real-time status updates, robust error messaging, and workflow tips for social media creators. - Requires environment variable setup (NEMO_TOKEN) and attribution headers for all API calls.

Metadata

Slug ai-image-to-video-motion

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Ai Image To Video Motion?

Get animated motion videos ready to post, without touching a single slider. Upload your still images (JPG, PNG, WEBP, HEIC, up to 200MB), say something like... It is an AI Agent Skill for Claude Code / OpenClaw, with 64 downloads so far.

How do I install Ai Image To Video Motion?

Run "/install ai-image-to-video-motion" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Ai Image To Video Motion free?

Yes, Ai Image To Video Motion is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Ai Image To Video Motion support?

Ai Image To Video Motion is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Ai Image To Video Motion?

It is built and maintained by vcarolxhberger (@vcarolxhberger); the current version is v1.0.0.

More Skills

Ai Image To Video Motion