vcarolxhberger

Ai Image To Video Motion

by vcarolxhberger · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 64 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install ai-image-to-video-motion
Description
Get animated motion videos ready to post, without touching a single slider. Upload your still images (JPG, PNG, WEBP, HEIC, up to 200MB), say something like...
README (SKILL.md)

Getting Started

Got still images to work with? Send them over and tell me what you need, and I'll take care of the AI motion video creation.

Try saying:

  • "convert a single product photo or landscape image into a 1080p MP4"
  • "animate this image with a slow zoom and subtle motion effects"
  • "turn still photos into animated video clips for social media"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

  1. Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
  2. Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer <token>, Content-Type: application/json, and body {"task_name":"project","language":"<detected>"}. Store the returned session_id for all subsequent requests.
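
The two setup steps above could be sketched as follows. This is a minimal illustration using only Python's standard library: it builds the requests but does not send them. The helper names (`build_token_request`, `build_session_request`, `first_request`) and the fallback language `"en"` are assumptions made for the example, and the attribution headers required elsewhere in this file are left out for brevity.

```python
import json
import os
import uuid
import urllib.request

BASE_URL = "https://mega-api-prod.nemovideo.ai"


def build_token_request(client_id: str) -> urllib.request.Request:
    """Step 1: POST to the anonymous-token endpoint with X-Client-Id."""
    return urllib.request.Request(
        f"{BASE_URL}/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )


def build_session_request(token: str, language: str) -> urllib.request.Request:
    """Step 2: POST the session body with a bearer token."""
    body = json.dumps({"task_name": "project", "language": language})
    return urllib.request.Request(
        f"{BASE_URL}/api/tasks/me/with-session/nemo_agent",
        data=body.encode("utf-8"),
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def first_request(env=os.environ) -> urllib.request.Request:
    """Skip straight to step 2 when NEMO_TOKEN is already set."""
    token = env.get("NEMO_TOKEN")
    if token:
        return build_session_request(token, "en")  # "en" is a placeholder
    # Hypothetical choice: a fresh random UUID as the client identifier.
    return build_token_request(str(uuid.uuid4()))
```

Passing a request to `urllib.request.urlopen` would actually send it; the token would then be read from `data.token` in the response, as described in step 1.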

Keep setup communication brief. Don't display raw API responses or token values to the user.

AI Image to Video Motion — Animate Images into Motion Videos

Drop your still images in the chat and tell me what you need. I'll handle the AI motion video creation on cloud GPUs — you don't need anything installed locally.

Here's a typical use: you send a single product photo or landscape image, ask me to "animate this image with a slow zoom and subtle motion effects", and about 20-45 seconds later you've got an MP4 file ready to download. The whole thing runs at 1080p by default.

One thing worth knowing — high-contrast images with clear subjects produce the most visible motion effects.

Matching Input to Actions

User prompts referencing ai image to video motion, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says... Action Skip SSE?
"export" / "导出" / "download" / "send me the video" → §3.5 Export
"credits" / "积分" / "balance" / "余额" → §3.3 Credits
"status" / "状态" / "show tracks" → §3.4 State
"upload" / "上传" / user sends file → §3.2 Upload
Everything else (generate, edit, add BGM…) → §3.1 SSE
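
The routing table above could be approximated with a plain keyword lookup, as in the sketch below. It covers only the keyword half of "keyword and intent classification"; the function name `route`, the action labels, and the substring-matching rule are assumptions for illustration, not the skill's actual classifier.

```python
# Keyword lists copied from the table above; action labels are hypothetical.
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(message: str, has_attachment: bool = False) -> str:
    """Return the action for a user message; default to the SSE chat."""
    if has_attachment:  # "user sends file" row
        return "upload"
    text = message.lower()
    for action, keywords in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"  # everything else: generate, edit, add BGM, ...
```

For example, `route("please export it")` yields `"export"`, while `route("add BGM")` falls through to `"sse"`.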

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

  1. Session: POST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":"<lang>"}. Gives you a session_id.
  2. Chat (SSE): POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
  3. Upload: POST /api/upload-video/nemo_agent/me/<sid> with a multipart file or JSON with URLs.
  4. Credits: GET /api/credits/balance/simple, which returns available, frozen, total.
  5. State: GET /api/state/nemo_agent/me/<sid>/latest, which returns the current draft and media info.
  6. Export: POST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/<id> every 30s for completed status and download URL.
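
The export polling in step 6 could look roughly like the sketch below. The HTTP call is injected as `fetch_status` so the loop stays independent of any client library. The response field names `status` and `download_url`, the `max_polls` cap, and the function name `poll_render` are assumptions; this file only says to poll every 30s for a completed status and a download URL.

```python
import time
from typing import Callable, Optional


def poll_render(
    fetch_status: Callable[[], dict],
    interval_s: float = 30.0,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Poll until the job reports completion; return the URL or None."""
    for attempt in range(max_polls):
        job = fetch_status()
        # Assumed response shape: {"status": ..., "download_url": ...}
        if job.get("status") == "completed":
            return job.get("download_url")
        if attempt < max_polls - 1:
            sleep(interval_s)  # wait before the next poll
    return None  # gave up without seeing a completed status
```

In real use, `fetch_status` would wrap a GET to /api/render/proxy/lambda/<id> with the required headers.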

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

| Header | Value |
|---|---|
| X-Skill-Source | ai-image-to-video-motion |
| X-Skill-Version | frontmatter version |
| X-Skill-Platform | auto-detect: clawhub / cursor / unknown from install path |

Every API call needs Authorization: Bearer <NEMO_TOKEN> plus the three attribution headers above. If any header is missing, exports return 402.
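
One way to assemble the per-request headers is sketched below. How the platform is detected from the install path is not specified in this file, so the substring check in `detect_platform` is purely an assumption, as are both function names.

```python
def detect_platform(install_path: str) -> str:
    """Guess the platform from the install path (assumed heuristic)."""
    path = install_path.lower()
    for name in ("clawhub", "cursor"):
        if name in path:
            return name
    return "unknown"


def build_headers(token: str, version: str, install_path: str) -> dict:
    """Bearer auth plus the three attribution headers."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "ai-image-to-video-motion",
        "X-Skill-Version": version,  # should match the frontmatter version
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the headers in one place makes it harder to omit one by accident, which matters because a missing header causes exports to return 402.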

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)
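
A summary like the one above could be derived from the draft JSON roughly as follows. Only the key meanings (`t`, `tt`, `sg`, `d`) come from this file; the nesting used here (each track holding a list of segments, each segment holding its own `d`) is an assumption, and names such as "city timelapse" are omitted because the location of labels inside `m` is not documented.

```python
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}  # tt values from this file


def summarize(draft: dict) -> str:
    """Build a one-line timeline summary from an assumed draft layout."""
    tracks = draft.get("t", [])
    parts = []
    for index, track in enumerate(tracks, start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "Other")
        # Assumed: a track's length is the sum of its segment durations (ms).
        total_ms = sum(seg.get("d", 0) for seg in track.get("sg", []))
        parts.append(f"{index}. {kind} ({total_ms / 1000:g}s)")
    return f"Timeline ({len(tracks)} tracks): " + " ".join(parts)


draft = {
    "t": [
        {"tt": 0, "sg": [{"d": 10000}]},
        {"tt": 1, "sg": [{"d": 10000}]},
        {"tt": 7, "sg": [{"d": 3000}]},
    ]
}
print(summarize(draft))
# Timeline (3 tracks): 1. Video (10s) 2. Audio (10s) 3. Text (3s)
```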

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

SSE Event Handling

| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
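
The stream handling described above could be sketched with a small parser for the standard text/event-stream framing. The backend's payload format is not documented here, so payloads are treated as opaque strings, and the function names `iter_sse_data` and `collect_text` are assumptions.

```python
from typing import Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[str]:
    """Yield each event's data payload; '' means heartbeat / empty data."""
    buffer: List[str] = []
    seen_data = False
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("data:"):
            value = line[5:]
            if value.startswith(" "):
                value = value[1:]  # one optional space after the colon
            buffer.append(value)
            seen_data = True
        elif line == "" and seen_data:
            yield "\n".join(buffer)  # a blank line ends the event
            buffer, seen_data = [], False
        # comment lines (": ...") and other fields are ignored here


def collect_text(lines: Iterable[str]) -> List[str]:
    """Keep only non-empty payloads; heartbeats are dropped."""
    return [payload for payload in iter_sse_data(lines) if payload]
```

If `collect_text` returns an empty list once the stream closes, that corresponds to the no-text case: poll the session state and summarize the change from there.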

Error Handling

| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
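
The error table maps naturally onto a lookup, as in the sketch below. The action labels and the `report_unknown_error` fallback for unlisted codes are hypothetical names chosen for the example; only the codes and their meanings come from the table.

```python
# Codes from the table above; action labels are illustrative only.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",           # bad or expired token
    1002: "create_session",           # session not found
    2001: "show_credit_options",      # no credits
    4001: "show_supported_formats",   # unsupported file
    4002: "suggest_compress_or_trim", # file too large
    400: "generate_client_id_and_retry",
    402: "prompt_register_or_upgrade",  # plan tier issue, not credits
    429: "retry_once_after_30s",
}


def next_action(code: int) -> str:
    """Return the handler label for a response code."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")
```

Keeping 402 and 2001 as separate entries reflects the distinction the table draws between a plan restriction and an empty credit balance.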

Common Workflows

Quick edit: Upload → "animate this image with a slow zoom and subtle motion effects" → Download MP4. Takes 20-45 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "animate this image with a slow zoom and subtle motion effects" — concrete instructions get better results.

Max file size is 200MB. Stick to JPG, PNG, WEBP, HEIC for the smoothest experience.

Export as MP4 for widest compatibility across social platforms.

Usage Guidance
This skill appears to be a straightforward client for a third‑party rendering backend (mega-api-prod.nemovideo.ai). Before installing, consider:
  1. NEMO_TOKEN access: the skill will use that token for all API calls; if you don't provide one it will automatically request an anonymous token (100 credits, 7 days) and store session info locally. Ask where tokens/session IDs will be saved and for how long.
  2. Metadata mismatch: SKILL.md references a config path (~/.config/nemovideo/) while the registry metadata lists no required config paths; ask the author to clarify storage location.
  3. Privacy: uploaded images and any text prompts will be sent to the nemovideo backend; avoid uploading sensitive images or PII unless you trust that service and its privacy policy.
  4. Attribution headers: these and the auto-detection of install path are benign, but you may want to confirm what 'X-Skill-Platform' detection reads from (does it access filesystem paths?).
If these points are acceptable or clarified by the developer, the skill is coherent for its stated purpose.
Capability Analysis
Type: OpenClaw Skill Name: ai-image-to-video-motion Version: 1.0.0 The skill acts as a functional wrapper for an AI image-to-video animation service hosted at mega-api-prod.nemovideo.ai. It includes standard procedures for session management, file uploads, and polling for render results, with no evidence of data exfiltration, unauthorized local execution, or malicious prompt injection. The use of environment variables and anonymous token generation is consistent with the stated purpose of the tool.
Capability Assessment
Purpose & Capability
Name and description (animate still images into short MP4s) match what the SKILL.md instructs: uploading images, creating sessions, SSE-based edits, and export endpoints. The single required credential (NEMO_TOKEN) is appropriate for a third-party API.
Instruction Scope
Instructions stay within the stated purpose (upload images, run SSE chat edits, poll render status, return download URL). The skill will auto-generate an anonymous token and call the service endpoints if NEMO_TOKEN is not present, and it instructs storing a session_id for later requests. This is expected for a cloud-rendering workflow, but automatic token generation/storage should be called out to users and requires storage of credentials/session state.
Install Mechanism
Instruction-only skill with no install spec and no code files — lowest install risk. Nothing is downloaded or written by an installer step described in SKILL.md.
Credentials
Only NEMO_TOKEN is declared as required, which is proportional. However, the SKILL.md frontmatter includes a configPaths entry (~/.config/nemovideo/) while the registry metadata shows 'Required config paths: none' — this mismatch is an incoherence you may want clarified. The skill also requires adding attribution headers to API calls (not env secrets) which is reasonable but noteworthy.
Persistence & Privilege
Skill is not forced-always; it is user-invocable and allowed to invoke autonomously (platform default). It asks to store session state (session_id and possibly anonymous token) for subsequent requests — normal for a service client, but you should confirm where/how long tokens are persisted.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ai-image-to-video-motion
  3. After installation, invoke the skill by name or use /ai-image-to-video-motion
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
AI Image to Video Motion skill, first public release:
  • Instantly animate still images into 1080p motion videos via simple natural language prompts; outputs MP4 ready for social sharing.
  • Handles image upload (JPG, PNG, WEBP, HEIC up to 200MB), cloud-side processing, and export, all with minimal user setup.
  • Automatic backend connection and 7-day free anonymous token registration (100 credits on signup).
  • Supports actions like export, credits check, timeline state, and file upload via intent classification.
  • Real-time status updates, robust error messaging, and workflow tips for social media creators.
  • Requires environment variable setup (NEMO_TOKEN) and attribution headers for all API calls.
Metadata
Slug ai-image-to-video-motion
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Ai Image To Video Motion?

Get animated motion videos ready to post, without touching a single slider. Upload your still images (JPG, PNG, WEBP, HEIC, up to 200MB), say something like... It is an AI Agent Skill for Claude Code / OpenClaw, with 64 downloads so far.

How do I install Ai Image To Video Motion?

Run "/install ai-image-to-video-motion" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Ai Image To Video Motion free?

Yes, Ai Image To Video Motion is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Ai Image To Video Motion support?

Ai Image To Video Motion is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Ai Image To Video Motion?

It is built and maintained by vcarolxhberger (@vcarolxhberger); the current version is v1.0.0.
