← Back to Skills Marketplace
peand-rover

Image To Video Free Ai Generator

by peandrover adam · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
80
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install image-to-video-free-ai-generator
Description
Turn three product photos in JPG format into 1080p animated video clips just by typing what you need. Whether it's converting static images into short videos...
README (SKILL.md)

Getting Started

Got images to work with? Send it over and tell me what you need — I'll take care of the AI video creation.

Try saying:

  • "convert three product photos in JPG format into a 1080p MP4"
  • "turn these photos into a 15-second video with smooth transitions"
  • "converting static images into short videos for social media for social media creators"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: \x3Cuuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.

Image to Video Free AI Generator — Convert Images Into Video Clips

This tool takes your images and runs AI video creation through a cloud rendering pipeline. You upload, describe what you want, and download the result.

Say you have three product photos in JPG format and want to turn these photos into a 15-second video with smooth transitions — the backend processes it in about 30-60 seconds and hands you a 1080p MP4.

Tip: using fewer images with higher quality gives smoother results than many low-res photos.

Matching Input to Actions

User prompts referencing image to video free ai generator, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says... Action Skip SSE?
"export" / "导出" / "download" / "send me the video" → §3.5 Export
"credits" / "积分" / "balance" / "余额" → §3.3 Credits
"status" / "状态" / "show tracks" → §3.4 State
"upload" / "上传" / user sends file → §3.2 Upload
Everything else (generate, edit, add BGM…) → §3.1 SSE

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

  1. SessionPOST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":"\x3Clang>"}. Gives you a session_id.
  2. Chat (SSE)POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
  3. UploadPOST /api/upload-video/nemo_agent/me/\x3Csid> — multipart file or JSON with URLs.
  4. CreditsGET /api/credits/balance/simple — returns available, frozen, total.
  5. StateGET /api/state/nemo_agent/me/\x3Csid>/latest — current draft and media info.
  6. ExportPOST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s for completed status and download URL.

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

Header Value
X-Skill-Source image-to-video-free-ai-generator
X-Skill-Version frontmatter version
X-Skill-Platform auto-detect: clawhub / cursor / unknown from install path

Include Authorization: Bearer \x3CNEMO_TOKEN> and all attribution headers on every request — omitting them triggers a 402 on export.

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

  • "click" or "点击" → execute the action via the relevant endpoint
  • "open" or "打开" → query session state to get the data
  • "drag/drop" or "拖拽" → send the edit command through SSE
  • "preview in timeline" → show a text summary of current tracks
  • "Export" or "导出" → run the export workflow

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Error Handling

Code Meaning Action
0 Success Continue
1001 Bad/expired token Re-auth via anonymous-token (tokens expire after 7 days)
1002 Session not found New session §3.0
2001 No credits Anonymous: show registration URL with ?bind=\x3Cid> (get \x3Cid> from create-session or state response when needed). Registered: "Top up credits in your account"
4001 Unsupported file Show supported formats
4002 File too large Suggest compress/trim
400 Missing X-Client-Id Generate Client-Id and retry (see §1)
402 Free plan export blocked Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export."
429 Rate limit (1 token/client/7 days) Retry in 30s once

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "turn these photos into a 15-second video with smooth transitions" — concrete instructions get better results.

Max file size is 200MB. Stick to JPG, PNG, WEBP, HEIC for the smoothest experience.

Export as MP4 for widest compatibility across all social platforms.

Common Workflows

Quick edit: Upload → "turn these photos into a 15-second video with smooth transitions" → Download MP4. Takes 30-60 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Usage Guidance
This skill largely behaves like a thin client for a single cloud API and only asks for one token (NEMO_TOKEN). However: (1) the package has no homepage or source listed and the publisher identity is opaque — that reduces accountability; (2) the SKILL.md frontmatter mentions a local config path (~/.config/nemovideo/) that the registry did not list, which is an unexplained inconsistency; (3) the skill can generate an anonymous short-lived token for you if you don't supply one — prefer that over providing a long-lived credential; (4) confirm you trust the domain (mega-api-prod.nemovideo.ai) before uploading proprietary images and check the service's data retention/privacy policies; (5) if you proceed, prefer using temporary/limited-scope tokens, avoid supplying other unrelated credentials, and monitor for unexpected network activity. If you want higher assurance, ask the maintainer for a homepage, documentation, or repository link and clarification about the config path usage.
Capability Analysis
Type: OpenClaw Skill Name: image-to-video-free-ai-generator Version: 1.0.0 The skill bundle is a legitimate integration for an AI video generation service (nemovideo.ai). It provides clear instructions for the agent to handle authentication, session management, and API interactions for uploading images and exporting videos. No malicious behaviors, such as data exfiltration or unauthorized execution, were detected; the instructions even include security-conscious directions to avoid printing raw tokens to the user. All network activity is directed to the service's domain (mega-api-prod.nemovideo.ai).
Capability Assessment
Purpose & Capability
The skill claims to convert images into videos and only requests a single service token (NEMO_TOKEN), which matches that purpose. However the SKILL.md frontmatter includes a configPaths entry (~/.config/nemovideo/) while the registry metadata listed no required config paths — this mismatch is unexplained and could indicate stale or inconsistent metadata.
Instruction Scope
Runtime instructions are narrowly scoped to interacting with the nemo-video API (session creation, SSE chat, upload, export, credits/state). They instruct generating an anonymous token if none is provided and to upload user images; they do not ask the agent to read unrelated system files, histories, or external endpoints.
Install Mechanism
No installation steps or downloads are present (instruction-only skill), so nothing is written to disk by an installer. This is the lower-risk configuration for skills.
Credentials
Only one credential (NEMO_TOKEN) is declared as required and that is proportional for a cloud rendering API. The SKILL.md also documents a flow to mint an anonymous token if no token is present (reasonable). The unexplained frontmatter configPaths entry suggests the skill might also look for local config (~/.config/nemovideo/) — that access wasn't declared in the registry metadata and is not justified in the prose.
Persistence & Privilege
The skill is not marked always:true, uses normal autonomous invocation defaults, and does not request system-wide persistence or modification of other skills. No elevated presence is requested.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install image-to-video-free-ai-generator
  3. After installation, invoke the skill by name or use /image-to-video-free-ai-generator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Image to Video Free AI Generator — Initial Release - Launches free AI-powered tool to convert up to three JPG product photos into 1080p animated video clips within 30–60 seconds. - Simple upload interface: just send images and describe the desired video outcome; no manual editing or export settings required. - Automatic cloud session setup, including free 7-day token generation for new users. - Supports common workflows like quick edits, batch processing, and iterative video refinements. - Handles video rendering, export (MP4 and more), credits, project state, and error management directly through backend APIs. - Guides users with prompt suggestions, real-time status updates, and user-friendly error messages.
Metadata
Slug image-to-video-free-ai-generator
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Image To Video Free Ai Generator?

Turn three product photos in JPG format into 1080p animated video clips just by typing what you need. Whether it's converting static images into short videos... It is an AI Agent Skill for Claude Code / OpenClaw, with 80 downloads so far.

How do I install Image To Video Free Ai Generator?

Run "/install image-to-video-free-ai-generator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image To Video Free Ai Generator free?

Yes, Image To Video Free Ai Generator is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Image To Video Free Ai Generator support?

Image To Video Free Ai Generator is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Image To Video Free Ai Generator?

It is built and maintained by peandrover adam (@peand-rover); the current version is v1.0.0.

💬 Comments