peand-rover

Image To Video Maker

by peandrover adam · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
141 Downloads · 0 Stars · 0 Active Installs · 1 Version
Install in OpenClaw
/install image-to-video-maker
Description
Tired of static photos sitting in folders, never getting the attention they deserve? The image-to-video-maker skill on ClawHub breathes life into your still...
README (SKILL.md)

Getting Started

Welcome! Ready to turn your photos into a video that actually gets watched? Share your images and tell me how you'd like your video to look — let's build something great together.

Try saying:

  • "Turn these 12 product photos into a 30-second promotional video with smooth fade transitions"
  • "Create a travel video slideshow from my vacation photos, showing each image for 3 seconds with a cinematic feel"
  • "Make a birthday tribute video using these 20 family photos, ordered chronologically with gentle transitions"

Quick Start Setup

This skill connects to NemoVideo's cloud backend. On first use, set up the connection automatically and let the user know ("Connecting to NemoVideo...").

Token check: Look for NEMO_TOKEN in the environment. If found, skip to session creation. Otherwise:

  • Read ~/.config/nemovideo/client_id or generate a UUID and save it there
  • POST https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with X-Client-Id header
  • Extract data.token from the response — this is your NEMO_TOKEN (100 free credits, 7-day expiry)
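The token check above can be sketched as follows. This is a minimal sketch under stated assumptions, not the skill's actual implementation: the function names are hypothetical, the path and endpoint are taken from the steps above, and the response is assumed to be JSON with the token at data.token. Nothing is sent over the network until fetch_anonymous_token is called.

```python
import json
import os
import urllib.request
import uuid
from pathlib import Path

API_BASE = "https://mega-api-prod.nemovideo.ai"
CLIENT_ID_PATH = Path.home() / ".config" / "nemovideo" / "client_id"


def load_or_create_client_id(path: Path = CLIENT_ID_PATH) -> str:
    """Return the saved client ID, or generate a UUID and save it."""
    if path.exists():
        saved = path.read_text().strip()
        if saved:
            return saved
    client_id = str(uuid.uuid4())
    path.parent.mkdir(parents=True, exist_ok=True)
    path.write_text(client_id)
    return client_id


def extract_token(response_body: str) -> str:
    """Read data.token from the JSON response (shape assumed from the text)."""
    return json.loads(response_body)["data"]["token"]


def fetch_anonymous_token(client_id: str) -> str:
    """POST to the anonymous-token endpoint with the X-Client-Id header."""
    request = urllib.request.Request(
        f"{API_BASE}/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )
    with urllib.request.urlopen(request, timeout=30) as response:
        return extract_token(response.read().decode("utf-8"))


def get_token() -> str:
    """Prefer NEMO_TOKEN from the environment; otherwise request a new one."""
    token = os.environ.get("NEMO_TOKEN")
    if token:
        return token
    return fetch_anonymous_token(load_or_create_client_id())
```

Keeping the client ID on disk means repeated runs reuse the same identity, which matters because the error table lists a limit of one anonymous token per client per 7 days.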

Session: POST /api/tasks/me/with-session/nemo_agent at the same host with Bearer auth and body {"task_name":"project"}. Keep the returned session_id for all operations.
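As a rough illustration of the session step, the sketch below builds the request without sending it. The helper names are hypothetical, the Content-Type header is an assumption, and the text does not say where session_id sits in the response, so find_session_id simply tries two plausible locations.

```python
import json
import urllib.request
from typing import Optional

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_session_request(token: str, task_name: str = "project") -> urllib.request.Request:
    """Build (but do not send) the create-session POST request."""
    body = json.dumps({"task_name": task_name}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",  # assumed; not stated in the text
        },
    )


def find_session_id(payload: dict) -> Optional[str]:
    """Look for session_id at the top level, then under "data" (both are guesses)."""
    if "session_id" in payload:
        return payload["session_id"]
    data = payload.get("data")
    if isinstance(data, dict):
        return data.get("session_id")
    return None
```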

Let the user know with a brief "Ready!" when setup is complete. Don't expose tokens or raw API output.

Turn Your Best Photos Into Videos Worth Watching

Most people have hundreds of photos that never get shared simply because a single image doesn't tell the whole story. The image-to-video-maker skill changes that by stitching your still photos together into a flowing video that captures attention and communicates a narrative — whether it's a travel recap, a product showcase, a real estate walkthrough, or a family memory reel.

With this skill, you stay in control of the creative direction. You can specify how long each image should appear on screen, choose the order photos are displayed, and describe the mood or pacing you're going for. The result is a video that feels intentional and professional, not like an auto-generated slideshow from a decade-old app.

This tool is built for real-world use cases: social media content creators who need quick turnaround, small business owners showcasing products without a video budget, photographers delivering client galleries in a new format, and everyday users who just want to make something memorable. Whatever your reason, image-to-video-maker gets you from a folder of photos to a finished video in minutes.

Routing Your Slideshow Requests

Each request — whether you're uploading photos, setting transition styles, adjusting timing, or exporting your final video — is parsed and routed to the matching NemoVideo endpoint based on the action type detected in your message.

User says... | Action | Skip SSE?
"export" / "导出" / "download" / "send me the video" | → §3.5 Export | Yes
"credits" / "积分" / "balance" / "余额" | → §3.3 Credits | Yes
"status" / "状态" / "show tracks" | → §3.4 State | Yes
"upload" / "上传" / user sends file | → §3.2 Upload | Yes
Everything else (generate, edit, add BGM…) | → §3.1 SSE | No
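The routing table can be approximated with a simple keyword match. This is only a sketch: the route function, the action labels, and the choice to check for an attached file first are assumptions, and plain substring matching will misroute some messages (for example, "balanced pacing" would match "balance").

```python
# Keyword groups copied from the routing table; action labels are hypothetical.
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(message: str, has_file: bool = False) -> str:
    """Return the action for a user message; anything unmatched goes to SSE."""
    if has_file:
        return "upload"
    text = message.lower()
    for action, keywords in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"
```

For example, route("Please export the video") selects the export action, while route("add BGM and fade transitions") falls through to SSE.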

NemoVideo API Reference

The NemoVideo backend handles frame sequencing, Ken Burns motion effects, transition rendering, and audio sync to stitch your still images into a polished video slideshow. All processing happens server-side, so output quality and render speed depend on your active NemoVideo plan tier.

Skill attribution — read from this file's YAML frontmatter at runtime:

  • X-Skill-Source: image-to-video-maker
  • X-Skill-Version: from frontmatter version
  • X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
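One way to assemble the required headers is sketched below. It assumes the platform is inferred from whether the install path contains ~/.clawhub/ (clawhub) or ~/.cursor/skills/ (cursor); the function names are hypothetical, and reading the version from the YAML frontmatter is left to the caller.

```python
def detect_platform(install_path: str) -> str:
    """Map the skill's install path to a platform label."""
    normalized = install_path.replace("\\", "/")
    if "/.clawhub/" in normalized:
        return "clawhub"
    if "/.cursor/skills/" in normalized:
        return "cursor"
    return "unknown"


def build_headers(token: str, version: str, install_path: str) -> dict:
    """Return the auth and attribution headers sent with every request."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "image-to-video-maker",
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

Building the headers in one place makes it harder to forget an attribution header on the export call, which is where the text says a missing header fails.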

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent, body {"task_name":"project","language":"<lang>"}; returns task_id, session_id. After creating a session, give the user a link: https://nemovideo.com/workspace/claim?token=&task=<task_id>&session=<session_id>&skill_name=image-to-video-maker&skill_version=1.0.0&skill_source=<platform>

Send message (SSE): POST /run_sse, body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}}, with Accept: text/event-stream. Max timeout: 15 minutes.

Upload: POST /api/upload-video/nemo_agent/me/<sid>. File: multipart -F "files=@/path"; URL: {"urls":["<url>"],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest; key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda, body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
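The export-and-poll flow might look like the sketch below. The HTTP call is injected as fetch_status so the loop can be exercised without a network; the function names and the max_polls cap are hypothetical, and the sketch assumes status and output.url sit at the top level of the poll response. How a failed render is reported is not stated in the text, so no failure status is handled here.

```python
import time
from typing import Any, Callable, Dict, Optional


def build_render_request(session_id: str, draft: Any, timestamp: Optional[int] = None) -> dict:
    """Build the export body; the render id is "render_" plus a timestamp."""
    ts = int(time.time()) if timestamp is None else timestamp
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def wait_for_render(
    fetch_status: Callable[[], Dict[str, Any]],
    interval_s: float = 30.0,
    max_polls: int = 60,  # hypothetical cap; the text gives no limit
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until status is "completed", then return the download URL."""
    for attempt in range(max_polls):
        result = fetch_status()
        if result.get("status") == "completed":
            return result["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval_s)
    raise TimeoutError("render did not complete within the polling budget")
```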

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
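A client-side check against this list can catch unsupported files before upload. The sketch below is illustrative only: the function name is hypothetical, and it matches on the file extension alone rather than inspecting file contents.

```python
from pathlib import PurePath

# Extensions copied from the supported formats list above.
SUPPORTED_FORMATS = {
    "mp4", "mov", "avi", "webm", "mkv",
    "jpg", "png", "gif", "webp",
    "mp3", "wav", "m4a", "aac",
}


def is_supported(filename: str) -> bool:
    """Return True when the file extension appears in the supported list."""
    extension = PurePath(filename).suffix.lower().lstrip(".")
    return extension in SUPPORTED_FORMATS
```

Note that the list names jpg but not jpeg, so a file called photo.jpeg is rejected by this check even though the backend may well accept it.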

SSE Event Handling

Event | Action
Text response | Apply GUI translation (§4), present to user
Tool call/result | Process internally, don't forward
heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..."
Stream closes | Process final response
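The event handling above presupposes a parser for the event stream. Below is a minimal sketch of standard server-sent-events framing (event: and data: fields, blank line as terminator, lines starting with a colon as comments). How NemoVideo actually encodes heartbeats is not stated, so is_keepalive treats both an event named heartbeat and an empty data payload as keep-alives; both function names are hypothetical.

```python
from typing import Iterable, Iterator, Tuple


def parse_sse(lines: Iterable[str]) -> Iterator[Tuple[str, str]]:
    """Yield (event_name, data) pairs from raw SSE lines."""
    event, data = "message", []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event; dispatch if it had data lines.
            if data:
                yield event, "\n".join(data)
            event, data = "message", []
        elif line.startswith(":"):
            continue  # comment line, often used as a keep-alive
        elif line.startswith("event:"):
            event = line[len("event:"):].strip()
        elif line.startswith("data:"):
            value = line[len("data:"):]
            if value.startswith(" "):
                value = value[1:]  # drop the single optional leading space
            data.append(value)


def is_keepalive(event: str, data: str) -> bool:
    """Treat heartbeat events and empty payloads as 'keep waiting'."""
    return event == "heartbeat" or data.strip() == ""
```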

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
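One simple way to verify a silent edit is to compare the draft before and after the operation. The sketch below assumes the state response is a dict with the draft at data.state.draft, as listed in the API reference, and that any applied edit changes the draft; the function names are hypothetical.

```python
from typing import Any


def draft_of(state_response: dict) -> Any:
    """Return data.state.draft from a state response, or None if absent."""
    return state_response.get("data", {}).get("state", {}).get("draft")


def edit_was_applied(before: dict, after: dict) -> bool:
    """Report whether the draft changed between two state snapshots."""
    return draft_of(before) != draft_of(after)
```

A caller would fetch the state once before sending the edit and again after the stream closes, then summarize the difference for the user when edit_was_applied returns True.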

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says | You do
"click [button]" / "点击" | Execute via API
"open [panel]" / "打开" | Query session state
"drag/drop" / "拖拽" | Send edit via SSE
"preview in timeline" | Show track summary
"Export button" / "导出" | Execute export workflow

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
  1. Video: city timelapse (0-10s)
  2. BGM: Lo-fi (0-10s, 35%)
  3. Title: "Urban Dreams" (0-3s)
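A track summary like the one above could be produced from the draft with something like the following. Only the field abbreviations come from the mapping; the nesting is an assumption (a top-level t list of tracks, each with a tt type and an sg list of segments carrying d in milliseconds), and the function name is hypothetical.

```python
# Track type codes from the draft field mapping.
TRACK_TYPES = {0: "video", 1: "audio", 7: "text"}


def summarize_draft(draft: dict) -> str:
    """Build a short, human-readable timeline from an abbreviated draft."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "unknown")
        segments = track.get("sg", [])
        # Segment durations are in milliseconds; sum them and convert to seconds.
        total_s = sum(segment.get("d", 0) for segment in segments) / 1000
        lines.append(f"{index}. {kind}: {len(segments)} segment(s), {total_s:g}s")
    return "\n".join(lines)
```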

Error Handling

Code | Meaning | Action
0 | Success | Continue
1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days)
1002 | Session not found | New session §3.0
2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up at nemovideo.ai"
4001 | Unsupported file | Show supported formats
4002 | File too large | Suggest compress/trim
400 | Missing X-Client-Id | Generate Client-Id and retry (see §1)
402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register at nemovideo.ai to unlock export."
429 | Rate limit (1 token/client/7 days) | Retry in 30s once
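The error table lends itself to a lookup keyed by code. In the sketch below the codes come from the table, while the action labels, the function name, and the fallback for unlisted codes are hypothetical.

```python
# Codes from the error table mapped to hypothetical action labels.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauthenticate",
    1002: "create_session",
    2001: "show_credit_options",
    4001: "show_supported_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "suggest_registration",
    429: "retry_once_after_30s",
}


def action_for(code: int) -> str:
    """Return the action label for a response code, with a generic fallback."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")
```

Keeping 402 and 2001 as separate entries preserves the table's distinction between a subscription tier problem and running out of credits.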

FAQ

How many images can I include in one video? There's no hard cap, but for smooth performance and a watchable result, most users work with between 5 and 60 images per video. Very large batches may benefit from being split into segments.

Can I add music or audio to my image video? Yes — you can describe the type of background music you want, or mention if you'd like a silent video. If you have a specific audio file in mind, note that in your prompt.

What aspect ratio will the video be? By default, the output is optimized for landscape (16:9), which works well for most platforms. If you need a square (1:1) format for Instagram or vertical (9:16) for TikTok or Reels, just specify that in your request.

Will the skill work with screenshots or graphics, not just photos? Absolutely. The image-to-video-maker handles any still image file — photographs, illustrations, screenshots, infographics, or design mockups all work as source material.

Troubleshooting

Images appear out of order in the final video: Make sure to specify the sequence explicitly when uploading. You can number your files or describe the desired order in your prompt (e.g., 'start with the exterior shots, then move to interiors').

Video output looks blurry or pixelated: This usually happens when the source images are low resolution. For best results, use photos that are at least 1080px on the shortest side. If you're working with older or compressed images, mention this upfront so the output settings can be adjusted accordingly.

The video is too long or too short: You can control pacing by specifying how many seconds each image should display. If you didn't set a duration and the result feels off, simply ask for a revised version with a specific timing (e.g., '2 seconds per image' or 'fit everything into 60 seconds').

Unsupported file format on playback: The image-to-video-maker supports mp4, mov, avi, webm, and mkv. If your device or platform has trouble playing the output, request a specific format in your next prompt.
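The pacing advice above comes down to simple arithmetic. The sketch below shows it under a simplifying assumption: transitions are ignored, although in practice they may overlap or add to the display time. Both function names are hypothetical.

```python
def seconds_per_image(total_seconds: float, image_count: int) -> float:
    """Display time per image when fitting a set into a fixed total length."""
    if image_count <= 0:
        raise ValueError("image_count must be positive")
    return total_seconds / image_count


def total_duration(image_count: int, per_image_seconds: float) -> float:
    """Total video length for a fixed display time per image."""
    return image_count * per_image_seconds
```

For instance, fitting 20 photos into 60 seconds gives 3 seconds per image, and 12 photos at 2.5 seconds each produce a 30-second video.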

Quick Start Guide

Step 1 — Gather your images: Collect the photos or graphics you want to include. For the best output, use consistently sized images and remove any duplicates or low-quality shots before uploading.

Step 2 — Describe your vision: In your prompt, tell the skill how long the video should be, how long each image should appear, what kind of transitions you prefer (fade, slide, cut, etc.), and whether you want any text overlays or audio.

Step 3 — Specify your output format: Choose from mp4, mov, avi, webm, or mkv depending on where you plan to use the video. If you're unsure, mp4 is the most universally compatible choice.

Step 4 — Review and refine: Once your first video is generated, watch it through and note anything you'd like adjusted — timing, order, pacing, or format. You can iterate quickly by describing what needs to change in a follow-up message. Most users get to a final result within two or three rounds of feedback.

Usage Guidance
This skill will upload whatever images you provide to nemovideo's cloud (mega-api-prod.nemovideo.ai) and will create/read a small config under ~/.config/nemovideo/ (client_id). Before installing, consider: (1) Do you trust nemovideo.com and the listed GitHub repo? (verify they match and review their privacy policy), (2) Avoid uploading sensitive images to third-party services, (3) Note the skill may obtain a short-lived anonymous token if you don't supply NEMO_TOKEN — treat that token like a credential, and (4) If you prefer, create an account or test with non-sensitive photos first. If you want stronger assurance, ask the publisher for a link to the source code for this exact skill bundle and verify the API endpoints and token-handling behavior there.
Capability Analysis
Type: OpenClaw Skill · Name: image-to-video-maker · Version: 1.0.0
The image-to-video-maker skill is a standard integration for the NemoVideo cloud service. It provides instructions for an AI agent to manage authentication via NEMO_TOKEN or an anonymous client ID stored in ~/.config/nemovideo/, and facilitates image uploads and video rendering through the mega-api-prod.nemovideo.ai backend. The skill follows security best practices by instructing the agent not to expose tokens and limits its file system and network activity to its own configuration and the stated service endpoints.
Capability Assessment
Purpose & Capability
Name/description (image→video) match the runtime instructions: the SKILL.md describes creating sessions, uploading images, and exporting videos to nemovideo's API. The declared primary credential (NEMO_TOKEN) and config path (~/.config/nemovideo/) are appropriate for a cloud-backed media service.
Instruction Scope
Instructions direct the agent to check NEMO_TOKEN, read/write ~/.config/nemovideo/client_id (or generate one), create sessions, stream SSE, upload files, and query credits/state. These actions are expected for this skill, but they do involve reading/writing the user's ~/.config directory and detecting install path (~/.clawhub/, ~/.cursor/skills/) to set X-Skill-Platform — a benign but privacy-relevant action. The doc also insists on including attribution headers and warns about not exposing tokens; there is a small inconsistency around how the token is persisted (it extracts data.token but doesn't explicitly say whether to save it to env or only memory).
Install Mechanism
No install spec and no code files — instruction-only. That minimizes disk-write risk and aligns with the skill being a thin connector to a third-party API.
Credentials
Only a single primary credential (NEMO_TOKEN) and one config path (~/.config/nemovideo/) are requested. Those are proportional to a cloud render/upload service. No unrelated secrets or broad system credentials are requested.
Persistence & Privilege
Skill is not always-enabled and allows normal autonomous invocation. It asks to create/save a client_id under ~/.config/nemovideo/, which is reasonable for client identification and does not request system-wide or other-skills' configuration changes.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install image-to-video-maker
  3. After installation, invoke the skill by name or use /image-to-video-maker
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of Image to Video Maker: easily turn your photos into dynamic video slideshows.
  • Converts still images into professional video slideshows with custom duration, transitions, and ordering.
  • Supports popular output formats: mp4, mov, avi, webm, and mkv.
  • No local editing required; quick cloud-based processing via NemoVideo backend.
  • Simple setup: automatic token generation and session handling with clear user feedback.
  • Includes user-friendly commands for uploading, generating, exporting, and checking credits or status.
  • Designed for content creators, marketers, photographers, and anyone wanting to make polished videos from photos.
Metadata
Slug image-to-video-maker
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Image To Video Maker?

Image To Video Maker is an AI Agent Skill for Claude Code / OpenClaw that turns still images into video slideshows. It has 141 downloads so far.

How do I install Image To Video Maker?

Run "/install image-to-video-maker" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Image To Video Maker free?

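Yes. The skill itself is free to download, install, and use under the MIT-0 license. Rendering runs on NemoVideo's backend, where an anonymous token comes with 100 free credits and export on the free plan may require registering at nemovideo.ai.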

Which platforms does Image To Video Maker support?

Image To Video Maker is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Image To Video Maker?

It is built and maintained by peandrover adam (@peand-rover); the current version is v1.0.0.
