功能描述

turn free images into image-based video with this skill. Works with JPG, PNG, WEBP, GIF files up to 200MB. content creators use it for creating videos from f...

使用说明 (SKILL.md)

Getting Started

Send me your free images and I'll handle the AI video creation. Or just describe what you're after.

Try saying:

"turn three free stock photos of a city skyline into a 1080p MP4"
"turn these free images into a 30-second promotional video with music and transitions"
"creating videos from free images without paid assets for content creators"

Getting Connected

Before handling any user request, establish a connection to the backend API. Show a brief status like "Connecting...".

If NEMO_TOKEN is in the environment, use it directly and create a session. Otherwise, acquire a free starter token:

Generate a UUID as client identifier
POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the X-Client-Id header
The response includes a token with 100 free credits valid for 7 days — use it as NEMO_TOKEN

Then create a session by POSTing to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer authorization and body {"task_name":"project","language":"en"}. The session_id in the response is needed for all following requests.

Tell the user you're ready. Keep the technical details out of the chat.

AI Video Maker Free Image — Turn Free Images Into Videos

Name: Ai Video Maker Free Image
Author: tk8544-b

Send me your free images and describe the result you want. The AI video creation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload three free stock photos of a city skyline, type "turn these free images into a 30-second promotional video with music and transitions", and you'll get a 1080p MP4 back in roughly 30-60 seconds. All rendering happens server-side.

Worth noting: using fewer high-resolution images produces smoother, faster results than many low-quality ones.

Matching Input to Actions

User prompts referencing ai video maker free image, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Headers are derived from this file's YAML frontmatter. X-Skill-Source is ai-video-maker-free-image, X-Skill-Version comes from the version field, and X-Skill-Platform is detected from the install path (~/.clawhub/ = clawhub, ~/.cursor/skills/ = cursor, otherwise unknown).

Include Authorization: Bearer \x3CNEMO_TOKEN> and all attribution headers on every request — omitting them triggers a 402 on export.

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"\x3Clang>"} — returns task_id, session_id.

Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"\x3Csid>","new_message":{"parts":[{"text":"\x3Cmsg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.

Upload: POST /api/upload-video/nemo_agent/me/\x3Csid> — file: multipart -F "files=@/path", or URL: {"urls":["\x3Curl>"],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/\x3Csid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_\x3Cts>","sessionId":"\x3Csid>","draft":\x3Cjson>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s until status = completed. Download URL at output.url.

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says	You do
"click [button]" / "点击"	Execute via API
"open [panel]" / "打开"	Query session state
"drag/drop" / "拖拽"	Send edit via SSE
"preview in timeline"	Show track summary
"Export button" / "导出"	Execute export workflow

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Error Handling

Code	Meaning	Action
0	Success	Continue
1001	Bad/expired token	Re-auth via anonymous-token (tokens expire after 7 days)
1002	Session not found	New session §3.0
2001	No credits	Anonymous: show registration URL with `?bind=\x3Cid>` (get `\x3Cid>` from create-session or state response when needed). Registered: "Top up credits in your account"
4001	Unsupported file	Show supported formats
4002	File too large	Suggest compress/trim
400	Missing X-Client-Id	Generate Client-Id and retry (see §1)
402	Free plan export blocked	Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export."
429	Rate limit (1 token/client/7 days)	Retry in 30s once

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "turn these free images into a 30-second promotional video with music and transitions" — concrete instructions get better results.

Max file size is 200MB. Stick to JPG, PNG, WEBP, GIF for the smoothest experience.

Export as MP4 for widest compatibility across social platforms.

Common Workflows

Quick edit: Upload → "turn these free images into a 30-second promotional video with music and transitions" → Download MP4. Takes 30-60 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

安全使用建议

This skill appears to be a thin instruction-only integration for a cloud rendering API (mega-api-prod.nemovideo.ai) and only needs a NEMO_TOKEN to operate — that is reasonable. Before installing/using it, consider: 1) Source verification: the skill's source/homepage is unknown; prefer skills from known authors or with a homepage. 2) Token use: only provide a short-lived or limited NEMO_TOKEN if possible; the skill can request an anonymous starter token per its instructions, which is safer than giving a long-lived secret. 3) Local path access: the SKILL.md implies detecting install/config paths (~/.clawhub, ~/.cursor, ~/.config/nemovideo). Ask the maintainer why local path detection is needed and confirm the agent will not read arbitrary files in your home directory. 4) Privacy of uploads: images you upload will be sent to the external API; avoid uploading sensitive images unless you trust the service. 5) Clarify the configPaths inconsistency: the registry metadata you were shown lists no config paths, but the SKILL.md includes one — ask for clarification. If the author can confirm no local files beyond the user-provided images will be read and provide a verifiable homepage/repo, this would raise confidence to high.

功能分析

Type: OpenClaw Skill Name: ai-video-maker-free-image Version: 1.0.0 The skill is a legitimate integration for the NemoVideo AI service, designed to convert images into videos using a remote API (mega-api-prod.nemovideo.ai). The SKILL.md file provides detailed instructions for the agent to handle authentication via tokens, manage sessions, upload media, and poll for rendering status. There is no evidence of data exfiltration, unauthorized file access, or malicious command execution; all network activities and environment variable requirements (NEMO_TOKEN) are directly aligned with the stated functionality.

能力评估

⚠ Purpose & Capability

Name/description match the runtime instructions (remote GPU rendering via nemovideo API). Requiring NEMO_TOKEN is proportionate. However the SKILL.md metadata references a config path (~/.config/nemovideo/) and the instructions derive an X-Skill-Platform from local install paths (~/.clawhub/, ~/.cursor/skills/). The registry metadata (provided to you) listed no config paths, so there's an inconsistency about whether the skill expects to inspect local config/install locations.

⚠ Instruction Scope

Instructions are specific about API endpoints (mega-api-prod.nemovideo.ai), session creation, SSE, upload endpoints and polling for renders — all coherent for a cloud render service. But the doc also instructs the agent to 'detect' install path to set X-Skill-Platform (implies reading home directories), and to derive request headers from YAML frontmatter. Asking the agent to read local install/config paths is scope creep relative to a pure image→video transform and could expose local metadata not needed for rendering.

✓ Install Mechanism

Instruction-only skill with no install spec and no code files. This is low-risk from an installation perspective because nothing is written to disk by an installer.

ℹ Credentials

The only declared required credential is NEMO_TOKEN (primaryEnv), which is reasonable for a hosted API. The SKILL.md also provides a fallback anonymous-token flow (generates a UUID and POSTs for an anonymous token), which reduces need for long-lived secrets. The inconsistency about configPaths in the SKILL.md metadata (versus registry metadata) is notable — a config path could contain additional secrets/config the skill might read if implemented.

✓ Persistence & Privilege

always:false and user-invocable:true. The skill does not request permanent presence or elevated platform privileges. Autonomous invocation is allowed (default) but not in itself a flagged concern.

版本历史

v1.0.0

AI Video Maker Free Image — Initial Release - Create 1080p MP4 videos from free images (JPG, PNG, WEBP, GIF, up to 200MB). - Automatic session and token setup—no registration needed for free tier. - Upload images and describe your desired output for cloud-based video rendering. - Supports credit checking, export/download, timeline preview, and error handling. - Comprehensive prompts and workflows for content creators using free assets only.

元数据

Slug ai-video-maker-free-image

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题

Ai Video Maker Free Image 是什么？

turn free images into image-based video with this skill. Works with JPG, PNG, WEBP, GIF files up to 200MB. content creators use it for creating videos from f... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 68 次。

如何安装 Ai Video Maker Free Image？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ai-video-maker-free-image」即可一键安装，无需额外配置。

Ai Video Maker Free Image 是免费的吗？

是的，Ai Video Maker Free Image 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Ai Video Maker Free Image 支持哪些平台？

Ai Video Maker Free Image 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Ai Video Maker Free Image？

由 tk8544-b（@tk8544-b）开发并维护，当前版本 v1.0.0。

Ai Video Maker Free Image