Ai Video Maker Ai

by dsewell-583h0 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Downloads: 86 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install ai-video-maker-ai
Description
Turn five product images and a logo file into 1080p AI-generated videos just by typing what you need. Whether it's creating videos from images or clips using...
README (SKILL.md)

Getting Started

Got images or clips to work with? Send them over and tell me what you need, and I'll take care of the AI video creation.

Try saying:

  • "turn five product images and a logo file into a 1080p MP4"
  • "turn these images into a 30-second promo video with music and text overlays"
  • "create a marketing video from these images or clips"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If the NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as the client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with the header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
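The setup steps above can be sketched as plain request descriptions. This is a minimal illustration, not the skill's actual implementation: the function names and the dict layout are hypothetical, the Content-Type header is an assumption, and the attribution headers that this README requires on every request are left out for brevity. Nothing is sent over the network.

```python
import json
import uuid

BASE_URL = "https://mega-api-prod.nemovideo.ai"


def anonymous_token_request(client_id: str) -> dict:
    """Describe the POST that requests a free anonymous token."""
    return {
        "method": "POST",
        "url": f"{BASE_URL}/api/auth/anonymous-token",
        "headers": {"X-Client-Id": client_id},
    }


def session_request(token: str) -> dict:
    """Describe the POST that opens a session using Bearer auth."""
    return {
        "method": "POST",
        "url": f"{BASE_URL}/api/tasks/me/with-session/nemo_agent",
        "headers": {
            "Authorization": f"Bearer {token}",
            # Assumption: the body is sent as JSON.
            "Content-Type": "application/json",
        },
        "body": json.dumps({"task_name": "project"}),
    }


def first_setup_request(env: dict) -> dict:
    """Skip token generation when NEMO_TOKEN is already set."""
    token = env.get("NEMO_TOKEN")
    if token:
        return session_request(token)
    return anonymous_token_request(str(uuid.uuid4()))


def extract_token(response: dict) -> str:
    """Read data.token from the anonymous-token response body."""
    return response["data"]["token"]
```

In practice the caller would pass `dict(os.environ)` to `first_setup_request`, send the described request with an HTTP client of its choice, and keep the token out of any user-facing output.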

AI Video Maker AI — Create Videos with AI Automation

Drop your images or clips in the chat and tell me what you need. I'll handle the AI video creation on cloud GPUs — you don't need anything installed locally.

Here's a typical use: you send five product images and a logo file, ask to "turn these images into a 30-second promo video with music and text overlays", and about 1-2 minutes later you've got an MP4 file ready to download. The whole thing runs at 1080p by default.

One thing worth knowing — using fewer than 10 images speeds up generation significantly.

Matching Input to Actions

User prompts referencing ai video maker ai, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says... Action Skip SSE?
"export" / "导出" / "download" / "send me the video" → §3.5 Export
"credits" / "积分" / "balance" / "余额" → §3.3 Credits
"status" / "状态" / "show tracks" → §3.4 State
"upload" / "上传" / user sends file → §3.2 Upload
Everything else (generate, edit, add BGM…) → §3.1 SSE
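The routing table above could be approximated with a keyword lookup. This sketch covers only the keyword half of "keyword and intent classification"; the action names, the `has_file` flag, and the substring matching are assumptions made for illustration.

```python
# Keywords are taken from the table above; action names are hypothetical labels.
ROUTES = [
    ("export", ("export", "导出", "download", "send me the video")),
    ("credits", ("credits", "积分", "balance", "余额")),
    ("state", ("status", "状态", "show tracks")),
    ("upload", ("upload", "上传")),
]


def route(message: str, has_file: bool = False) -> str:
    """Return the action for a user message, defaulting to the SSE path."""
    if has_file:
        # A message that carries a file is treated as an upload.
        return "upload"
    text = message.lower()
    for action, keywords in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    # Everything else (generate, edit, add BGM) goes to the SSE endpoint.
    return "sse"
```

Plain substring matching is deliberately crude: a request such as "don't export yet" would still route to export, which is where intent classification would have to take over.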

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Base URL: https://mega-api-prod.nemovideo.ai

| Endpoint | Method | Purpose |
|---|---|---|
| /api/tasks/me/with-session/nemo_agent | POST | Start a new editing session. Body: {"task_name":"project","language":"<lang>"}. Returns session_id. |
| /run_sse | POST | Send a user message. Body includes app_name, session_id, new_message. Stream response with Accept: text/event-stream. Timeout: 15 min. |
| /api/upload-video/nemo_agent/me/<sid> | POST | Upload a file (multipart) or URL. |
| /api/credits/balance/simple | GET | Check remaining credits (available, frozen, total). |
| /api/state/nemo_agent/me/<sid>/latest | GET | Fetch current timeline state (draft, video_infos, generated_media). |
| /api/render/proxy/lambda | POST | Start export. Body: {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll status every 30s. |
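As an illustration of the export row, the sketch below builds the request body and polls at the stated 30-second interval. It assumes `<ts>` is a Unix timestamp in seconds, which the table does not confirm. The table also does not say which endpoint reports render status, so `check_status` is a hypothetical callable supplied by the caller.

```python
import time
from typing import Callable, Optional


def build_export_body(session_id: str, draft: dict, ts: Optional[int] = None) -> dict:
    """Build the JSON body for POST /api/render/proxy/lambda."""
    # Assumption: <ts> is a Unix timestamp in seconds.
    ts = int(time.time()) if ts is None else ts
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll(
    check_status: Callable[[], Optional[str]],
    interval_s: int = 30,
    max_polls: int = 30,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[str]:
    """Call check_status every interval_s seconds until it returns a result."""
    for attempt in range(max_polls):
        result = check_status()
        if result is not None:
            return result
        if attempt < max_polls - 1:
            sleep(interval_s)
    # Gave up: the caller decides how to report the timeout.
    return None
```

Passing `sleep` as a parameter keeps the loop testable without real waiting.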

Accepted file types: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

| Header | Value |
|---|---|
| X-Skill-Source | ai-video-maker-ai |
| X-Skill-Version | frontmatter version |
| X-Skill-Platform | auto-detect: clawhub / cursor / unknown from install path |

All requests must include: Authorization: Bearer <NEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.
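A small helper can assemble these headers in one place. How the platform is actually detected from the install path is not specified here, so the substring check in `detect_platform` is purely an assumption, and both function names are hypothetical.

```python
def detect_platform(install_path: str) -> str:
    """Guess the platform from the install path (assumed substring match)."""
    path = install_path.lower()
    for name in ("clawhub", "cursor"):
        if name in path:
            return name
    return "unknown"


def build_headers(token: str, version: str, install_path: str) -> dict:
    """Return the auth and attribution headers required on every request."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "ai-video-maker-ai",
        # Should match the version declared in the SKILL.md frontmatter.
        "X-Skill-Version": version,
        "X-Skill-Platform": detect_platform(install_path),
    }
```

For example, `build_headers(token, "1.0.0", "/home/user/.cursor/skills/ai-video-maker-ai")` would set X-Skill-Platform to `cursor` under this assumed rule.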

Error Codes

  • 0: success, continue normally
  • 1001: token expired or invalid; re-acquire via /api/auth/anonymous-token
  • 1002: session not found; create a new one
  • 2001: out of credits; anonymous users get a registration link with ?bind=<id>, registered users top up
  • 4001: unsupported file type; show accepted formats
  • 4002: file too large; suggest compressing or trimming
  • 400: missing X-Client-Id; generate one and retry
  • 402: export blocked on the free plan; this is a subscription-tier restriction, not a credit issue
  • 429: rate limited; wait 30s and retry once
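The list above maps naturally onto a lookup table. The action labels below are hypothetical names chosen for this sketch, and the fallback for unlisted codes is an assumption, since no default behavior is documented.

```python
from typing import Optional

# Codes come from the list above; the action labels are illustrative only.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reacquire_token",
    1002: "create_session",
    2001: "out_of_credits",
    4001: "show_accepted_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "explain_subscription_tier",
    429: "wait_and_retry_once",
}


def next_action(code: int) -> str:
    """Map a response code to the next step; unknown codes get reported."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")


def retry_delay_seconds(code: int, attempts_so_far: int) -> Optional[int]:
    """Return the wait before a retry, or None when no retry is allowed."""
    # 429 is retried exactly once, after a 30-second wait.
    if code == 429 and attempts_so_far == 0:
        return 30
    return None
```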

SSE Event Handling

| Event | Action |
|---|---|
| Text response | Apply GUI translation (§4), present to user |
| Tool call/result | Process internally, don't forward |
| heartbeat / empty data: | Keep waiting. Every 2 min: "⏳ Still working..." |
| Stream closes | Process final response |

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.
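The fallback described above can be sketched as a small stream reader that collects text and flags when a state poll is needed. The wire format is assumed here: text is taken to arrive as plain `data:` payloads and keep-alives as events named `heartbeat` or as empty `data:` lines. The real payloads may be JSON, and tool-call events are not distinguished in this sketch.

```python
from typing import Iterable, Tuple


def collect_text(lines: Iterable[str]) -> Tuple[str, bool]:
    """Gather text from SSE lines; the bool says whether to poll session state."""
    chunks = []
    event = "message"
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line == "":
            # A blank line ends the current event.
            event = "message"
            continue
        if line.startswith(":"):
            # SSE comment line, commonly used as a keep-alive.
            continue
        field, _, value = line.partition(":")
        if value.startswith(" "):
            value = value[1:]
        if field == "event":
            event = value
        elif field == "data" and event != "heartbeat" and value:
            chunks.append(value)
    text = "\n".join(chunks)
    # No text at all: verify the edit by fetching the latest session state.
    return text, text == ""
```

When the second value is true, the caller would fetch /api/state/nemo_agent/me/<sid>/latest and summarize the changes itself.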

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks):
1. Video: city timelapse (0-10s)
2. BGM: Lo-fi (0-10s, 35%)
3. Title: "Urban Dreams" (0-3s)
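A summary of this kind could be derived from the draft's short keys roughly as follows. The nesting is an assumption: the sketch supposes `t` is a list of tracks, each with a `tt` type and an `sg` list of segments carrying `d` in milliseconds, laid end to end. Clip names and volume levels are omitted because the layout of `m` is not documented.

```python
# Track type codes from the draft JSON description.
TRACK_TYPES = {0: "Video", 1: "Audio", 7: "Text"}


def summarize(draft: dict) -> str:
    """Render a short timeline summary from a short-key draft."""
    tracks = draft.get("t", [])
    lines = [f"Timeline ({len(tracks)} tracks):"]
    for index, track in enumerate(tracks, start=1):
        label = TRACK_TYPES.get(track.get("tt"), "Unknown")
        # Assumption: segments play back to back, so durations add up.
        total_ms = sum(segment.get("d", 0) for segment in track.get("sg", []))
        lines.append(f"{index}. {label} ({total_ms / 1000:g}s)")
    return "\n".join(lines)
```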

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "turn these images into a 30-second promo video with music and text overlays" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, JPG, PNG for the smoothest experience.

Export as MP4 for widest compatibility.

Common Workflows

Quick edit: Upload → "turn these images into a 30-second promo video with music and text overlays" → Download MP4. Takes 1-2 minutes for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Usage Guidance
What to check before installing:
  1. Confirm the config-path mismatch: ask the author whether the skill will read/write ~/.config/nemovideo/ (and what exactly is stored there).
  2. Verify the external domain (mega-api-prod.nemovideo.ai) is the intended upstream and acceptable for your data; uploaded media will go to their cloud GPUs.
  3. If you prefer not to store long-lived tokens locally, use the anonymous-token flow and verify how/where that token/session is persisted and how long it lives.
  4. Be cautious about uploading sensitive images or proprietary assets to an external service.
  5. Check privacy/retention and billing (credits/subscription) behavior; SKILL.md references credit limits and potential paid tiers.
  6. If anything is unclear or you cannot verify the config/storage behavior, treat the skill as untrusted or revoke tokens after use.
Capability Analysis
Type: OpenClaw Skill
Name: ai-video-maker-ai
Version: 1.0.0

The ai-video-maker-ai skill is a legitimate integration for an AI video generation service. It provides detailed instructions for the agent to manage authentication, sessions, and file uploads to the 'nemovideo.ai' API. The code and instructions in SKILL.md are strictly aligned with the stated purpose of converting images and clips into videos, with no evidence of data exfiltration, malicious execution, or harmful prompt injection. Standard security practices, such as masking tokens in output, are encouraged.
Capability Assessment
Purpose & Capability
Name and description match the actions described (upload images/clips, create renders, check credits, export). Requesting a NEMO_TOKEN and calling nemovideo.ai endpoints is proportionate. However, SKILL.md frontmatter lists a config path (~/.config/nemovideo/) that the registry metadata did not declare, which is an inconsistency to clarify.
Instruction Scope
Instructions direct the agent to obtain/use a bearer token (NEMO_TOKEN or anonymous-token flow), create sessions, upload files, start renders, poll SSE, and save session_id. Those are all within the stated video-creation purpose. The SKILL.md requires attribution headers tied to the skill frontmatter and asks the agent to 'auto-detect' platform/install path — this implies the agent may read install path metadata. No instructions request unrelated files or other credentials.
Install Mechanism
Instruction-only skill with no install spec and no code files. Lowest install risk: nothing is downloaded or written by an installer step in the package.
Credentials
Only one environment credential is declared (NEMO_TOKEN / primaryEnv). That aligns with needing an API bearer token for the external service. The SKILL.md also supports an anonymous token flow (no secret), which reduces credential exposure. No unrelated credentials are requested.
Persistence & Privilege
The SKILL.md frontmatter indicates use of ~/.config/nemovideo/ (config path) for session/token storage, but the registry metadata shows no required config paths — a mismatch. The instructions also say to save session_id and use tokens for subsequent calls, implying local persistence. Confirm where session tokens are stored and what is written to disk before installing.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ai-video-maker-ai
  3. After installation, invoke the skill by name or use /ai-video-maker-ai
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release of AI Video Maker AI.
  • Instantly turn up to five product images and a logo into 1080p AI-generated videos via simple text prompts.
  • Automatically handles video creation, music, and text overlays without timeline editing or manual export steps.
  • One-click setup with token management for both free and authenticated users; 100 free credits available for new users.
  • Upload and process common video, image, and audio file types (mp4, jpg, png, mp3, etc.), up to 500MB per file.
  • Efficient, cloud-based rendering with typical video delivery within 1–2 minutes.
  • Full session management: check credits, timeline status, or export videos directly by chat command.
Metadata
Slug ai-video-maker-ai
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is Ai Video Maker Ai?

Turn five product images and a logo file into 1080p AI-generated videos just by typing what you need. Whether it's creating videos from images or clips using... It is an AI Agent Skill for Claude Code / OpenClaw, with 86 downloads so far.

How do I install Ai Video Maker Ai?

Run "/install ai-video-maker-ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Ai Video Maker Ai free?

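The skill itself is free: it is licensed under MIT-0 and costs nothing to download or install. Video generation runs on the nemovideo.ai backend, which meters usage in credits (an anonymous token comes with 100 credits and a 7-day expiry) and may block export on the free plan.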

Which platforms does Ai Video Maker Ai support?

Ai Video Maker Ai is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Ai Video Maker Ai?

It is built and maintained by dsewell-583h0 (@dsewell-583h0); the current version is v1.0.0.
