Description

Skip the learning curve of professional editing software. Describe what you want — turn this script into an explainer video with voiceover and animated slide...

README (SKILL.md)

Getting Started

Ready when you are. Drop your text or script here or describe what you want to make.

Try saying:

"create a 200-word product description script into a 1080p MP4"
"turn this script into an explainer video with voiceover and animated slides"
"creating explainer videos from scripts or text without manual animation for marketers, startup founders, educators"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: \x3Cuuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.

Explainer Video Online — Create Explainer Videos from Script

Name: Explainer Video Online
Author: mhogan2013-9

Send me your text or script and describe the result you want. The AI explainer video creation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload a 200-word product description script, type "turn this script into an explainer video with voiceover and animated slides", and you'll get a 1080p MP4 back in roughly 1-2 minutes. All rendering happens server-side.

Worth noting: shorter scripts under 90 seconds produce the most focused and watchable explainer videos.

Matching Input to Actions

User prompts referencing explainer video online, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Base URL: https://mega-api-prod.nemovideo.ai

Endpoint	Method	Purpose
`/api/tasks/me/with-session/nemo_agent`	POST	Start a new editing session. Body: `{"task_name":"project","language":"\x3Clang>"}`. Returns `session_id`.
`/run_sse`	POST	Send a user message. Body includes `app_name`, `session_id`, `new_message`. Stream response with `Accept: text/event-stream`. Timeout: 15 min.
`/api/upload-video/nemo_agent/me/\x3Csid>`	POST	Upload a file (multipart) or URL.
`/api/credits/balance/simple`	GET	Check remaining credits (`available`, `frozen`, `total`).
`/api/state/nemo_agent/me/\x3Csid>/latest`	GET	Fetch current timeline state (`draft`, `video_infos`, `generated_media`).
`/api/render/proxy/lambda`	POST	Start export. Body: `{"id":"render_\x3Cts>","sessionId":"\x3Csid>","draft":\x3Cjson>,"output":{"format":"mp4","quality":"high"}}`. Poll status every 30s.

Accepted file types: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Skill attribution — read from this file's YAML frontmatter at runtime:

X-Skill-Source: explainer-video-online
X-Skill-Version: from frontmatter version
X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer \x3CNEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.

Error Codes

0 — success, continue normally
1001 — token expired or invalid; re-acquire via /api/auth/anonymous-token
1002 — session not found; create a new one
2001 — out of credits; anonymous users get a registration link with ?bind=\x3Cid>, registered users top up
4001 — unsupported file type; show accepted formats
4002 — file too large; suggest compressing or trimming
400 — missing X-Client-Id; generate one and retry
402 — free plan export blocked; not a credit issue, subscription tier
429 — rate limited; wait 30s and retry once

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

Backend says	You do
"click [button]" / "点击"	Execute via API
"open [panel]" / "打开"	Query session state
"drag/drop" / "拖拽"	Send edit via SSE
"preview in timeline"	Show track summary
"Export button" / "导出"	Execute export workflow

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "turn this script into an explainer video with voiceover and animated slides" — concrete instructions get better results.

Max file size is 200MB. Stick to MP4, MOV, TXT, DOCX for the smoothest experience.

Export as MP4 for widest compatibility across websites, presentations, and social platforms.

Common Workflows

Quick edit: Upload → "turn this script into an explainer video with voiceover and animated slides" → Download MP4. Takes 1-2 minutes for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Usage Guidance

This skill appears to be a legitimate client for a remote explainer‑video API, but there are a few things to confirm before using it: - Ask the publisher why the SKILL.md frontmatter lists a local config path (~/.config/nemovideo/) while the registry entry shows no required config paths. Confirm whether the skill will read files from that directory and what it contains (tokens, logs, etc.). - Confirm exactly where NEMO_TOKEN and any anonymous tokens/session IDs are stored (in‑memory only, in an agent config, or written to disk) and how long they remain valid. - The skill instructs probing install paths (e.g., ~/.clawhub/, ~/.cursor/) to set X-Skill-Platform; ask whether this probing is strictly necessary and what information is derived/exposed by that check. - Be cautious about uploading sensitive files (private videos, audio, or documents) until you know the provider's retention, sharing, and training policies. - If you prefer, use ephemeral/limited tokens (anonymous tokens) and avoid providing long‑lived credentials. If you need higher assurance, request a privacy/security statement from the author explaining the filesystem checks and token handling. If the author can justify the filesystem checks and confirm no undisclosed config/file reads occur, the skill's footprint would be coherent with its purpose; until then, treat it with caution.

Capability Analysis

Type: OpenClaw Skill Name: explainer-video-online Version: 1.0.0 The skill bundle is a legitimate integration for a video generation service hosted at nemovideo.ai. It provides clear instructions for the AI agent to handle authentication via NEMO_TOKEN, manage editing sessions, and upload media files for server-side processing. The instructions include security-conscious practices such as suppressing the display of raw tokens and JSON responses, and the requested filesystem access is limited to identifying the installation environment for API attribution purposes.

Capability Assessment

✓ Purpose & Capability

Name/description and the runtime instructions consistently describe a remote explainer‑video service accessed via an API and requiring a single API token (NEMO_TOKEN). The actions (session creation, upload, render, SSE) are coherent with the stated purpose.

⚠ Instruction Scope

The SKILL.md tells the agent to read the skill's frontmatter and detect the agent's install path to set X-Skill-Platform, and the frontmatter metadata references a local config path (~/.config/nemovideo/). Probing install paths or arbitrary home paths to derive an attribution header extends the agent's filesystem access beyond what a pure 'upload script → render' flow strictly needs and should be justified. The skill also instructs the agent to upload user files to a third‑party endpoint (expected) — verify user consent and retention policy.

✓ Install Mechanism

This is an instruction‑only skill with no install spec or downloaded code; nothing will be written to disk by an installer as part of skill installation. That minimizes supply‑chain risk.

ℹ Credentials

Only a single credential (NEMO_TOKEN) is required, which is appropriate for an API client. However, the frontmatter also lists a config path (~/.config/nemovideo/) which is not declared in the registry metadata summary — a mismatch that could allow the skill to read local configuration files. The skill also instructs obtaining an anonymous token and using/saving it as NEMO_TOKEN; confirm where/how tokens and session IDs are stored.

✓ Persistence & Privilege

The skill is not force‑enabled (always:false) and model invocation is allowed (normal for skills). It asks the agent to persist a session_id and to use/store tokens for subsequent requests, which is expected for session-based APIs but should be transparent to the user.

Version History

v1.0.0

Initial release — create explainer videos from text or scripts with voiceover and animation in minutes. - Upload MP4, MOV, TXT, DOCX, and more (up to 200MB); AI handles all editing and rendering in the cloud. - Generate new sessions and free trial tokens automatically on first use. - Supports key actions: uploading, editing by description, previewing, exporting videos, and checking credit balance. - Export explainer videos as 1080p MP4 with animated slides and auto-generated voiceover. - Clear user guidance with tips, error code explanations, and example workflows.

Metadata

Slug explainer-video-online

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Explainer Video Online?

Skip the learning curve of professional editing software. Describe what you want — turn this script into an explainer video with voiceover and animated slide... It is an AI Agent Skill for Claude Code / OpenClaw, with 88 downloads so far.

How do I install Explainer Video Online?

Run "/install explainer-video-online" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Explainer Video Online free?

Yes, Explainer Video Online is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Explainer Video Online support?

Explainer Video Online is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Explainer Video Online?

It is built and maintained by mhogan2013-9 (@mhogan2013-9); the current version is v1.0.0.

More Skills

Explainer Video Online