功能描述

generate text prompts into AI generated videos with this skill. Works with MP4, MOV, WebM, GIF files up to 500MB. content creators use it for generating vide...

使用说明 (SKILL.md)

Getting Started

Send me your text prompts and I'll handle the AI video generation. Or just describe what you're after.

Try saying:

"generate a short text description of a product demo scene into a 1080p MP4"
"generate a 30-second video from this script about a coffee brand"
"generating videos from text prompts at no cost for content creators"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: \x3Cuuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.

Free to Generation — Generate Videos from Text Prompts

Name: Free To Generation
Author: susan4731-wilfordf

Send me your text prompts and describe the result you want. The AI video generation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload a short text description of a product demo scene, type "generate a 30-second video from this script about a coffee brand", and you'll get a 1080p MP4 back in roughly 1-2 minutes. All rendering happens server-side.

Worth noting: shorter and more specific prompts produce more accurate video results.

Matching Input to Actions

User prompts referencing free to generation, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Headers are derived from this file's YAML frontmatter. X-Skill-Source is free-to-generation, X-Skill-Version comes from the version field, and X-Skill-Platform is detected from the install path (~/.clawhub/ = clawhub, ~/.cursor/skills/ = cursor, otherwise unknown).

Include Authorization: Bearer \x3CNEMO_TOKEN> and all attribution headers on every request — omitting them triggers a 402 on export.

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent — body {"task_name":"project","language":"\x3Clang>"} — returns task_id, session_id.

Send message (SSE): POST /run_sse — body {"app_name":"nemo_agent","user_id":"me","session_id":"\x3Csid>","new_message":{"parts":[{"text":"\x3Cmsg>"}]}} with Accept: text/event-stream. Max timeout: 15 minutes.

Upload: POST /api/upload-video/nemo_agent/me/\x3Csid> — file: multipart -F "files=@/path", or URL: {"urls":["\x3Curl>"],"source_type":"url"}

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/\x3Csid>/latest — key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda — body {"id":"render_\x3Cts>","sessionId":"\x3Csid>","draft":\x3Cjson>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s until status = completed. Download URL at output.url.

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

SSE Event Handling

Event	Action
Text response	Apply GUI translation (§4), present to user
Tool call/result	Process internally, don't forward
`heartbeat` / empty `data:`	Keep waiting. Every 2 min: "⏳ Still working..."
Stream closes	Process final response

~30% of editing operations return no text in the SSE stream. When this happens: poll session state to verify the edit was applied, then summarize changes to the user.

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

"click" or "点击" → execute the action via the relevant endpoint
"open" or "打开" → query session state to get the data
"drag/drop" or "拖拽" → send the edit command through SSE
"preview in timeline" → show a text summary of current tracks
"Export" or "导出" → run the export workflow

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Error Handling

Code	Meaning	Action
0	Success	Continue
1001	Bad/expired token	Re-auth via anonymous-token (tokens expire after 7 days)
1002	Session not found	New session §3.0
2001	No credits	Anonymous: show registration URL with `?bind=\x3Cid>` (get `\x3Cid>` from create-session or state response when needed). Registered: "Top up credits in your account"
4001	Unsupported file	Show supported formats
4002	File too large	Suggest compress/trim
400	Missing X-Client-Id	Generate Client-Id and retry (see §1)
402	Free plan export blocked	Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export."
429	Rate limit (1 token/client/7 days)	Retry in 30s once

Common Workflows

Quick edit: Upload → "generate a 30-second video from this script about a coffee brand" → Download MP4. Takes 1-2 minutes for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "generate a 30-second video from this script about a coffee brand" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, WebM, GIF for the smoothest experience.

Export as MP4 for widest compatibility.

安全使用建议

This skill will send your text prompts and any files you choose to upload to the third‑party API at mega-api-prod.nemovideo.ai and requires a NEMO_TOKEN (or it will mint a short-lived anonymous token). Before installing, consider: only provide video/audio files you are comfortable uploading; do not supply unrelated secrets or private files; confirm you trust the nemovideo service and its privacy/retention policy; check where tokens/sessions are stored by the agent (the skill says not to print tokens but will persist session_id); and be aware of the small metadata mismatch (declared config path in SKILL.md vs registry). If you need stronger guarantees, ask the skill provider for a privacy/terms link or a hosted service homepage and verify the API domain and ownership.

功能分析

Type: OpenClaw Skill Name: free-to-generation Version: 1.0.0 The skill provides a functional interface for an AI video generation service hosted at nemovideo.ai. It outlines standard API interactions including anonymous token generation, session management, and file uploads. The requested permissions (NEMO_TOKEN and network access) are consistent with the stated purpose of cloud-based video rendering, and there is no evidence of malicious intent, data exfiltration, or unauthorized command execution in SKILL.md or _meta.json.

能力评估

✓ Purpose & Capability

Name and description claim text→video generation and the skill only requests a single service credential (NEMO_TOKEN) and endpoints for a video-rendering API — this is coherent. Note: the YAML frontmatter in SKILL.md lists a config path (~/.config/nemovideo/) even though the registry metadata reported no required config paths; this mismatch is minor but unexpected.

ℹ Instruction Scope

SKILL.md instructs the agent to obtain/use NEMO_TOKEN (or mint an anonymous token), create sessions, upload files (multipart or by URL), stream SSE messages, poll render status, and return download URLs. Those actions are appropriate for a remote render service. The instructions do reference local file paths for uploads and detect installation paths (~/.clawhub/, ~/.cursor/skills/) to set an attribution header — that implies the agent may read the install location or accept local file paths you provide. There are no instructions to read unrelated system files or other credentials.

✓ Install Mechanism

No install spec or executable downloads are present (instruction-only). Nothing is written to disk by an installer as part of the skill package itself.

✓ Credentials

Only one credential (NEMO_TOKEN) is required and is justified by the described API usage. The skill also supports obtaining a short-lived anonymous token via the service's auth endpoint; no unrelated secrets or multiple external credentials are requested.

✓ Persistence & Privilege

The skill does not request always:true and does not modify other skills or system-wide settings. It operates by calling remote APIs and handling session tokens for its own operations.

版本历史

v1.0.0

Initial release of Free to Generation — generate videos from text prompts. - Generate MP4, MOV, WebM, or GIF videos (up to 500MB) from natural language prompts. - Automatic free token provisioning for new users; quick cloud-based session setup. - 1080p video rendering on cloud GPUs, returning download links within 1–2 minutes. - File upload, export, session management, credits check, and status tracking supported. - Workflow examples, error handling, and translation of backend GUI instructions to text output included.

元数据

Slug free-to-generation

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题

Free To Generation 是什么？

generate text prompts into AI generated videos with this skill. Works with MP4, MOV, WebM, GIF files up to 500MB. content creators use it for generating vide... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 51 次。

如何安装 Free To Generation？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install free-to-generation」即可一键安装，无需额外配置。

Free To Generation 是免费的吗？

是的，Free To Generation 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Free To Generation 支持哪些平台？

Free To Generation 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Free To Generation？

由 susan4731-wilfordf（@susan4731-wilfordf）开发并维护，当前版本 v1.0.0。

Free To Generation