功能描述

convert video files into converted MP4 files with this skill. Works with MOV, AVI, WebM, MKV files up to 500MB. content creators use it for converting MOV or...

使用说明 (SKILL.md)

Getting Started

Send me your video files and I'll handle the video format conversion. Or just describe what you're after.

Try saying:

"convert a 3-minute MOV recording from an iPhone into a 1080p MP4"
"convert this MOV file to MP4 so I can upload it to YouTube"
"converting MOV or AVI videos to MP4 for sharing or uploading for content creators"

First-Time Connection

When a user first opens this skill, connect to the processing backend automatically. Briefly let them know (e.g. "Setting up...").

Authentication: Check if NEMO_TOKEN is set in the environment. If it is, skip to step 2.

Obtain a free token: Generate a random UUID as client identifier. POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id set to that UUID. The response data.token is your NEMO_TOKEN — 100 free credits, valid 7 days.
Create a session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Authorization: Bearer \x3Ctoken>, Content-Type: application/json, and body {"task_name":"project","language":"\x3Cdetected>"}. Store the returned session_id for all subsequent requests.

Keep setup communication brief. Don't display raw API responses or token values to the user.

Video to MP4 — Convert Any Video to MP4

Name: Video To Mp4
Author: mory128

This tool takes your video files and runs video format conversion through a cloud rendering pipeline. You upload, describe what you want, and download the result.

Say you have a 3-minute MOV recording from an iPhone and want to convert this MOV file to MP4 so I can upload it to YouTube — the backend processes it in about 20-40 seconds and hands you a 1080p MP4.

Tip: shorter clips convert faster and keep file sizes manageable for sharing.

Matching Input to Actions

User prompts referencing video to mp4, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

Session — POST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":"\x3Clang>"}. Gives you a session_id.
Chat (SSE) — POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
Upload — POST /api/upload-video/nemo_agent/me/\x3Csid> — multipart file or JSON with URLs.
Credits — GET /api/credits/balance/simple — returns available, frozen, total.
State — GET /api/state/nemo_agent/me/\x3Csid>/latest — current draft and media info.
Export — POST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s for completed status and download URL.

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Three attribution headers are required on every request and must match this file's frontmatter:

Header	Value
`X-Skill-Source`	`video-to-mp4`
`X-Skill-Version`	frontmatter `version`
`X-Skill-Platform`	auto-detect: `clawhub` / `cursor` / `unknown` from install path

Every API call needs Authorization: Bearer \x3CNEMO_TOKEN> plus the three attribution headers above. If any header is missing, exports return 402.

Draft JSON uses short keys: t for tracks, tt for track type (0=video, 1=audio, 7=text), sg for segments, d for duration in ms, m for metadata.

Example timeline summary:

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

"click" or "点击" → execute the action via the relevant endpoint
"open" or "打开" → query session state to get the data
"drag/drop" or "拖拽" → send the edit command through SSE
"preview in timeline" → show a text summary of current tracks
"Export" or "导出" → run the export workflow

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Error Codes

0 — success, continue normally
1001 — token expired or invalid; re-acquire via /api/auth/anonymous-token
1002 — session not found; create a new one
2001 — out of credits; anonymous users get a registration link with ?bind=\x3Cid>, registered users top up
4001 — unsupported file type; show accepted formats
4002 — file too large; suggest compressing or trimming
400 — missing X-Client-Id; generate one and retry
402 — free plan export blocked; not a credit issue, subscription tier
429 — rate limited; wait 30s and retry once

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "convert this MOV file to MP4 so I can upload it to YouTube" — concrete instructions get better results.

Max file size is 500MB. Stick to MOV, AVI, WebM, MKV for the smoothest experience.

H.264 codec gives the best balance of quality and file size for MP4 output.

Common Workflows

Quick edit: Upload → "convert this MOV file to MP4 so I can upload it to YouTube" → Download MP4. Takes 20-40 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

安全使用建议

This skill is reasonable for converting videos if you are comfortable uploading them to NemoVideo’s cloud service. Avoid private or confidential media unless you trust that provider, keep the NEMO_TOKEN secret, and be aware that a backend session and render job may persist while processing completes.

功能分析

Type: OpenClaw Skill Name: video-to-mp4 Version: 1.0.0 The skill is a legitimate integration for a cloud-based video conversion service (nemovideo.ai). It defines standard API workflows for authentication, session management, file uploads, and polling for render results. While it automatically fetches an anonymous token and uploads user-provided media to a third-party backend, these actions are transparently documented and directly support the stated purpose of converting video files to MP4.

能力评估

ℹ Purpose & Capability

The stated video-to-MP4 purpose is coherent with cloud upload, rendering, export, and download workflows. The skill also documents adjacent media editing/generation routes, which are related but broader than simple conversion.

ℹ Instruction Scope

The instructions tell the agent to connect to the backend automatically on first use and keep some tool/API details internal. This is disclosed in the skill text and appears setup-related, but users should understand network calls may occur before conversion.

ℹ Install Mechanism

There is no install spec or code file to execute, which reduces local execution risk. Provenance is limited because the source is unknown and there is no homepage.

ℹ Credentials

The NEMO_TOKEN credential and upload of user-provided video files are proportionate to a cloud conversion service. No artifact evidence shows token logging, unrelated credential use, or hidden transmission beyond the disclosed backend.

ℹ Persistence & Privilege

The skill stores a backend session_id for subsequent requests and notes cloud render jobs can be orphaned if the tab closes. This is disclosed and workflow-related, but users should be aware processing state may persist on the provider side.

版本历史

v1.0.0

- Initial release of Video to MP4 skill. - Convert MOV, AVI, WebM, or MKV files up to 500MB into 1080p MP4 format in 20–40 seconds. - Supports easy upload, session management, and download of converted videos via a cloud GPU backend. - Includes workflows for uploading, conversion, export, credit checks, and session state. - Provides clear error handling, automatic token setup, and session creation for seamless first-time use.

元数据

Slug video-to-mp4

版本 1.0.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 1

常见问题

Video To Mp4 是什么？

convert video files into converted MP4 files with this skill. Works with MOV, AVI, WebM, MKV files up to 500MB. content creators use it for converting MOV or... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 81 次。

如何安装 Video To Mp4？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install video-to-mp4」即可一键安装，无需额外配置。

Video To Mp4 是免费的吗？

是的，Video To Mp4 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Video To Mp4 支持哪些平台？

Video To Mp4 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 Video To Mp4？

由 mory128（@mory128）开发并维护，当前版本 v1.0.0。

Video To Mp4