Description

Get synced music videos ready to post, without touching a single slider. Upload your audio files (MP3, WAV, MP4, MOV, up to 500MB), say something like "sync...

README (SKILL.md)

Getting Started

Got audio files to work with? Send it over and tell me what you need — I'll take care of the AI music video creation.

Try saying:

"create a 3-minute MP3 track recorded March 30th into a 1080p MP4"
"sync visuals to the beat and generate a music video for track ab2n 0330"
"generating music videos from audio tracks for musicians and content creators"

Quick Start Setup

This skill connects to a cloud processing backend. On first use, set up the connection automatically and let the user know ("Connecting...").

Token check: Look for NEMO_TOKEN in the environment. If found, skip to session creation. Otherwise:

Generate a UUID as client identifier
POST https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with X-Client-Id header
Extract data.token from the response — this is your NEMO_TOKEN (100 free credits, 7-day expiry)

Session: POST https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Keep the returned session_id for all operations.

Let the user know with a brief "Ready!" when setup is complete. Don't expose tokens or raw API output.

Music AB2N 0330 — Turn Audio Tracks Into Videos

Name: Music Ab2n 0330
Author: dsewell-583h0

Send me your audio files and describe the result you want. The AI music video creation runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload a 3-minute MP3 track recorded March 30th, type "sync visuals to the beat and generate a music video for track ab2n 0330", and you'll get a 1080p MP4 back in roughly 1-2 minutes. All rendering happens server-side.

Worth noting: shorter audio segments under 3 minutes render noticeably faster.

Matching Input to Actions

User prompts referencing music ab2n 0330, aspect ratio, text overlays, or audio tracks get routed to the corresponding action via keyword and intent classification.

User says...	Action	Skip SSE?
"export" / "导出" / "download" / "send me the video"	→ §3.5 Export	✅
"credits" / "积分" / "balance" / "余额"	→ §3.3 Credits	✅
"status" / "状态" / "show tracks"	→ §3.4 State	✅
"upload" / "上传" / user sends file	→ §3.2 Upload	✅
Everything else (generate, edit, add BGM…)	→ §3.1 SSE	❌

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

All calls go to https://mega-api-prod.nemovideo.ai. The main endpoints:

Session — POST /api/tasks/me/with-session/nemo_agent with {"task_name":"project","language":"\x3Clang>"}. Gives you a session_id.
Chat (SSE) — POST /run_sse with session_id and your message in new_message.parts[0].text. Set Accept: text/event-stream. Up to 15 min.
Upload — POST /api/upload-video/nemo_agent/me/\x3Csid> — multipart file or JSON with URLs.
Credits — GET /api/credits/balance/simple — returns available, frozen, total.
State — GET /api/state/nemo_agent/me/\x3Csid>/latest — current draft and media info.
Export — POST /api/render/proxy/lambda with render ID and draft JSON. Poll GET /api/render/proxy/lambda/\x3Cid> every 30s for completed status and download URL.

Formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.

Skill attribution — read from this file's YAML frontmatter at runtime:

X-Skill-Source: music-ab2n-0330
X-Skill-Version: from frontmatter version
X-Skill-Platform: detect from install path (~/.clawhub/ → clawhub, ~/.cursor/skills/ → cursor, else unknown)

All requests must include: Authorization: Bearer \x3CNEMO_TOKEN>, X-Skill-Source, X-Skill-Version, X-Skill-Platform. Missing attribution headers will cause export to fail with 402.

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks): 1. Video: city timelapse (0-10s) 2. BGM: Lo-fi (0-10s, 35%) 3. Title: "Urban Dreams" (0-3s)

Translating GUI Instructions

The backend responds as if there's a visual interface. Map its instructions to API calls:

"click" or "点击" → execute the action via the relevant endpoint
"open" or "打开" → query session state to get the data
"drag/drop" or "拖拽" → send the edit command through SSE
"preview in timeline" → show a text summary of current tracks
"Export" or "导出" → run the export workflow

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.

Error Handling

Code	Meaning	Action
0	Success	Continue
1001	Bad/expired token	Re-auth via anonymous-token (tokens expire after 7 days)
1002	Session not found	New session §3.0
2001	No credits	Anonymous: show registration URL with `?bind=\x3Cid>` (get `\x3Cid>` from create-session or state response when needed). Registered: "Top up credits in your account"
4001	Unsupported file	Show supported formats
4002	File too large	Suggest compress/trim
400	Missing X-Client-Id	Generate Client-Id and retry (see §1)
402	Free plan export blocked	Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export."
429	Rate limit (1 token/client/7 days)	Retry in 30s once

Common Workflows

Quick edit: Upload → "sync visuals to the beat and generate a music video for track ab2n 0330" → Download MP4. Takes 1-2 minutes for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "sync visuals to the beat and generate a music video for track ab2n 0330" — concrete instructions get better results.

Max file size is 500MB. Stick to MP3, WAV, MP4, MOV for the smoothest experience.

Export as MP4 for widest compatibility across streaming and social platforms.

Usage Guidance

This skill uploads your audio and related metadata to an external service (mega-api-prod.nemovideo.ai) and either uses an existing NEMO_TOKEN from your environment or will obtain a short-lived anonymous token for you. Before installing or using it: (1) Confirm you trust the remote domain and understand its privacy/retention policy — don't upload private/confidential audio unless you trust the service. (2) If you already have a NEMO_TOKEN, prefer using it rather than letting the skill obtain an anonymous token. (3) Ask the author why the metadata declares ~/.config/nemovideo/ as a required config path (the SKILL.md doesn't explain reading it). (4) If you are uncomfortable with the skill probing install paths or contacting an unknown backend, do not enable it. Additional information that would raise confidence: a public homepage or documentation for the backend, privacy/retention terms, and clarification about the purpose of the declared config path.

Capability Analysis

Type: OpenClaw Skill Name: music-ab2n-0330 Version: 1.0.0 The skill is a functional integration for an AI music video generation service hosted at mega-api-prod.nemovideo.ai. It manages session tokens, file uploads, and server-sent events (SSE) for video processing as described in SKILL.md. While it performs platform detection by checking its own installation path and handles authentication tokens, these behaviors are transparently documented and consistent with the tool's stated purpose of providing a cloud-based rendering pipeline.

Capability Assessment

ℹ Purpose & Capability

Name/description (turn audio into videos) align with required credential NEMO_TOKEN and the described API endpoints for upload, render, and export. One minor inconsistency: metadata declares a required config path (~/.config/nemovideo/) but the SKILL.md does not explain reading or needing that config directory.

ℹ Instruction Scope

SKILL.md stays within the stated purpose: it instructs uploading user audio, creating sessions, streaming SSE edits, polling render status, and returning download URLs. It also instructs generating an anonymous token via a POST to the vendor API when NEMO_TOKEN is absent — this is expected for an anonymous/credit flow but means the skill will contact an external service and transmit a UUID and uploaded media. The instructions also ask the agent to read the file's YAML frontmatter for attribution and to detect install path (probing ~/.clawhub or ~/.cursor/skills/) which requires checking the user's filesystem; this is plausible for attribution but is additional file-system access beyond simple upload.

✓ Install Mechanism

No install spec or code is present — instruction-only skill. This is the lowest install risk; nothing will be written to disk by an install step. Runtime network calls are the main surface area.

ℹ Credentials

Only one declared environment variable (NEMO_TOKEN / primaryEnv) is required, which is proportional to a cloud API service. The metadata's configPaths entry (~/.config/nemovideo/) is unexplained by the SKILL.md and could imply additional local config access; that mismatch is worth questioning.

✓ Persistence & Privilege

always is false and the skill does not request elevated or persistent system privileges. Autonomous invocation is allowed (platform default) but not combined with other high-privilege requests here.

Version History

v1.0.0

Initial release: Easily turn audio tracks into AI-generated, synced music videos with quick cloud rendering. - Upload MP3, WAV, MP4, or MOV files (up to 500MB) and generate ready-to-share 1080p MP4 music videos in 1-2 minutes. - Automated, no-slider workflow—just describe your request, and the service syncs visuals to your track’s beat. - Built-in environment/token management; auto-creates sessions and manages cloud jobs transparently. - All major actions supported: upload, generate, credits check, export, and project state queries. - Real-time feedback, visual timeline summaries, and error handling for common issues (file types, credits, export limits).

Metadata

Slug music-ab2n-0330

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Music Ab2n 0330?

Get synced music videos ready to post, without touching a single slider. Upload your audio files (MP3, WAV, MP4, MOV, up to 500MB), say something like "sync... It is an AI Agent Skill for Claude Code / OpenClaw, with 23 downloads so far.

How do I install Music Ab2n 0330?

Run "/install music-ab2n-0330" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Music Ab2n 0330 free?

Yes, Music Ab2n 0330 is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Music Ab2n 0330 support?

Music Ab2n 0330 is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Music Ab2n 0330?

It is built and maintained by dsewell-583h0 (@dsewell-583h0); the current version is v1.0.0.

More Skills

Music Ab2n 0330