Add Music To Ai

by tk8544-b · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 96 · Stars: 0 · Active Installs: 0 · Versions: 1
Install in OpenClaw
/install add-music-to-ai
Description
Turn silent AI videos into music-backed videos with this skill. Works with MP4, MOV, AVI, WebM files up to 500MB. Content creators use it for adding background...
README (SKILL.md)

Getting Started

Send me your silent AI video and I'll handle the AI music addition. Or just describe what you're after.

Try saying:

  • "turn a 60-second AI-generated silent video clip into a 1080p MP4 with background music"
  • "add background music to my AI video automatically"
  • "add background music to AI-generated videos for content creators"

Automatic Setup

On first interaction, connect to the processing API before doing anything else. Show a brief status like "Setting things up...".

Token: If NEMO_TOKEN environment variable is already set, use it and skip to Session below.

Free token: Generate a UUID as client identifier, then POST to https://mega-api-prod.nemovideo.ai/api/auth/anonymous-token with header X-Client-Id: <uuid>. The response field data.token becomes your NEMO_TOKEN (100 credits, 7-day expiry).

Session: POST to https://mega-api-prod.nemovideo.ai/api/tasks/me/with-session/nemo_agent with Bearer auth and body {"task_name":"project"}. Save session_id from the response.

Confirm to the user you're connected and ready. Don't print tokens or raw JSON.
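
The setup steps above can be sketched as follows. This is a minimal illustration, not the skill's actual implementation: it only builds the requests, and `fetch_json` is a hypothetical callable (not part of the source) that sends a request and returns the parsed JSON body. The attribution headers required on every call are left out here for brevity.

```python
import json
import uuid
import urllib.request

API_BASE = "https://mega-api-prod.nemovideo.ai"


def build_anonymous_token_request(client_id: str) -> urllib.request.Request:
    """Build (but do not send) the POST that asks for a free anonymous token."""
    return urllib.request.Request(
        f"{API_BASE}/api/auth/anonymous-token",
        method="POST",
        headers={"X-Client-Id": client_id},
    )


def build_session_request(token: str) -> urllib.request.Request:
    """Build (but do not send) the POST that creates a session."""
    body = json.dumps({"task_name": "project"}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/api/tasks/me/with-session/nemo_agent",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )


def resolve_token(env: dict, fetch_json) -> str:
    """Use NEMO_TOKEN from env when set, otherwise request an anonymous token."""
    token = env.get("NEMO_TOKEN")
    if token:
        return token
    client_id = str(uuid.uuid4())  # fresh UUID as the client identifier
    payload = fetch_json(build_anonymous_token_request(client_id))
    return payload["data"]["token"]  # response field data.token
```

In real use, `env` would be `os.environ`. Passing it in as a plain mapping keeps the token lookup easy to exercise without touching the process environment.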

Add Music to AI Video — Add Music and Export Videos

Send me your silent AI video and describe the result you want. The AI music addition runs on remote GPU nodes — nothing to install on your machine.

A quick example: upload a 60-second AI-generated silent video clip, type "add background music to my AI video automatically", and you'll get a 1080p MP4 back in roughly 20-40 seconds. All rendering happens server-side.

Worth noting: clips under 60 seconds get music matched and applied fastest.

Matching Input to Actions

User prompts that mention adding music to AI video, aspect ratio, text overlays, or audio tracks are routed to the corresponding action by keyword and intent classification.

| User says... | Action | Skip SSE? |
|---|---|---|
| "export" / "导出" / "download" / "send me the video" | → §3.5 Export | |
| "credits" / "积分" / "balance" / "余额" | → §3.3 Credits | |
| "status" / "状态" / "show tracks" | → §3.4 State | |
| "upload" / "上传" / user sends file | → §3.2 Upload | |
| Everything else (generate, edit, add BGM…) | → §3.1 SSE | |
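
The keyword half of this routing could look roughly like the sketch below. It covers only naive substring matching, not intent classification, and the action names (`"export"`, `"sse"`, and so on) and the `has_file` flag are illustrative labels rather than identifiers from the source.

```python
# Keyword groups taken from the routing table; order decides ties.
ROUTES = [
    (("export", "导出", "download", "send me the video"), "export"),
    (("credits", "积分", "balance", "余额"), "credits"),
    (("status", "状态", "show tracks"), "state"),
    (("upload", "上传"), "upload"),
]


def route(message: str, has_file: bool = False) -> str:
    """Return the action for a user message; fall back to the SSE path."""
    if has_file:
        return "upload"  # the user sent a file
    text = message.lower()
    for keywords, action in ROUTES:
        if any(keyword in text for keyword in keywords):
            return action
    return "sse"  # everything else: generate, edit, add BGM, ...
```

Substring matching is deliberately simple here; a message such as "don't export yet" would still route to export, which is the kind of case intent classification would need to catch.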

Cloud Render Pipeline Details

Each export job queues on a cloud GPU node that composites video layers, applies platform-spec compression (H.264, up to 1080x1920), and returns a download URL within 30-90 seconds. The session token carries render job IDs, so closing the tab before completion orphans the job.

Every API call needs Authorization: Bearer <NEMO_TOKEN> plus the three attribution headers listed below. If any header is missing, exports return 402.

Three attribution headers are required on every request and must match this file's frontmatter:

| Header | Value |
|---|---|
| X-Skill-Source | add-music-to-ai |
| X-Skill-Version | frontmatter version |
| X-Skill-Platform | auto-detect: clawhub / cursor / unknown from install path |
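
A minimal sketch of assembling these headers is shown below. The source does not say how the platform is detected from the install path, so the substring check in `detect_platform` is an assumption, and both function names are hypothetical.

```python
def detect_platform(install_path: str) -> str:
    """Guess the platform from the install path (assumed: substring match)."""
    lowered = install_path.lower()
    for name in ("clawhub", "cursor"):
        if name in lowered:
            return name
    return "unknown"


def build_headers(token: str, skill_version: str, install_path: str) -> dict:
    """Return the auth header plus the three attribution headers."""
    return {
        "Authorization": f"Bearer {token}",
        "X-Skill-Source": "add-music-to-ai",
        "X-Skill-Version": skill_version,  # should match the frontmatter version
        "X-Skill-Platform": detect_platform(install_path),
    }
```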

API base: https://mega-api-prod.nemovideo.ai

Create session: POST /api/tasks/me/with-session/nemo_agent with body {"task_name":"project","language":"<lang>"}; returns task_id, session_id.

Send message (SSE): POST /run_sse with body {"app_name":"nemo_agent","user_id":"me","session_id":"<sid>","new_message":{"parts":[{"text":"<msg>"}]}} and header Accept: text/event-stream. Max timeout: 15 minutes.
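
As an illustration, the message body can be built like this. The function and constant names are hypothetical; only the JSON shape and the 15-minute limit come from the description above.

```python
import json

# Maximum time to wait on the stream, per the 15-minute limit.
SSE_TIMEOUT_SECONDS = 15 * 60


def build_run_sse_body(session_id: str, message: str) -> bytes:
    """Encode the JSON body for POST /run_sse."""
    payload = {
        "app_name": "nemo_agent",
        "user_id": "me",
        "session_id": session_id,
        "new_message": {"parts": [{"text": message}]},
    }
    # ensure_ascii=False keeps non-ASCII prompts readable in the payload.
    return json.dumps(payload, ensure_ascii=False).encode("utf-8")
```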

Upload: POST /api/upload-video/nemo_agent/me/<sid>. For a file, send multipart with -F "files=@/path"; for a URL, send {"urls":["<url>"],"source_type":"url"}.

Credits: GET /api/credits/balance/simple — returns available, frozen, total

Session state: GET /api/state/nemo_agent/me/<sid>/latest; key fields: data.state.draft, data.state.video_infos, data.state.generated_media

Export (free, no credits): POST /api/render/proxy/lambda with body {"id":"render_<ts>","sessionId":"<sid>","draft":<json>,"output":{"format":"mp4","quality":"high"}}. Poll GET /api/render/proxy/lambda/<id> every 30s until status = completed. Download URL at output.url.
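
The export-and-poll flow might be sketched as below. Several details are assumptions: `<ts>` is taken to be a Unix timestamp in seconds, `status` and `output.url` are read from the top level of the poll response, and `get_status` is a hypothetical callable that performs the GET and returns parsed JSON. The `max_polls` cap is not in the source; it is added so the loop cannot run forever.

```python
import time


def build_render_body(session_id: str, draft: dict, now=None) -> dict:
    """Build the export request body; `now` can be injected for testing."""
    ts = int(now if now is not None else time.time())  # assumed: seconds
    return {
        "id": f"render_{ts}",
        "sessionId": session_id,
        "draft": draft,
        "output": {"format": "mp4", "quality": "high"},
    }


def poll_render(get_status, render_id, interval=30.0, max_polls=20,
                sleep=time.sleep) -> str:
    """Poll until status is 'completed', then return the download URL."""
    for attempt in range(max_polls):
        job = get_status(render_id)
        if job.get("status") == "completed":
            return job["output"]["url"]
        if attempt < max_polls - 1:
            sleep(interval)  # wait 30s between polls by default
    raise TimeoutError(f"render {render_id} did not complete")
```

Injecting `get_status` and `sleep` keeps the loop free of network and timing concerns, so the polling logic can be checked in isolation.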

Supported formats: mp4, mov, avi, webm, mkv, jpg, png, gif, webp, mp3, wav, m4a, aac.
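
A client could check a file against this list before uploading, which avoids a round trip that would end in an unsupported-file error. The sketch below judges by file extension only, which is an assumption; the backend may inspect the actual content.

```python
from pathlib import PurePath

SUPPORTED_FORMATS = frozenset({
    "mp4", "mov", "avi", "webm", "mkv",   # video
    "jpg", "png", "gif", "webp",          # image
    "mp3", "wav", "m4a", "aac",           # audio
})


def is_supported(filename: str) -> bool:
    """Return True when the file extension is in the supported list."""
    extension = PurePath(filename).suffix.lower().lstrip(".")
    return extension in SUPPORTED_FORMATS
```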

Error Handling

| Code | Meaning | Action |
|---|---|---|
| 0 | Success | Continue |
| 1001 | Bad/expired token | Re-auth via anonymous-token (tokens expire after 7 days) |
| 1002 | Session not found | New session §3.0 |
| 2001 | No credits | Anonymous: show registration URL with ?bind=<id> (get <id> from create-session or state response when needed). Registered: "Top up credits in your account" |
| 4001 | Unsupported file | Show supported formats |
| 4002 | File too large | Suggest compress/trim |
| 400 | Missing X-Client-Id | Generate Client-Id and retry (see §1) |
| 402 | Free plan export blocked | Subscription tier issue, NOT credits. "Register or upgrade your plan to unlock export." |
| 429 | Rate limit (1 token/client/7 days) | Retry in 30s once |
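
One way to keep these rules in one place is a lookup table, sketched below. The action labels are hypothetical names for the behaviors in the table, and the fallback for unlisted codes is an assumption since the source does not define one.

```python
# Error code -> short label for the action described in the table.
ERROR_ACTIONS = {
    0: "continue",
    1001: "reauth_anonymous_token",
    1002: "create_new_session",
    2001: "handle_no_credits",
    4001: "show_supported_formats",
    4002: "suggest_compress_or_trim",
    400: "generate_client_id_and_retry",
    402: "prompt_register_or_upgrade",
    429: "retry_once_after_30s",
}


def action_for(code: int) -> str:
    """Map an error code to an action label; unknown codes get a fallback."""
    return ERROR_ACTIONS.get(code, "report_unknown_error")
```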

Backend Response Translation

The backend assumes a GUI exists. Translate these into API actions:

| Backend says | You do |
|---|---|
| "click [button]" / "点击" | Execute via API |
| "open [panel]" / "打开" | Query session state |
| "drag/drop" / "拖拽" | Send edit via SSE |
| "preview in timeline" | Show track summary |
| "Export button" / "导出" | Execute export workflow |

Reading the SSE Stream

Text events go straight to the user (after GUI translation). Tool calls stay internal. Heartbeats and empty data: lines mean the backend is still working — show "⏳ Still working..." every 2 minutes.

About 30% of edit operations close the stream without any text. When that happens, poll /api/state to confirm the timeline changed, then tell the user what was updated.
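
The stream handling described above could be sketched as follows. The event payload format is not documented here, so telling text events from tool calls is delegated to `extract_text`, a hypothetical callable that returns the user-facing text of a payload or `None` for anything internal. An empty result is the signal to poll the session state.

```python
def iter_sse_data(lines):
    """Yield the payload of each non-empty `data:` line in an SSE stream."""
    for raw in lines:
        line = raw.rstrip("\r\n")
        if not line.startswith("data:"):
            continue  # comment/heartbeat lines, other fields, blank separators
        payload = line[len("data:"):].strip()
        if payload:  # empty data: lines mean the backend is still working
            yield payload


def collect_text(lines, extract_text):
    """Return user-facing text chunks; an empty list means: poll the state."""
    texts = []
    for payload in iter_sse_data(lines):
        text = extract_text(payload)
        if text:
            texts.append(text)
    return texts
```

When `collect_text` returns an empty list after the stream closes, the caller would query the session state endpoint and describe what changed.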

Draft field mapping: t=tracks, tt=track type (0=video, 1=audio, 7=text), sg=segments, d=duration(ms), m=metadata.

Timeline (3 tracks):
  1. Video: city timelapse (0-10s)
  2. BGM: Lo-fi (0-10s, 35%)
  3. Title: "Urban Dreams" (0-3s)
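
A rough sketch of turning a draft into a track summary, using the field mapping above. The nesting is an assumption: tracks are read from `draft["t"]`, each track holds its segments under `sg`, and each segment carries its own `d` in milliseconds. Summing segment durations also assumes segments neither overlap nor leave gaps.

```python
TRACK_TYPES = {0: "video", 1: "audio", 7: "text"}


def summarize_draft(draft: dict) -> list:
    """Return one summary line per track: index, type, total duration."""
    lines = []
    for index, track in enumerate(draft.get("t", []), start=1):
        kind = TRACK_TYPES.get(track.get("tt"), "unknown")
        total_ms = sum(seg.get("d", 0) for seg in track.get("sg", []))
        lines.append(f"{index}. {kind}: {total_ms / 1000:g}s")
    return lines
```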

Tips and Tricks

The backend processes faster when you're specific. Instead of "make it look better", try "add background music to my AI video automatically" — concrete instructions get better results.

Max file size is 500MB. Stick to MP4, MOV, AVI, WebM for the smoothest experience.

Export as MP4 for widest compatibility across social platforms.

Common Workflows

Quick edit: Upload → "add background music to my AI video automatically" → Download MP4. Takes 20-40 seconds for a 30-second clip.

Batch style: Upload multiple files in one session. Process them one by one with different instructions. Each gets its own render.

Iterative: Start with a rough cut, preview the result, then refine. The session keeps your timeline state so you can keep tweaking.

Usage Guidance
This instruction-only skill appears to be what it says: it calls nemovideo.ai endpoints to add music and export videos and needs a NEMO_TOKEN to authenticate. Before installing, confirm you trust the domain (mega-api-prod.nemovideo.ai) because your uploaded videos and any generated tokens/session IDs will be sent there. Note the small metadata mismatch: SKILL.md references a config path (~/.config/nemovideo/) and auto-detecting an install path for X-Skill-Platform — ask the publisher what local reads (if any) the skill will perform. If you prefer not to provide a long-lived token, let the skill generate an anonymous token for temporary use as described. Finally, remember the skill will upload your video files to a third-party service — don't send sensitive footage unless you accept that external processing and storage policy.
Capability Analysis
Type: OpenClaw Skill · Name: add-music-to-ai · Version: 1.0.0
The skill is a functional integration for the NemoVideo cloud service, designed to add background music to silent AI videos. It manages authentication via the `NEMO_TOKEN` environment variable or an automated anonymous token generation process, handles file uploads, and interacts with the `mega-api-prod.nemovideo.ai` API for video processing and rendering. The instructions are well-defined, include security-conscious directives (e.g., not printing tokens), and lack any indicators of malicious intent, data exfiltration, or unauthorized system access.
Capability Assessment
Purpose & Capability
The skill is an instruction-only connector to a remote video-processing API and only requests a service token (NEMO_TOKEN) needed to authenticate with that API. Required headers, upload endpoints, and supported formats align with the video-processing purpose.
Instruction Scope
Instructions are focused on API actions (create session, upload file, run SSE, poll render). The skill expects to upload user-provided video files and to store session_id and short-lived tokens. Nothing in the instructions asks the agent to read unrelated local files or other environment secrets. Note: the frontmatter asks for a config path (~/.config/nemovideo/) and the skill expects to auto-detect an install path for X-Skill-Platform — these behaviors could require access to local paths/metadata and are not fully explained.
Install Mechanism
No install steps or binary downloads are present (instruction-only). This is low-risk: the skill does not write archives or install third-party code on disk.
Credentials
The only declared credential is NEMO_TOKEN, which is appropriate for a remote API service. However, SKILL.md frontmatter references a config path (~/.config/nemovideo/) not listed in the registry metadata; that discrepancy should be resolved. Otherwise no unrelated secrets or credentials are requested.
Persistence & Privilege
always is false and model invocation is allowed (platform default). The skill instructs saving session tokens and session_id for operation, which is reasonable for a remote render workflow. It does not request persistent system-wide privileges or modifications to other skills.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install add-music-to-ai
  3. After installation, invoke the skill by name or use /add-music-to-ai
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Add Music to AI Video v1.0.0
  • Initial release.
  • Add background music to AI-generated silent videos; supports MP4, MOV, AVI, WebM files up to 500MB.
  • Cloud-based processing returns 1080p MP4 videos in 20–40 seconds.
  • Automatic token and session management for quick setup.
  • Simple user commands: upload, edit, check credits, export, and download.
  • Error handling and usage tips included for a smooth experience.
Metadata
Slug: add-music-to-ai
Version: 1.0.0
License: MIT-0
All-time Installs: 0
Active Installs: 0
Total Versions: 1
Frequently Asked Questions

What is Add Music To Ai?

Add Music To Ai turns silent AI videos into music-backed videos. It works with MP4, MOV, AVI, WebM files up to 500MB. Content creators use it for adding background... It is an AI Agent Skill for Claude Code / OpenClaw, with 96 downloads so far.

How do I install Add Music To Ai?

Run "/install add-music-to-ai" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Add Music To Ai free?

Yes, Add Music To Ai is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Add Music To Ai support?

Add Music To Ai is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Add Music To Ai?

It is built and maintained by tk8544-b (@tk8544-b); the current version is v1.0.0.
