← Back to Skills Marketplace
gprecious

AI Music Video

by Taejin Yoo · GitHub ↗ · v1.2.4
cross-platform ✓ Security Clean
1340
Downloads
1
Stars
10
Active Installs
7
Versions
Install in OpenClaw
/install ai-music-video
Description
Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music...
README (SKILL.md)

AI Music Video Generator

Create complete music videos: AI music + AI visuals + ffmpeg assembly.

Quick Start

"90년대 보이밴드 풍 한국어 노래 만들어줘" → music only
"발라드 뮤비 만들어줘" → music + slideshow MV
"EDM 뮤비 풀영상으로" → music + video clips MV
"Suno 뮤비로 만들어줘" → Suno native music video

Workflow

1. Plan scenes from lyrics/mood

Before generating, create prompts.json — array of scene descriptions derived from the song's lyrics, mood, and narrative. 8-12 scenes for a 3-min song.

[
  {"prompt": "Neon-lit city street at night, rain reflections", "type": "image"},
  {"prompt": "Camera slowly panning across a rooftop at sunset", "type": "video"},
  "A lone figure walking through cherry blossoms"
]

2. Generate music

bash scripts/suno_music.sh \
  --prompt "가사 또는 설명" \
  --style "90s boy band pop, korean" \
  --title "너만을 원해" \
  --model V4_5ALL --custom \
  --outdir /tmp/mv_project

Options:

  • --model V4_5ALL (default), V5, V4_5PLUS, V4_5, V4
  • --instrumental — no vocals
  • --vocal-gender m|f — vocal gender hint
  • --negative-tags "Heavy Metal, Drums" — styles to avoid
  • --music-video — generate Suno native music video (MP4)
  • --dry-run — cost check only

Persona (일관된 스타일 유지):

  • --persona-id ID — 기존 페르소나 사용 (같은 보컬/스타일로 여러 곡 생성)
  • --create-persona — 생성된 곡에서 페르소나 생성 → persona.json 저장
  • --persona-name "이름" / --persona-desc "설명" / --persona-style "스타일"

Auto features:

  • 🎤 Timestamped Lyrics: Non-instrumental tracks automatically fetch lyrics timestamps and save as {outdir}/lyrics.srt
  • 🎬 Suno Native MV: With --music-video, Suno generates a visualized MP4 video directly
  • 🎭 Persona: With --create-persona, extracts voice/style identity for reuse

3. Generate visuals (custom MV flow)

bash scripts/gen_visuals.sh \
  --mode slideshow \
  --prompts-file /tmp/mv_project/prompts.json \
  --image-provider seedream \
  --outdir /tmp/mv_project

Or with OpenAI (cheaper, lower res):

bash scripts/gen_visuals.sh \
  --mode slideshow \
  --prompts-file /tmp/mv_project/prompts.json \
  --image-provider openai --image-model gpt-image-1-mini --image-quality medium \
  --outdir /tmp/mv_project

Add --dry-run first to show cost estimate before spending.

4. Assemble

bash scripts/assemble_mv.sh \
  --audio /tmp/mv_project/track_0_xxx.mp3 \
  --outdir /tmp/mv_project \
  --output /tmp/mv_project/final_mv.mp4 \
  --transition fade

Subtitle behavior:

  • Auto-detects {outdir}/lyrics.srt and overlays lyrics automatically
  • --subtitle /path/to/custom.srt — use custom SRT file
  • --no-subtitle — disable lyrics overlay entirely

Modes

Mode Visual Best For Cost (10 scenes)
slideshow AI images Fast, cheap ~$0.02 (mini low) / ~$0.09 (mini med) / ~$0.45 (Seedream)
video AI video clips Premium ~$1.40 (Seedance Lite) / ~$8.00 (Sora 2)
hybrid Mix of both Balanced ~$0.50-$4.00
suno-native Suno MV Easiest Suno credits only

Image cost is token-based — actual billing may be lower than listed estimates. Use --dry-run for precise cost.

Provider Options

Images: --image-provider seedream (recommended), openai, or google-together Image Model (OpenAI): --image-model gpt-image-1-mini (default, cheap) or gpt-image-1 (premium) Videos: --video-provider sora (default), sora-pro, seedance-lite, seedance-pro, veo-fast, veo-audio Quality: --image-quality low|medium|high

Cost Tracking

Every script outputs cost before and after. Always --dry-run first. Cost data saved to {outdir}/cost_estimate.json and {outdir}/visuals_meta.json.

Environment Variables

export SUNO_API_KEY="your-sunoapi-key"      # Required — sunoapi.org
export OPENAI_API_KEY="your-openai-key"     # Required — images + Sora video
export BYTEPLUS_API_KEY="your-byteplus-key" # Optional — Seedream 4.5 (recommended for images)
export TOGETHER_API_KEY="your-together-key" # Optional — Seedance, Veo, Imagen
export SUNO_CALLBACK_URL=""                 # Optional — see Callback URL below

⚠️ Required keys: SUNO_API_KEY and OPENAI_API_KEY must be set before running any script. BYTEPLUS_API_KEY is needed for Seedream image provider (sign up at console.byteplus.com, 200 free images). TOGETHER_API_KEY is only needed for Seedance/Veo/Imagen providers.

Callback URL

The Suno API requires a callBackUrl field for music generation requests. By default, if SUNO_CALLBACK_URL is not set, the script uses https://localhost/noop as a harmless no-op endpoint (an unreachable localhost URL that effectively disables callbacks).

To customize: set SUNO_CALLBACK_URL to your own endpoint, or set it to any dummy URL you control. The callback payload contains task metadata and audio URLs — no API keys are sent.

To disable: set SUNO_CALLBACK_URL=https://localhost/noop or any unreachable URL. Generation still works via polling; the callback is not required for the script to function.

Persona Workflow (채널 컨셉 유지)

YouTube 채널처럼 일관된 스타일로 여러 곡을 만들 때:

# 1. 첫 곡 생성 + 페르소나 만들기
bash scripts/suno_music.sh \
  --prompt "코드 리뷰하며 듣는 노래" \
  --style "indie rock, energetic, coding vibe" \
  --title "Pull Request" \
  --custom --create-persona \
  --persona-name "개발자 노동요 싱어" \
  --persona-desc "개발자가 코딩하며 듣기 좋은 에너지 넘치는 보컬. 인디록, 일렉, 팝 장르를 넘나든다." \
  --persona-style "indie rock, electronic, developer work music" \
  --outdir /tmp/dev-bgm-01

# 2. persona.json에서 personaId 확인
cat /tmp/dev-bgm-01/persona.json

# 3. 같은 페르소나로 다음 곡 생성 — 보컬/스타일 일관성 유지
bash scripts/suno_music.sh \
  --prompt "야근하면서 듣는 노래" \
  --style "electronic pop, night coding" \
  --title "Midnight Deploy" \
  --custom --persona-id \x3CPERSONA_ID> \
  --outdir /tmp/dev-bgm-02

페르소나는 보컬 특성 + 음악 스타일을 기억해서, 채널 전체의 통일감을 유지해줌.

Prerequisites

  • curl, python3, ffmpeg (for assembly)

References

  • SunoAPI details: Read references/sunoapi.md
  • Visual provider details: Read references/visual-providers.md
Usage Guidance
This skill appears to do what it says: it will call Suno and OpenAI (and optionally BytePlus/Together) using the API keys you provide, download generated media, and assemble a final MP4 with ffmpeg. Before installing or running: 1) Only supply SUNO_API_KEY and OPENAI_API_KEY you trust for this use; treat optional keys (BYTEPLUS_API_KEY, TOGETHER_API_KEY) the same. 2) Run dry-run modes first (scripts support --dry-run) to verify cost estimates and behavior without spending credits. 3) Do not set SUNO_CALLBACK_URL to an endpoint you don't control — callbacks include task metadata and media URLs and could leak information if pointed at an attacker-controlled server. 4) Ensure you understand and accept the potential costs of image/video generation. 5) Review and run the bundled tests in a safe environment (no production keys) if you want to validate behavior before use.
Capability Analysis
Type: OpenClaw Skill Name: ai-music-video Version: 1.2.4 The skill bundle appears benign. All scripts (`suno_music.sh`, `gen_visuals.sh`, `assemble_mv.sh`) use standard tools (`curl`, `python3`, `ffmpeg`) for their stated purpose of AI music and video generation. API keys are handled via environment variables and sent to legitimate AI service endpoints. Notably, `suno_music.sh` includes a strong security control by validating `SUNO_CALLBACK_URL` to only allow HTTPS or `https://localhost/noop`, preventing potential data exfiltration via arbitrary HTTP callbacks. Input sanitization is also present in `assemble_mv.sh` for `ffmpeg` arguments. No evidence of malicious execution, persistence, data exfiltration, or prompt injection against the agent was found in the code or `SKILL.md`.
Capability Assessment
Purpose & Capability
The skill requires SUNO_API_KEY and OPENAI_API_KEY and the binaries curl, python3, ffmpeg — all are directly used by the included scripts for music generation, image/video generation, and assembly. Optional env vars (BYTEPLUS_API_KEY, TOGETHER_API_KEY) match documented optional providers. No unrelated credentials or binaries are requested.
Instruction Scope
Runtime instructions and scripts call external provider APIs (Suno, OpenAI, BytePlus, Together) using declared env keys, download generated media to the specified outdir, and write metadata/cost files. The Suno callback URL is supported and will be invoked if you set SUNO_CALLBACK_URL to an https endpoint — the code defaults to a no-op localhost URL if unset. This is expected for the workflow but is an avenue for sending task metadata (audio URLs, task ids) to an external endpoint if you configure it.
Install Mechanism
There is no install spec (instruction-only with bundled scripts). Nothing is downloaded or installed by the skill itself; scripts run with system binaries already required. This minimizes install-time risk.
Credentials
Requested environment variables (SUNO_API_KEY, OPENAI_API_KEY) are proportionate and required for the services the skill uses. Optional keys (BYTEPLUS_API_KEY, TOGETHER_API_KEY) are justified for the documented providers. No unrelated secrets (cloud provider keys, SSH keys, etc.) are requested.
Persistence & Privilege
The skill is not marked always:true and does not modify other skills or system-wide agent configs. It only writes outputs and metadata into user-specified outdir paths. Model invocation is allowed (default) which is normal for skills.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ai-music-video
  3. After installation, invoke the skill by name or use /ai-music-video
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.2.4
Security fix: shell injection in assemble_mv.sh (filename escaping for ffmpeg concat lists), subtitle path escaping, python3 safe argv passing, SUNO_CALLBACK_URL restricted to https only
v1.2.3
- Changed the default SUNO_CALLBACK_URL to "https://localhost/noop" (previously "https://httpbin.org/post"), disabling callbacks by default instead of using a no-op echo service. - Updated SKILL.md documentation to clarify the new default and recommended practices for SUNO_CALLBACK_URL. - No functional script changes; documentation update only.
v1.2.2
- Improved robustness in scripts/assemble_mv.sh and scripts/suno_music.sh. - Minor fixes in test configuration (tests/conftest.py). - Updated metadata in _meta.json. - Documentation unchanged.
v1.2.1
Security fixes: default callback URL changed from httpbin.org to localhost/noop, fixed shell injection in prompt handling (temp file pattern), fixed broken conditional in gen_visuals.sh hybrid mode, added filename quoting in assemble_mv.sh
v1.2.0
v1.2.0: Add Seedream 4.5 (BytePlus) as recommended image provider. Add gpt-image-1-mini as default OpenAI model. Token-based cost tracking for accurate billing. Updated pricing tables and provider guide.
v1.1.0
v1.1.0: Security & metadata fixes per ClawHub review. - FIXED: Declared required env vars (SUNO_API_KEY, OPENAI_API_KEY) and optional (TOGETHER_API_KEY) in SKILL.md metadata - FIXED: callBackUrl is now configurable via SUNO_CALLBACK_URL env var (defaults to httpbin.org/post as harmless no-op; can be overridden or disabled) - FIXED: Corrected Suno API endpoints — /mp4/generate (music video), /generate/get-timestamped-lyrics (lyrics), /generate/generate-persona (persona), /mp4/record-info (MV polling) - FIXED: Sora 2 video API endpoint updated to POST /v1/videos (OpenAI) - Added documentation explaining callback behavior and how to disable it - Added Korean font note for subtitle rendering (fonts-nanum)
v1.0.0
Initial release: Suno V4.5/V5 music + OpenAI image/Sora video + ffmpeg assembly. Slideshow/video/hybrid modes. Timestamped lyrics SRT. Suno native MV. Persona support. Cost tracking. 25/25 tests passing.
Metadata
Slug ai-music-video
Version 1.2.4
License
All-time Installs 10
Active Installs 10
Total Versions 7
Frequently Asked Questions

What is AI Music Video?

Generate AI music videos end-to-end. Creates music with Suno (sunoapi.org), generates visuals with OpenAI/Seedream/Google/Seedance, and assembles into music... It is an AI Agent Skill for Claude Code / OpenClaw, with 1340 downloads so far.

How do I install AI Music Video?

Run "/install ai-music-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is AI Music Video free?

Yes, AI Music Video is completely free (open-source). You can download, install and use it at no cost.

Which platforms does AI Music Video support?

AI Music Video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created AI Music Video?

It is built and maintained by Taejin Yoo (@gprecious); the current version is v1.2.4.

💬 Comments