← Back to Skills Marketplace
dai-shuo

IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition

by Dai Shuo · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
189
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install ima-ai-music-song-voice-generator
Description
AI music generator and voice generator with Suno sonic v5, DouBao BGM, and DouBao Song. Generate AI music, songs with lyrics, background music, soundtracks,...
README (SKILL.md)

IMA AI Music & Voice Generator

⚠️ MANDATORY: You MUST read("SKILL-DETAIL.md") (full file — do NOT pass limit parameter) before your first music generation call. It contains the full API payload structure, Suno parameters, error translation tables, reflection mechanism, and UX protocol that this summary omits. Skipping it causes parameter errors and poor user experience.

Model Reference (CRITICAL)

Use exact model_id. Do NOT infer from friendly names.

Name model_id Cost Best For
Suno sonic v5 🔥 sonic 25 pts Full songs, lyrics, vocals, jingles, custom composition
DouBao BGM GenBGM 30 pts Background music, soundtracks, ambient loops
DouBao Song GenSong 30 pts Song generation, vocal compositions

Model Selection

User intent Model Why
Song with lyrics/vocals Suno sonic Full custom mode: lyrics, vocal_gender, tags
背景音乐 / BGM / ambient / loop DouBao GenBGM Background soundtrack specialist
Simple song / 歌曲 DouBao GenSong Quick song generation
Jingle / ad music / 广告配乐 Suno sonic Genre tags + style control
Beat / instrumental / 纯音乐 Suno sonic + make_instrumental=true Best instrumental quality
Voiceover / narration / 配音 See note below This skill focuses on music; for TTS use ima-tts-ai

Suno is the default — most versatile AI music generator with full composition control.

Suno Key Parameters

Parameter Description Example
custom_mode Enable lyrics/vocals/tags control true
vocal_gender male / female / mixed "female"
lyrics Custom lyrics text `"[Verse 1]\
Hello world..."`
make_instrumental Force instrumental, no vocals true
tags Genre/style tags "lo-fi hip hop, chill"
negative_tags Exclude styles "heavy metal, screaming"
title Song title "Summer Breeze"

Prompt tips for AI music composition:

  • Genre: "lo-fi hip hop", "orchestral cinematic", "upbeat pop", "jazz"
  • Tempo: "80 BPM", "fast tempo", "slow ballad"
  • Mood: "happy and energetic", "melancholic", "tense and dramatic"

Knowledge Base (if ima-knowledge-ai installed)

Read before generating: workflow-design.md (video+music coordination), model-selection.md (cost/quality).

Script Usage

# AI music generator — song with Suno (AI song generator, jingle, composition)
python3 {baseDir}/scripts/ima_voice_create.py \
  --api-key $IMA_API_KEY --task-type text_to_music \
  --model-id sonic --prompt "upbeat corporate jingle, 30 seconds" \
  --user-id {user_id} --output-json

# AI BGM generator — background soundtrack
python3 {baseDir}/scripts/ima_voice_create.py \
  --api-key $IMA_API_KEY --task-type text_to_music \
  --model-id GenBGM --prompt "calm ambient background music" \
  --user-id {user_id} --output-json

# List available music models
python3 {baseDir}/scripts/ima_voice_create.py \
  --api-key $IMA_API_KEY --task-type text_to_music --list-models

Sending Results to User

# ✅ CORRECT: Use remote URL for inline audio playback
message(action="send", media=audio_url, caption="✅ 音乐生成成功!\
• 模型:[Name]\
• 耗时:[X]s\
• 积分:[N pts]")

# Then send link for sharing
message(action="send", message=f"🔗 音频链接:\
{audio_url}")

UX Protocol (Brief)

  1. Acknowledge: Short reply ("好的!帮你生成音乐 🎵")
  2. Pre-gen: Model, time, cost via message tool
  3. Progress: Every 10-15s: "⏳ [P]%" (cap 95%)
  4. Success: Send media=url + link text
  5. Failure: Natural language + suggest alternatives
  6. Done: No further action

Generation times: DouBao BGM/Song: 10-25s · Suno: 20-45s Never expose: script names, API endpoints, attribute_id, technical params.

User Preferences

Storage: ~/.openclaw/memory/ima_prefs.json

  • Save on explicit: "用Suno" / "默认用BGM" / "always use Suno"
  • Clear on: "推荐一个" / "自动选择"
  • Never save auto-selected models

Core Flow

  1. GET /open/v1/product/list?category=text_to_musicattribute_id, credit, form_config
  2. POST /open/v1/tasks/createtask_id
  3. POST /open/v1/tasks/detail → poll every 5s until resource_status==1

MANDATORY: Always query product list first. Missing attribute_id → task fails. Suno note: model_version inside parameters.parameters must be sonic-v5, outer model_version is sonic.

Environment

Base URL: https://api.imastudio.com Headers: Authorization: Bearer $IMA_API_KEY · x-app-source: ima_skills · x_app_language: en


⚠️ REMINDER: read("SKILL-DETAIL.md") (full file, no limit) is required before generating music. This summary covers model selection and routing — SKILL-DETAIL.md has complete API payloads, Suno custom_mode details, error handling, reflection mechanism, and UX protocol needed for correct execution.

Usage Guidance
This skill is coherent with its stated purpose but you should still exercise normal caution: (1) only provide a scoped or test IMA_API_KEY until you trust the skill and its provider (imastudio.com); (2) review the bundled scripts (scripts/ima_voice_create.py and ima_logger.py) yourself — they are small and readable and claim to call only https://api.imastudio.com; (3) be aware the skill writes prefs and logs to ~/.openclaw — if you prefer, make those paths read-only or redirect them to /dev/null during testing; (4) test with a limited API key and monitor network traffic/requests if you have concerns; (5) if you require higher assurance, ask the publisher for a signed source or host the script yourself after review and deploy a fork with your API endpoint/settings.
Capability Analysis
Type: OpenClaw Skill Name: ima-ai-music-song-voice-generator Version: 1.0.0 The IMA AI Music & Voice Generator skill bundle is a well-structured and transparent tool for generating music via the IMA Studio API. The core logic in `scripts/ima_voice_create.py` implements a legitimate task-polling workflow with a sophisticated 'reflection' mechanism for automatic error recovery and parameter adjustment. The bundle includes detailed security and privacy documentation in `SKILL-DETAIL.md`, explicitly declaring its network usage (api.imastudio.com) and local file system access (~/.openclaw/memory/ima_prefs.json for user preferences). No evidence of data exfiltration, malicious execution, or harmful prompt injection was found.
Capability Assessment
Purpose & Capability
Name/description, required binary (python3), and single required env var (IMA_API_KEY) align with a music/voice generation skill that calls an external IMA API. No unrelated credentials or binaries are requested.
Instruction Scope
Runtime instructions require reading SKILL-DETAIL.md and optionally knowledge files under ~/.openclaw/skills/ima-knowledge-ai/references/*. That is document/UX guidance for correct use; it does not instruct reading arbitrary user files. The SKILL.md mandates reading the detail file before first use (reasonable for correct API payload construction), but that gives the agent full visibility into the skill docs — verify you trust the skill source.
Install Mechanism
No install spec (instruction-only install). Bundled code is plain Python and depends on requests (declared). No downloads from untrusted URLs, no archive extraction, and network calls point to api.imastudio.com as expected.
Credentials
Only IMA_API_KEY is required and declared as the primary credential — appropriate for an API-backed music service. The skill writes small local prefs/logs (~1KB and log files) as declared; consider that logs could potentially record operational details (errors/attributes). The skill states it will not include API keys in prompts.
Persistence & Privilege
The skill persists only to its own prefs file (~/.openclaw/memory/ima_prefs.json) and logs under ~/.openclaw/logs/ima_skills/, as declared. always:false and no evidence it modifies other skills or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ima-ai-music-song-voice-generator
  3. After installation, invoke the skill by name or use /ima-ai-music-song-voice-generator
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release: SDO-optimized music and voice generation skill
Metadata
Slug ima-ai-music-song-voice-generator
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition?

AI music generator and voice generator with Suno sonic v5, DouBao BGM, and DouBao Song. Generate AI music, songs with lyrics, background music, soundtracks,... It is an AI Agent Skill for Claude Code / OpenClaw, with 189 downloads so far.

How do I install IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition?

Run "/install ima-ai-music-song-voice-generator" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition free?

Yes, IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition support?

IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created IMA AI Music & Voice Generator — Song, BGM, Background Soundtrack, Jingle, Lyrics, Beat Maker, Voiceover, Narration & Composition?

It is built and maintained by Dai Shuo (@dai-shuo); the current version is v1.0.0.

💬 Comments