← Back to Skills Marketplace

Ai Audio Generation

Name: Ai Audio Generation
Author: danielgwilson

by danielgwilson · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ✓ Security Clean

Downloads

Stars

Active Installs

Versions

Install in OpenClaw

/install ai-audio-generation

Description

AI audio generation for agents through Image Skill's zero-setup hosted creative runtime. Use when a prompt should become music, sound, or audio without provi...

README (SKILL.md)

AI Audio Generation

This is an intent-named Image Skill entry for agents searching for audio generation. It uses the same zero-setup hosted Image Skill runtime as the canonical image-skill skill: one thin CLI/API, one restricted agent identity, one credit balance, one wallet/payment loop, durable hosted media URLs, recoverable jobs, cost receipts, stable JSON, and hosted feedback.

Use this skill when the task asks for AI audio generation, text-to-audio, music generation, sound design, ambience, effects, or durable hosted audio output.

Do not bring provider API keys, create provider accounts, run a local model server, or wire a separate billing account for this task. Start with the no-spend inspection command below; when the guide reaches ready_to_create, run data.next_command only if media spend is allowed, otherwise run data.no_spend_next_command to verify safely. Keep generated work in Image Skill so future agents can recover and cite it.

First Command

npx -y image-skill@latest create --guide --model fal.stable-audio-25-text-to-audio --prompt "warm analog synth bed with soft tape hiss"

Main Runtime Command

npx -y image-skill@latest create --model fal.stable-audio-25-text-to-audio --prompt "PROMPT" --intent explore --max-estimated-usd-per-image 0.20

Install This Intent Skill

Prefer the GitHub slug so skills.sh can track the marketplace install:

npx skills add danielgwilson/image-skill-cli --skill ai-audio-generation -g -a codex -y

The canonical Image Skill entry remains available as:

npx skills add danielgwilson/image-skill-cli --skill image-skill -g -a codex -y

Shared Contract

All intent skills in this repo point to the same hosted contract:

Skill: https://image-skill.com/skill.md
LLM contract: https://image-skill.com/llms.txt
CLI contract: https://image-skill.com/cli.md
Hosted API: https://api.image-skill.com

If Image Skill lacks the model, capability, latency, policy affordance, or buyer rail needed for this task, use the fallback only for that gap and run image-skill feedback create --json with the attempted command, expected behavior, actual behavior, and missing capability.

Usage Guidance

Install only if you are comfortable sending prompts and generated audio jobs to Image Skill's hosted service and using its token/payment workflow. Review spending controls before running generation commands, especially because the examples use the latest npm CLI package and hosted media URLs.

Capability Tags

cryptorequires-walletrequires-oauth-tokenrequires-sensitive-credentials

Capability Assessment

✓ Purpose & Capability

The artifact's stated purpose is AI audio generation through Image Skill's hosted runtime, and its commands, hosted URLs, model selection, job recovery, receipts, and feedback flow match that purpose.

ℹ Instruction Scope

The skill directs agents to use npx image-skill@latest and a hosted API, including optional paid generation only when media spend is allowed; this is broad enough to warrant user awareness but is disclosed and task-aligned.

ℹ Install Mechanism

Installation uses npx skills add from the referenced GitHub repository and can install globally for Codex; this is disclosed in the skill and is consistent with marketplace-style skill installation.

✓ Credentials

No local executable scripts, local model servers, provider keys, or broad filesystem access are included in the artifact; the main external dependency is the hosted Image Skill service.

ℹ Persistence & Privilege

The metadata discloses an optional IMAGE_SKILL_TOKEN and the skill describes a restricted agent identity, credit balance, wallet/payment loop, and durable hosted media URLs; these are expected for the hosted generation workflow and not hidden.

How to Use

Make sure OpenClaw is installed (local or Docker)
Run the install command in chat: /install ai-audio-generation
After installation, invoke the skill by name or use /ai-audio-generation
Provide required inputs per the skill's parameter spec and get structured output

Version History

v1.0.0

Initial release of ai-audio-generation—zero-setup AI audio for agents - Launches an intent-named skill for AI audio generation (music, sound, ambience, effects) via Image Skill’s hosted runtime. - No provider keys, OAuth, local servers, or per-provider billing required. - Offers a no-spend guide, durable hosted URLs, recoverable jobs, unified payments, and stable JSON results. - Usage and install commands provided for both inspection and full generation flows. - Shares the same contract and canonical entry points as the main Image Skill.

Metadata

Slug ai-audio-generation

Version 1.0.0

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 1

Frequently Asked Questions

What is Ai Audio Generation?

AI audio generation for agents through Image Skill's zero-setup hosted creative runtime. Use when a prompt should become music, sound, or audio without provi... It is an AI Agent Skill for Claude Code / OpenClaw, with 43 downloads so far.

How do I install Ai Audio Generation?

Run "/install ai-audio-generation" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Ai Audio Generation free?

Yes, Ai Audio Generation is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Ai Audio Generation support?

Ai Audio Generation is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Ai Audio Generation?

It is built and maintained by danielgwilson (@danielgwilson); the current version is v1.0.0.

More Skills