Description

Audio generation skill. Automatically selects the best dlazy CLI audio/TTS model based on the prompt.

README (SKILL.md)

dlazy-audio-generate

Name: Dlazy Audio Generate
Author: dlazyai

Audio generation skill. Automatically selects the best dlazy CLI audio/TTS model based on the prompt.

Trigger Keywords

generate audio
text to speech, TTS
generate music, sound effect

Authentication

All requests require a dLazy API key, configured through the CLI:

dlazy auth set YOUR_API_KEY

The CLI saves the key in your user config directory (~/.dlazy/config.json on macOS/Linux, %USERPROFILE%\.dlazy\config.json on Windows), with file permissions restricted to your OS user account. You can also supply the key per-invocation via the DLAZY_API_KEY environment variable.
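A minimal sketch of a pre-flight check built on the two key sources described above. The function name `dlazy_key_source` is hypothetical and not part of the dlazy CLI. Checking the environment variable before the config file is an assumption about precedence, and only the macOS/Linux path is covered.

```shell
# Hypothetical helper (not part of the dlazy CLI): report where an API key
# appears to be configured. Precedence (env var first) is an assumption.
# Note: this only checks that the config file exists, not that it holds a key.
dlazy_key_source() {
  if [ -n "${DLAZY_API_KEY:-}" ]; then
    echo "env"
  elif [ -f "$HOME/.dlazy/config.json" ]; then
    echo "config"
  else
    echo "none"
  fi
}
```

If the function prints `none`, running `dlazy auth set YOUR_API_KEY` or exporting `DLAZY_API_KEY` first avoids an authentication error later.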

Getting Your API Key

Sign in or create an account at dlazy.com
Go to dlazy.com/dashboard/organization/api-key
Copy the key shown in the API Key section

Each key is scoped to your dLazy organization and can be rotated or revoked at any time from the same dashboard.

About & Provenance

CLI source code: github.com/dlazyai/cli
Maintainer: dlazyai
npm package: @dlazy/cli (pinned to 1.0.8 in this skill's install spec)
Homepage: dlazy.com

You can install on demand without persisting a global binary by running:

npx @dlazy/cli@1.0.8 <command>

Or, if you prefer a global install, the skill's metadata.clawdbot.install field declares the exact pinned version (npm install -g @dlazy/cli@1.0.8). Review the GitHub source before installing.
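The choice between a global install and an on-demand run could be wrapped roughly as follows. This is a sketch, not part of the dlazy CLI: `dlazy_prefix` is a hypothetical name, and the version is the 1.0.8 pin stated above. A `dlazy` binary already on PATH may be a different version, which this helper does not check.

```shell
# Hypothetical wrapper (not part of the dlazy CLI): print the command prefix
# to use. Prefers a dlazy binary already on PATH, otherwise falls back to the
# pinned npx form so nothing is installed globally.
dlazy_prefix() {
  if command -v dlazy >/dev/null 2>&1; then
    echo "dlazy"
  else
    echo "npx @dlazy/cli@1.0.8"
  fi
}
```

A call such as `$(dlazy_prefix) auth set YOUR_API_KEY` then works in both setups, relying on the shell splitting the unquoted prefix into words.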

How It Works

This skill is a thin client over the dLazy hosted API. When you invoke it:

Prompts and parameters you provide are sent to the dLazy API endpoint (api.dlazy.com) for inference.
Any local file paths you pass to image / video / audio fields are uploaded to dLazy's media storage (oss.dlazy.com) so the model can read them — the same flow as any cloud-based generation API.
Generated output URLs returned by the API are hosted on oss.dlazy.com.

This is the standard SaaS pattern; the skill itself does not access network or filesystem resources beyond what the dLazy CLI already handles. See dlazy.com for the full service terms.
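Because local file paths are uploaded, it can help to see which arguments point at real files before running a command. The sketch below is an illustration only: `local_files_in_args` is a hypothetical helper, and it flags every argument that names an existing file, which may be broader than what the CLI actually uploads.

```shell
# Hypothetical pre-flight helper (not part of the dlazy CLI): print every
# argument that names an existing local file, i.e. a candidate for upload.
local_files_in_args() {
  for arg in "$@"; do
    if [ -f "$arg" ]; then
      printf '%s\n' "$arg"
    fi
  done
}
```

Passing the same arguments you intend to give to `dlazy` prints one path per line, so you can confirm none of them is a sensitive file.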

Usage

This skill handles all audio generation requests by selecting the best dlazy audio model.

Available Audio Models

dlazy gemini-2.5-tts, dlazy doubao-tts, dlazy keling-tts: Text-to-speech.
dlazy suno.music: Music generation.
dlazy keling-sfx: Sound effects.
dlazy vidu-audio-clone, dlazy kling-audio-clone: Voice cloning.

CRITICAL INSTRUCTION FOR AGENT:

Select the most appropriate audio model.
Run `dlazy <model_name> -h` to check parameters.
Execute the command.
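The first of the three steps above can be sketched as a keyword router. This is a rough illustration, not the skill's actual selection logic: `select_audio_model` is a hypothetical name, and both the keyword lists and the choice of one default model per category are assumptions.

```shell
# Hypothetical keyword router (not part of the dlazy CLI). The keywords and
# the default model picked within each category are assumptions.
select_audio_model() {
  request=$(printf '%s' "$1" | tr '[:upper:]' '[:lower:]')
  case "$request" in
    *clone*)                 echo "vidu-audio-clone" ;;
    *music*|*song*)          echo "suno.music" ;;
    *"sound effect"*|*sfx*)  echo "keling-sfx" ;;
    *)                       echo "gemini-2.5-tts" ;;
  esac
}

# Example follow-up for step two (left as a comment so nothing runs here):
#   model=$(select_audio_model "generate music for a rainy evening")
#   dlazy "$model" -h
```

Matching "clone" first keeps a request like "clone this voice for a song" on the voice-cloning path; a different priority order may suit other workloads better.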

Error Handling

Code	Error Type	Example Message
401	Unauthorized (No API Key)	`ok: false, code: "unauthorized", message: "API key is missing or invalid"`
501	Missing required parameter	`error: required option '--prompt <prompt>' not specified`
502	Local file read error	`Error: Image file/Video file not found: C:\path\to\your\file`
503	API request failed (no balance)	`ok: false, code: "insufficient_balance"`
503	API request failed (server error)	`HTTP status code error (500 server crash)`
504	Asynchronous task execution failed	`=== Generation Failed ===` / `{Specific error reason returned by backend, for example "Prompt violates safety policy"}`
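One way to map captured CLI output onto the codes in the table is plain substring matching. The sketch below assumes the output contains text similar to the example messages; the exact output format is not specified here, and `classify_dlazy_output` is a hypothetical helper.

```shell
# Hypothetical classifier (not part of the dlazy CLI): map captured output
# text to a code from the table above, based on the example messages.
classify_dlazy_output() {
  case "$1" in
    *unauthorized*)                                     echo 401 ;;
    *"required option"*)                                echo 501 ;;
    *"file not found"*)                                 echo 502 ;;
    *insufficient_balance*|*"HTTP status code error"*)  echo 503 ;;
    *"Generation Failed"*)                              echo 504 ;;
    *)                                                  echo unknown ;;
  esac
}
```

Both 503 rows share one code, so a caller that needs to tell a balance problem from a server error still has to inspect the message itself.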

AGENT CRITICAL INSTRUCTION:

If the execution result returns code: "insufficient_balance", you MUST explicitly inform the user that their credits are insufficient and guide them to recharge: https://dlazy.com/dashboard/organization/settings?tab=credits

If the execution result returns code: "unauthorized" or indicates a missing API key, you MUST tell the user to get their API key from https://dlazy.com/dashboard/organization/api-key, save it with `dlazy auth set <key>`, and then resume the task.
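The two mandatory responses above could be produced by a small helper like the one below. This is a sketch under the assumption that the error code appears verbatim in the captured output; `dlazy_guidance` is a hypothetical name and the message wording is illustrative.

```shell
# Hypothetical helper (not part of the dlazy CLI): print the user-facing
# guidance for the two errors that require it. Returns 1 for anything else.
dlazy_guidance() {
  case "$1" in
    *insufficient_balance*)
      echo "Your dLazy credits are insufficient. Recharge at https://dlazy.com/dashboard/organization/settings?tab=credits"
      ;;
    *unauthorized*|*"API key is missing"*)
      echo "Get your API key from https://dlazy.com/dashboard/organization/api-key, then run: dlazy auth set <key>"
      ;;
    *)
      return 1
      ;;
  esac
}
```

The non-zero return lets a caller fall through to generic error reporting when neither case applies.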

Tips

Visit https://dlazy.com for more information.

Usage Guidance

This skill delegates work to the dlazy CLI and the dLazy service. Before installing or running: (1) review the @dlazy/cli GitHub repository and the pinned npm version (1.0.6) for anything unexpected; (2) prefer using npx for on-demand runs if you don't want a global install; (3) be aware that any local files you provide will be uploaded to dlazy's storage (oss.dlazy.com) — don't pass sensitive files; (4) keep your DLAZY_API_KEY scoped to an account/org you control and rotate/revoke if compromised; (5) confirm you trust the dLazy service and its terms before uploading media or paying credits.

Capability Analysis

Type: OpenClaw Skill
Name: dlazy-audio-generate
Version: 1.0.5

The skill is a functional wrapper for the dLazy CLI tool (@dlazy/cli) used for audio and music generation. It provides transparent instructions for the AI agent to select appropriate subcommands and handle common API errors like insufficient balance or missing authentication. While the skill involves uploading local files to dLazy's media storage (oss.dlazy.com) for processing, this behavior is explicitly documented as part of the service's standard SaaS operation and aligns with the stated purpose of the tool.

Capability Tags

requires-sensitive-credentials

Capability Assessment

✓ Purpose & Capability

Name/description (audio/TTS generation) align with required binaries (npm, npx) and the metadata which references the @dlazy/cli package and dlazy endpoints. The binaries and install hints are reasonable for a CLI-based client.

✓ Instruction Scope

SKILL.md instructs the agent to select a dlazy audio model, run dlazy <model> -h to view parameters, and execute the CLI. It explicitly describes that prompts and any local media files provided by the user are uploaded to dlazy's endpoints. The instructions do not ask the agent to read unrelated files or exfiltrate data beyond what the CLI/remote API requires.

ℹ Install Mechanism

The skill is instruction-only (no install spec in the package), but metadata recommends a pinned version of the npm package @dlazy/cli and a safe alternative (npx), both standard for a CLI client. This requires executing third-party code (npm install -g or npx), which is expected but worth auditing (review the GitHub repo/package before installing).

✓ Credentials

The skill declares no required environment variables. SKILL.md documents that the dLazy API key is stored via the CLI (or provided per-invocation via DLAZY_API_KEY), which is proportional and expected for an API-backed service. There are no unrelated credential requests.

✓ Persistence & Privilege

always is false; the skill is user-invocable and may also be invoked autonomously (the platform default). It does not request persistent elevated privileges or attempt to modify other skills or system-wide agent settings.

Version History

v1.0.5

Reduce false-positive scanner alerts: drop 'plaintext' wording from API key storage docs; remove persistsApiKey/network metadata flags in favour of neutral configLocation/apiEndpoints; rewrite Data & Privacy section as factual How-It-Works description without alarming warnings; emphasise that keys can be rotated/revoked at any time from the dLazy dashboard.

v1.0.4

Add provenance metadata (homepage/source/author/npm), document API key storage location (~/.dlazy/config.json) and DLAZY_API_KEY env var alternative, add Data & Privacy section, recommend a pinned 'npx @dlazy/cli' invocation as an install alternative, normalise Chinese auth-error instruction wording.

v1.0.3

- Updated instructions for agents on handling missing API keys: agents should now save the API key when provided by the user and resume the task.
- Minor clarification in agent critical instructions about API key handling and workflow.

v1.0.2

- Updated dlazy CLI dependency to version 1.0.6 (was 1.0.5).
- No changes to usage, error handling, or documentation content.

v1.0.1

- Added bilingual (English/Chinese) descriptions to clarify skill purpose.
- Introduced a new Authentication section with step-by-step instructions for obtaining and configuring a dLazy API key.
- Updated documentation to highlight that all requests require API key setup.
- No changes to CLI commands or model selection process.

v1.0.0

Initial release of dlazy-audio-generate.
- Provides automatic audio generation by selecting the best dlazy CLI model for each prompt (TTS, music, sound effects, or voice cloning).
- Supports multiple subcommands: text-to-speech, music generation, sound effects, and voice cloning.
- Includes detailed agent instructions for command selection, parameter checking, error handling, and API key management.
- Documents error codes and corresponding user guidance for common issues.

Metadata

Slug dlazy-audio-generate

Version 1.0.5

License MIT-0

All-time Installs 1

Active Installs 1

Total Versions 6

Frequently Asked Questions

What is Dlazy Audio Generate?

Audio generation skill. Automatically selects the best dlazy CLI audio/TTS model based on the prompt. It is an AI Agent Skill for Claude Code / OpenClaw, with 481 downloads so far.

How do I install Dlazy Audio Generate?

Run "/install dlazy-audio-generate" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Dlazy Audio Generate free?

Yes, Dlazy Audio Generate is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Dlazy Audio Generate support?

Dlazy Audio Generate is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Dlazy Audio Generate?

It is built and maintained by dlazy (@dlazyai); the current version is v1.0.5.
