dlazyai

Dlazy Gemini 2.5 Tts

Author: dlazy · GitHub · v1.0.6 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 603 · Favorites: 0 · Current installs: 1 · Versions: 7
Install in OpenClaw
/install dlazy-gemini-2-5-tts
Description
Generate multilingual, highly natural audio using Gemini 2.5 text-to-speech.
Usage instructions (SKILL.md)

dlazy-gemini-2.5-tts

Generate multilingual, highly natural audio using Gemini 2.5 text-to-speech.

Trigger Keywords

  • gemini tts
  • text to speech
  • generate speech

Authentication

All requests require a dLazy API key, configured through the CLI:

dlazy auth set YOUR_API_KEY

The CLI saves the key in your user config directory (~/.dlazy/config.json on macOS/Linux, %USERPROFILE%\.dlazy\config.json on Windows), with file permissions restricted to your OS user account. You can also supply the key per-invocation via the DLAZY_API_KEY environment variable.
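A script that wraps the CLI may want to check whether a key is configured before invoking it. The following is a minimal sketch, not the CLI's own logic: the helper name `credential_source` is hypothetical, and checking the environment variable before the config file is an assumption, since the text above does not state which source takes precedence.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def credential_source(env: Mapping[str, str], home: Path) -> Optional[str]:
    """Report where a dLazy API key appears to be configured.

    Returns "env" if DLAZY_API_KEY is set and non-empty, "config" if
    <home>/.dlazy/config.json exists, otherwise None.
    Checking the environment first is an assumption, not documented behavior.
    """
    if env.get("DLAZY_API_KEY", "").strip():
        return "env"
    # Same relative path on macOS/Linux (~) and Windows (%USERPROFILE%).
    if (home / ".dlazy" / "config.json").is_file():
        return "config"
    return None


if __name__ == "__main__":
    print(credential_source(os.environ, Path.home()))
```

The function only tests for the file's existence and never reads the key, so it does not depend on the config file's internal layout.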

Getting Your API Key

  1. Sign in or create an account at dlazy.com
  2. Go to dlazy.com/dashboard/organization/api-key
  3. Copy the key shown in the API Key section

Each key is scoped to your dLazy organization and can be rotated or revoked at any time from the same dashboard.

About & Provenance

You can install on demand without persisting a global binary by running:

npx @dlazy/cli@1.0.8 <command>

Or, if you prefer a global install, the skill's metadata.clawdbot.install field declares the exact pinned version (npm install -g @dlazy/cli@1.0.8). Review the GitHub source before installing.

How It Works

This skill is a thin client over the dLazy hosted API. When you invoke it:

  • Prompts and parameters you provide are sent to the dLazy API endpoint (api.dlazy.com) for inference.
  • Any local file paths you pass to image / video / audio fields are uploaded to dLazy's media storage (oss.dlazy.com) so the model can read them — the same flow as any cloud-based generation API.
  • Generated output URLs returned by the API are hosted on oss.dlazy.com.

This is the standard SaaS pattern; the skill itself does not access network or filesystem resources beyond what the dLazy CLI already handles. See dlazy.com for the full service terms.
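Because the description names exactly two hosts, a caller can treat them as an allow-list when handling URLs returned by the CLI. This is a sketch under that assumption; `EXPECTED_HOSTS` and `is_expected_host` are illustrative names, not part of the dLazy CLI.

```python
from urllib.parse import urlparse

# Hosts named in the skill description; treated here as an allow-list.
EXPECTED_HOSTS = {"api.dlazy.com", "oss.dlazy.com"}


def is_expected_host(url: str) -> bool:
    """Return True only for https URLs whose host is a documented dLazy host."""
    parsed = urlparse(url)
    return parsed.scheme == "https" and parsed.hostname in EXPECTED_HOSTS
```

Comparing the parsed hostname exactly, rather than using a substring match, rejects look-alike hosts such as oss.dlazy.com.example.org.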

Usage

CRITICAL INSTRUCTION FOR AGENT: Run the dlazy gemini-2.5-tts command to get results.

dlazy gemini-2.5-tts -h

Options:
  --prompt <prompt>                    Prompt
  --voice_language <voice_language>    Voice Language [default: cmn] (choices: "cmn", "en")
  --voiceName <voiceName>              Voice Name. Options depend on "voice_language".
                                       when voice_language="cmn": Zephyr (Zephyr - 明亮), Puck (Puck - 欢快), Charon (Charon - 信息丰富), Kore (Kore - 坚定), Fenrir (Fenrir - 兴奋), Leda (Leda - 青春), Orus (Orus - 公正), Aoede (Aoede - 清爽), Callirrhoe (Callirrhoe - 轻松), Autonoe (Autonoe - 明亮), Enceladus (Enceladus - 气声), Iapetus (Iapetus - 清晰), Umbriel (Umbriel - 轻松愉快), Algieba (Algieba - 平滑), Despina (Despina - 平滑), Erinome (Erinome - 清晰), Algenib (Algenib - 沙哑), Rasalgethi (Rasalgethi - 信息丰富), Laomedeia (Laomedeia - 欢快), Achernar (Achernar - 柔和), Alnilam (Alnilam - 坚定), Schedar (Schedar - 均匀), Gacrux (Gacrux - 成熟), Pulcherrima (Pulcherrima - 转折), Achird (Achird - 友好), Zubenelgenubi (Zubenelgenubi - 随意), Vindemiatrix (Vindemiatrix - 温和), Sadachbia (Sadachbia - 活泼), Sadaltager (Sadaltager - 知识渊博), Sulafat (Sulafat - 偏高);
                                       when voice_language="en": Zephyr (Zephyr - Bright), Puck (Puck - Cheerful), Charon (Charon - Informative), Kore (Kore - Firm), Fenrir (Fenrir - Excitable), Leda (Leda - Youthful), Orus (Orus - Just), Aoede (Aoede - Breezy), Callirrhoe (Callirrhoe - Relaxed), Autonoe (Autonoe - Bright), Enceladus (Enceladus - Breath), Iapetus (Iapetus - Clear), Umbriel (Umbriel - Light), Algieba (Algieba - Smooth), Despina (Despina - Smooth), Erinome (Erinome - Clear), Algenib (Algenib - Gravelly), Rasalgethi (Rasalgethi - Informative), Laomedeia (Laomedeia - Cheerful), Achernar (Achernar - Soft), Alnilam (Alnilam - Firm), Schedar (Schedar - Even), Gacrux (Gacrux - Mature), Pulcherrima (Pulcherrima - Turning), Achird (Achird - Friendly), Zubenelgenubi (Zubenelgenubi - Casual), Vindemiatrix (Vindemiatrix - Gentle), Sadachbia (Sadachbia - Lively), Sadaltager (Sadaltager - Scholarly), Sulafat (Sulafat - High)
                                       [default: Kore] (choices: "Zephyr", "Puck", "Charon", "Kore", "Fenrir", "Leda", "Orus", "Aoede", "Callirrhoe", "Autonoe", "Enceladus", "Iapetus", "Umbriel", "Algieba", "Despina", "Erinome", "Algenib", "Rasalgethi", "Laomedeia", "Achernar", "Alnilam", "Schedar", "Gacrux", "Pulcherrima", "Achird", "Zubenelgenubi", "Vindemiatrix", "Sadachbia", "Sadaltager", "Sulafat")
  --promptRefs <promptRefs...>         promptRefs [default: ]
  --input <spec>                       JSON payload: inline string, @file, or - (stdin)
  --dry-run                            Print payload + cost estimate without calling API
  --no-wait                            Return generateId immediately for async tasks
  --timeout <seconds>                  Max seconds to wait for async completion (default: "1800")
  -h, --help                           display help for command
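When calling the command from a script, validating the two voice options against the choices listed in the help text catches typos before any request is sent. The sketch below assembles an argument list; `build_tts_command` is a hypothetical helper, and it uses only flags that appear in the help output above.

```python
from typing import List

# Choices copied from the help output above.
VOICE_LANGUAGES = ("cmn", "en")
VOICE_NAMES = (
    "Zephyr", "Puck", "Charon", "Kore", "Fenrir", "Leda", "Orus", "Aoede",
    "Callirrhoe", "Autonoe", "Enceladus", "Iapetus", "Umbriel", "Algieba",
    "Despina", "Erinome", "Algenib", "Rasalgethi", "Laomedeia", "Achernar",
    "Alnilam", "Schedar", "Gacrux", "Pulcherrima", "Achird", "Zubenelgenubi",
    "Vindemiatrix", "Sadachbia", "Sadaltager", "Sulafat",
)


def build_tts_command(
    prompt: str,
    voice_language: str = "cmn",
    voice_name: str = "Kore",
    dry_run: bool = False,
) -> List[str]:
    """Build an argv list for `dlazy gemini-2.5-tts`, rejecting unknown choices."""
    if not prompt:
        raise ValueError("prompt must not be empty")
    if voice_language not in VOICE_LANGUAGES:
        raise ValueError(f"unknown voice_language: {voice_language!r}")
    if voice_name not in VOICE_NAMES:
        raise ValueError(f"unknown voiceName: {voice_name!r}")
    argv = [
        "dlazy", "gemini-2.5-tts",
        "--prompt", prompt,
        "--voice_language", voice_language,
        "--voiceName", voice_name,
    ]
    if dry_run:
        # Prints payload and cost estimate without calling the API.
        argv.append("--dry-run")
    return argv
```

The returned list can be passed to `subprocess.run(argv, capture_output=True, text=True)`; passing a list rather than a shell string avoids quoting problems when the prompt contains spaces or quotes.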

Output Format

{
  "ok": true,
  "kind": "urls",
  "data": {
    "urls": [
      "https://oss.dlazy.com/result.mp4"
    ]
  }
}
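A caller typically needs only the URL list from this result. The following sketch parses the shape shown above; `extract_urls` is a hypothetical helper, and it assumes the CLI's standard output contains exactly one JSON object of this form.

```python
import json
from typing import List


def extract_urls(stdout: str) -> List[str]:
    """Parse a dlazy result object and return its output URLs.

    Assumes `stdout` holds a single JSON object shaped like the example above.
    """
    result = json.loads(stdout)
    if result.get("ok") is not True:
        raise RuntimeError(
            f"dlazy reported failure: {result.get('code')}: {result.get('message')}"
        )
    if result.get("kind") != "urls":
        raise ValueError(f"unexpected result kind: {result.get('kind')!r}")
    return list(result["data"]["urls"])
```

Checking `ok` and `kind` before reading `data` keeps failure results, which carry `code` and `message` instead, from surfacing as a confusing KeyError.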

Command Examples

# basic call:
dlazy gemini-2.5-tts --prompt 'prompt text'

# complex call (English voice with an explicit voice name):
dlazy gemini-2.5-tts --prompt 'prompt text' --voice_language en --voiceName Puck

Error Handling

Code | Error Type | Example Message
401 | Unauthorized (No API Key) | ok: false, code: "unauthorized", message: "API key is missing or invalid"
501 | Missing required parameter | error: required option '--prompt <prompt>' not specified
502 | Local file read error | Error: Image file/Video file not found: C:\path\to\your\file
503 | API request failed (no balance) | ok: false, code: "insufficient_balance"
503 | API request failed (server error) | HTTP status code error (500 server crash)
504 | Asynchronous task execution failed | === Generation Failed === / {Specific error reason returned by backend, for example "Prompt violates safety policy"}

AGENT CRITICAL INSTRUCTION:

  1. If the execution result returns code: "insufficient_balance", you MUST explicitly inform the user that their credits are insufficient and guide them to recharge: https://dlazy.com/dashboard/organization/settings?tab=credits
  2. If the execution result returns code: "unauthorized" or indicates a missing API key, you MUST tell the user to get their API key from https://dlazy.com/dashboard/organization/api-key and save it using dlazy auth set <key>, then resume the task.
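The two rules above amount to a lookup from the result's code field to a user-facing message. A minimal sketch, assuming the failure result has already been parsed into a dict; `GUIDANCE` and `guidance_for` are illustrative names, and the message wording is only an example.

```python
from typing import Optional

# User-facing guidance for the two codes the instructions single out.
GUIDANCE = {
    "insufficient_balance": (
        "Your dLazy credits are insufficient. Recharge at "
        "https://dlazy.com/dashboard/organization/settings?tab=credits"
    ),
    "unauthorized": (
        "Your dLazy API key is missing or invalid. Get a key at "
        "https://dlazy.com/dashboard/organization/api-key and save it with "
        "`dlazy auth set <key>`, then retry."
    ),
}


def guidance_for(result: dict) -> Optional[str]:
    """Return the message to show the user, or None if no special rule applies."""
    if result.get("ok") is not False:
        return None
    return GUIDANCE.get(result.get("code"))
```

Other failures, such as server errors, return None here and are left to the caller's general error reporting.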

Tips

Visit https://dlazy.com for more information.

Security recommendations
This skill appears to be a straightforward client for the dLazy TTS API and needs your dLazy API key, which is expected. Take these precautions before installing or using it:
  1. Verify that the upstream GitHub repo and the published npm package (@dlazy/cli) match each other, and review the package source (the SKILL.md references the repo, but the registry metadata and the SKILL frontmatter show mismatched versions).
  2. Prefer on-demand npx invocation over a global npm install until you have inspected the package.
  3. Do not supply a long-lived organization key unless you trust the service; consider using a scoped or expirable key, and be ready to rotate or revoke it.
  4. Confirm that saving the key to ~/.dlazy/config.json with user-only permissions is acceptable for your environment.
  5. If you need higher assurance, audit the package code (or run it in an isolated environment) to verify that it only contacts api.dlazy.com and oss.dlazy.com and does not perform unexpected network or filesystem actions.
Functional analysis
Type: OpenClaw Skill
Name: dlazy-gemini-2-5-tts
Version: 1.0.6
The skill is a standard integration for the dLazy Gemini 2.5 text-to-speech service, utilizing the `@dlazy/cli` npm package. It transparently documents its operational model, including the storage of API keys in `~/.dlazy/config.json` and the necessity of uploading local media to `oss.dlazy.com` for cloud-based inference. The instructions provided in `SKILL.md` and `SKILL-cn.md` are intended to guide the AI agent in handling specific API error states (e.g., insufficient balance or unauthorized access) and do not contain malicious prompt injections or hidden commands. No evidence of data exfiltration or unauthorized system access was found.
Capability tags
requires-sensitive-credentials
Capability assessment
Purpose & Capability
The name and description (Gemini TTS) align with the required binaries (npm/npx) and the declared use of the @dlazy/cli npm package. However, the provenance metadata is inconsistent: the registry lists version 1.0.6, the SKILL.md frontmatter lists 1.0.3, and metadata.install pins @dlazy/cli@1.0.8. The skill's Source is also listed as 'unknown' despite links to github.com/dlazyai/cli. Verify the correct upstream package and repository before trusting installs.
Instruction Scope
Runtime instructions are narrowly scoped: they tell the agent to run the dLazy CLI, authenticate via dlazy auth set or DLAZY_API_KEY, and note that media files will be uploaded to oss.dlazy.com and API calls go to api.dlazy.com. The SKILL.md includes strong agent-facing directives (CRITICAL INSTRUCTION and guidance to instruct users to recharge or rotate keys) which are prescriptive but within the skill's domain. There is no instruction to read unrelated system files or exfiltrate unrelated credentials.
Install Mechanism
No install spec in the registry-level manifest, but the SKILL.md metadata suggests installing a pinned npm package (@dlazy/cli@1.0.8) or invoking via npx. Installing from npm is an expected mechanism for a CLI client and is traceable, but you should inspect the package and GitHub repo before npm install because the files shipped by the package will run on the host. The version discrepancy (1.0.3/1.0.6/1.0.8) increases risk and requires verification.
Credentials
The skill requires a dLazy API key (stored in ~/.dlazy/config.json or provided via DLAZY_API_KEY); that is proportionate for a hosted TTS service. It does not request unrelated environment variables or other credentials. Note the skill will cause the CLI to upload local media files when you pass local paths, so local file access is expected and justified.
Persistence & Privilege
always:false and default autonomous invocation are appropriate. The skill does not request persistent system-wide privileges or modifications to other skills. The CLI will save its own config under the user's home directory (expected behavior).
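How to use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install dlazy-gemini-2-5-tts
  3. Once installed, call the skill by name or trigger it with /dlazy-gemini-2-5-tts
  4. Provide the required inputs according to the skill's parameter description to get structured output
Version history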
v1.0.6
bump @dlazy/cli to 1.0.8
v1.0.5
Reduce false-positive scanner alerts: drop 'plaintext' wording from API key storage docs; remove persistsApiKey/network metadata flags in favour of neutral configLocation/apiEndpoints; rewrite Data & Privacy section as factual How-It-Works description without alarming warnings; emphasise that keys can be rotated/revoked at any time from the dLazy dashboard.
v1.0.4
Add provenance metadata (homepage/source/author/npm), document API key storage location (~/.dlazy/config.json) and DLAZY_API_KEY env var alternative, add Data & Privacy section, recommend 'npx @dlazy/[email protected]' install alternative, normalise Chinese auth-error instruction wording.
v1.0.3
  • Minor update to skill version 1.0.3.
  • Clarified agent instructions for handling missing or invalid API keys: the user should be told to provide the key and the agent should save it, instead of asking the user to send it and then save it.
  • No changes to core functionality or command options.
v1.0.2
  • Updated CLI dependency to @dlazy/cli version 1.0.6 (was 1.0.5).
  • No feature or behavior changes; documentation updated to reflect CLI version change.
v1.0.1
  • Added an explicit Authentication section with instructions to set and obtain the dLazy API key.
  • Clarified that all requests require API key configuration via CLI.
  • No changes to usage, options, output, or error handling.
v1.0.0
Initial release of dlazy-gemini-2.5-tts.
  • Generate multilingual, highly natural audio from text using Gemini 2.5 TTS.
  • Supports command-line options for language, voice, and prompt customization.
  • Handles errors such as missing API key, insufficient balance, and server failures with user guidance.
  • Provides example commands and detailed usage instructions.
  • Clear instructions for resolving authorization and credit issues.
Metadata
Slug dlazy-gemini-2-5-tts
Version 1.0.6
License MIT-0
Total installs 1
Current installs 1
Versions 7
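Frequently asked questions

What is Dlazy Gemini 2.5 Tts?

It generates multilingual, highly natural audio using Gemini 2.5 text-to-speech. It is an AI agent skill plugin for Claude Code / OpenClaw, with 603 total downloads to date.

How do I install Dlazy Gemini 2.5 Tts?

Run the command "/install dlazy-gemini-2-5-tts" in the OpenClaw or Claude Code chat box to install it in one step. Using the skill afterwards requires a dLazy API key, as described under Authentication.

Is Dlazy Gemini 2.5 Tts free?

The skill itself is free to download, install, and use under the MIT-0 license. Requests to the dLazy API draw on your dLazy credits, as the insufficient_balance error under Error Handling indicates.

Which platforms does Dlazy Gemini 2.5 Tts support?

Dlazy Gemini 2.5 Tts is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.

Who developed Dlazy Gemini 2.5 Tts?

It is developed and maintained by dlazy (@dlazyai). The current version is v1.0.6.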
