dlazyai

Dlazy Fun Asr

by dlazy · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ Security Clean
Downloads: 43 · Stars: 0 · Active Installs: 1 · Versions: 1
Install in OpenClaw
/install dlazy-fun-asr
README (SKILL.md)

dlazy-fun-asr


Alibaba Bailian Fun-ASR recording transcription. Supports Chinese, English and other languages, with auto language detection and speaker diarization. Suitable for subtitles, transcription, and meeting notes.

Trigger Keywords

  • fun-asr

Authentication

All requests require a dLazy API key. The recommended way to authenticate is:

dlazy login

This runs a device-code flow (also works in remote shells) and automatically saves your API key to the local CLI config — no manual copy/paste required.

Alternative: Set the Key Manually

If you already have an API key, you can save it directly:

dlazy auth set YOUR_API_KEY

The CLI saves the key in your user config directory (~/.dlazy/config.json on macOS/Linux, %USERPROFILE%\.dlazy\config.json on Windows), with file permissions restricted to your OS user account. You can also supply the key per-invocation via the DLAZY_API_KEY environment variable.
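
A minimal sketch of how a wrapper script might check which credential source is available before invoking the CLI. The helper name `credential_source` is hypothetical, and the precedence (environment variable over saved config) is an assumption that the text above does not state. It only checks that the config file exists and does not read it, since the file's schema is not documented here.

```python
import os
from pathlib import Path
from typing import Mapping, Optional


def credential_source(env: Mapping[str, str], home: Path) -> Optional[str]:
    """Return "env", "config", or None depending on where a key could come from.

    Assumption: DLAZY_API_KEY takes precedence over the saved config file.
    """
    if env.get("DLAZY_API_KEY"):
        return "env"
    # Same relative location under ~ (macOS/Linux) and %USERPROFILE% (Windows).
    if (home / ".dlazy" / "config.json").is_file():
        return "config"
    return None


if __name__ == "__main__":
    source = credential_source(os.environ, Path.home())
    print(source or "no credentials found; run `dlazy login`")
```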

Getting Your API Key Manually

  1. Sign in or create an account at dlazy.com
  2. Go to dlazy.com/dashboard/organization/api-key
  3. Copy the key shown in the API Key section

Each key is scoped to your dLazy organization and can be rotated or revoked at any time from the same dashboard.

About & Provenance

You can install on demand without persisting a global binary by running:

npx @dlazy/[email protected] <command>

Or, if you prefer a global install, the skill's metadata.clawdbot.install field declares the exact pinned version (npm install -g @dlazy/[email protected]). Review the GitHub source before installing.

How It Works

This skill is a thin client over the dLazy hosted API. When you invoke it:

  • Prompts and parameters you provide are sent to the dLazy API endpoint (api.dlazy.com) for inference.
  • Any local file paths you pass to image / video / audio fields are uploaded to dLazy's media storage (files.dlazy.com) so the model can read them — the same flow as any cloud-based generation API.
  • Generated output URLs returned by the API are hosted on files.dlazy.com.

This is the standard SaaS pattern; the skill itself does not access network or filesystem resources beyond what the dLazy CLI already handles. See dlazy.com for the full service terms.
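
Because local paths are uploaded while URLs are passed through, a caller may want to know in advance which case applies, for example to warn before sending confidential audio. The sketch below is one way to do that; `will_be_uploaded` is a hypothetical helper, and treating everything that is not an http(s) URL as a local path is an assumption, not the CLI's documented detection logic.

```python
from urllib.parse import urlparse


def will_be_uploaded(audio: str) -> bool:
    """True when `audio` looks like a local path rather than an http(s) URL.

    Assumption: only http:// and https:// values are treated as remote.
    """
    return urlparse(audio).scheme.lower() not in ("http", "https")


if __name__ == "__main__":
    for candidate in ("https://files.dlazy.com/a.mp3", "./meeting.wav"):
        action = "upload" if will_be_uploaded(candidate) else "pass through"
        print(f"{candidate}: {action}")
```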

Usage

CRITICAL INSTRUCTION FOR AGENT: Execute dlazy fun-asr to get the result.

dlazy fun-asr -h

Options:
  --audio_url [audio_url]              Audio URL [audio: url or local path]
  --language_code [language_code]      Language [default: zh] (choices: "zh", "en")
  --diarize [diarize]                  Diarize Speakers [default: false]
  --tag_audio_events [tag_audio_events] tag_audio_events [default: false]
  --num_speakers [num_speakers]        num_speakers
  --dry-run                            Print payload + cost estimate without calling API
  --no-wait                            Return generateId immediately for async tasks
  --timeout <seconds>                  Max seconds to wait for async completion (default: "1800")
  -h, --help                           display help for command

Any flag also accepts pipe references — - (auto-pick from upstream stdin), @N (n-th output), @N.path (jsonpath into output), @* (all primary values), @stdin / @stdin:path (whole envelope). See dlazy --help for details.
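
The options above can be assembled programmatically. The following is a sketch, not the CLI's own code: `build_fun_asr_command` is a hypothetical helper, and passing the literal `true` to the optional-value `--diarize` flag is an assumption about how the CLI parses booleans. Running with `--dry-run` first is a cheap way to confirm the payload.

```python
from typing import List, Optional


def build_fun_asr_command(
    audio: str,
    language_code: str = "zh",
    diarize: bool = False,
    num_speakers: Optional[int] = None,
    dry_run: bool = False,
) -> List[str]:
    """Build an argv list for `dlazy fun-asr` from the documented options."""
    if language_code not in ("zh", "en"):
        raise ValueError("language_code must be 'zh' or 'en'")
    argv = ["dlazy", "fun-asr", "--audio_url", audio,
            "--language_code", language_code]
    if diarize:
        # Assumption: the optional-value flag accepts the literal "true".
        argv += ["--diarize", "true"]
    if num_speakers is not None:
        argv += ["--num_speakers", str(num_speakers)]
    if dry_run:
        argv.append("--dry-run")
    return argv


if __name__ == "__main__":
    print(" ".join(build_fun_asr_command("meeting.wav", "en", diarize=True, dry_run=True)))
```

The returned list can be handed to `subprocess.run(argv, capture_output=True, text=True)` without shell quoting concerns.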

Output Format

{
  "ok": true,
  "result": {
    "tool": "fun-asr",
    "modelId": "fun-asr",
    "outputs": [
      {
        "type": "json",
        "id": "o_xxxxxxxx",
        "value": {}
      }
    ]
  }
}

Async tasks (when --no-wait is passed) return outputs: [] and a task: { generateId, status } field instead. Use dlazy status <generateId> --wait to poll.
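
A caller consuming this envelope needs to handle both the synchronous and the async shape. The sketch below shows one way; `parse_envelope` is a hypothetical helper, and the exact location of the `task` object (inside `result` or at the top level) is an assumption, so both are checked.

```python
import json
from typing import Any, Dict, List, Optional, Tuple


def parse_envelope(raw: str) -> Tuple[List[Any], Optional[str]]:
    """Return (output values, generateId) from a `dlazy fun-asr` JSON envelope."""
    envelope: Dict[str, Any] = json.loads(raw)
    if not envelope.get("ok"):
        # Surface the error code from a failed envelope.
        raise RuntimeError(envelope.get("code", "unknown_error"))
    result = envelope.get("result", {})
    values = [out.get("value") for out in result.get("outputs", [])]
    # Assumption: the async task sits inside `result`; fall back to top level.
    task = result.get("task") or envelope.get("task") or {}
    return values, task.get("generateId")


if __name__ == "__main__":
    sample = '{"ok": true, "result": {"outputs": [{"type": "json", "id": "o_1", "value": {}}]}}'
    values, generate_id = parse_envelope(sample)
    print(values, generate_id)
```

When the second element is not `None`, the task is still running and the ID can be passed to `dlazy status <generateId> --wait`.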

Examples

dlazy fun-asr --audio_url ./recording.mp3 --language_code en

Error Handling

Code | Error Type | Example Message
401 | Unauthorized (No API Key) | ok: false, code: "unauthorized", message: "API key is missing or invalid"
501 | Missing required parameter | error: required option '--prompt <prompt>' not specified
502 | Local file read error | Error: Image file/Video file not found: C:\path\to\your\file
503 | API request failed (no balance) | ok: false, code: "insufficient_balance"
503 | API request failed (server error) | HTTP status code error (500 server crash)
504 | Asynchronous task execution failed | === Generation Failed === / {Specific error reason returned by backend, for example "Prompt violates safety policy"}

AGENT CRITICAL INSTRUCTION:

  1. If the execution result returns code: "insufficient_balance", you MUST explicitly inform the user that their credits are insufficient and guide them to recharge: https://dlazy.com/dashboard/organization/settings?tab=credits
  2. If the execution result returns code: "unauthorized" or indicates a missing API key, you MUST tell the user to get their API key from https://dlazy.com/dashboard/organization/api-key, save it using dlazy auth set <key>, and then resume the task.
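
The two rules above can be expressed as a small lookup from error code to user-facing guidance. This is a sketch under the assumption that failures arrive as a top-level `{ok: false, code, message}` object, as in the example messages; `guidance_for` is a hypothetical helper and the wording of the messages is illustrative.

```python
from typing import Any, Dict, Optional

CREDITS_URL = "https://dlazy.com/dashboard/organization/settings?tab=credits"
API_KEY_URL = "https://dlazy.com/dashboard/organization/api-key"


def guidance_for(envelope: Dict[str, Any]) -> Optional[str]:
    """Return a message for the user, or None when the call succeeded."""
    if envelope.get("ok"):
        return None
    code = envelope.get("code")
    if code == "insufficient_balance":
        return f"Your dLazy credits are insufficient. Recharge at {CREDITS_URL}"
    if code == "unauthorized":
        return (
            f"Get an API key from {API_KEY_URL}, save it with "
            "`dlazy auth set <key>`, then retry."
        )
    # Fall back to whatever the API reported.
    return envelope.get("message") or f"Request failed: {code}"


if __name__ == "__main__":
    print(guidance_for({"ok": False, "code": "insufficient_balance"}))
```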

Tips

Visit https://dlazy.com for more information.

Usage Guidance
Before installing, confirm you trust the dLazy CLI package and service. Prefer the npx pinned version for one-off use if you do not want a global install, and do not pass confidential audio unless you are comfortable uploading it to dLazy's cloud service. Rotate or revoke the API key from the dLazy dashboard if needed.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The stated purpose is Fun-ASR audio transcription with language detection and diarization, and the instructions align with invoking the dLazy CLI for that task.
Instruction Scope
The agent instructions are limited to running the relevant CLI command and handling authentication or balance errors; I found no role changes, hidden override behavior, or unrelated commands.
Install Mechanism
The skill declares a pinned npm global install for @dlazy/[email protected] and an npx alternative. The external package contents were not present locally, and network restrictions prevented npm metadata retrieval, but the install path is disclosed.
Credentials
The skill requires a dLazy API key and sends prompts, parameters, and user-provided local audio paths to dLazy endpoints. That is sensitive but proportionate and clearly disclosed for a cloud transcription service.
Persistence & Privilege
The documented persistence is local API-key storage at ~/.dlazy/config.json or an environment variable. I found no background service, privilege escalation, destructive action, or unbounded local indexing in the artifacts.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install dlazy-fun-asr
  3. After installation, invoke the skill by name or use /dlazy-fun-asr
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
- Improved documentation with detailed usage, authentication steps, and error handling instructions.
- Added support details for auto language detection and speaker diarization.
- Included examples for CLI usage and output formats.
- Specified version pinning for the @dlazy/cli dependency.
- Enhanced user guidance for troubleshooting common authentication and balance errors.
Metadata
Slug dlazy-fun-asr
Version 1.0.0
License MIT-0
All-time Installs 1
Active Installs 1
Total Versions 1
Frequently Asked Questions

What is Dlazy Fun Asr?

Dlazy Fun Asr provides Alibaba Bailian Fun-ASR recording transcription. It supports Chinese, English and other languages, with auto language detection and speaker diarization, and is suitable for subtitles, transcription, and meeting notes. It is an AI Agent Skill for Claude Code / OpenClaw, with 43 downloads so far.

How do I install Dlazy Fun Asr?

Run "/install dlazy-fun-asr" in the OpenClaw or Claude Code chat to install it in one step. Before the first transcription, authenticate with dlazy login or save an API key with dlazy auth set.

Is Dlazy Fun Asr free?

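The skill itself is free and licensed under MIT-0, so you can download and install it at no cost. Transcription runs on the dLazy hosted API, which requires a dLazy API key and consumes credits from your dLazy organization.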

Which platforms does Dlazy Fun Asr support?

Dlazy Fun Asr is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created Dlazy Fun Asr?

It is built and maintained by dlazy (@dlazyai); the current version is v1.0.0.
