dlazyai

Dlazy Videoretalk

Author: dlazy · GitHub · v1.0.0 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 52
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install dlazy-videoretalk
Description
Tongyi VideoRetalk lip sync / lip-sync (mouth sync, dubbing) video model — takes a talking-person video plus a voice audio track and regenerates the video so...
Usage Instructions (SKILL.md)

dlazy-videoretalk


Tongyi VideoRetalk lip sync / lip-sync (mouth sync, dubbing) video model — takes a talking-person video plus a voice audio track and regenerates the video so the speaker's mouth/lips match the new audio. Use this for lip syncing a person video to new speech. Optionally provide a reference face image to pick the target person when the video contains multiple faces.

Trigger Keywords

  • videoretalk

Authentication

All requests require a dLazy API key. The recommended way to authenticate is:

dlazy login

This runs a device-code flow (also works in remote shells) and automatically saves your API key to the local CLI config — no manual copy/paste required.

Alternative: Set the Key Manually

If you already have an API key, you can save it directly:

dlazy auth set YOUR_API_KEY

The CLI saves the key in your user config directory (~/.dlazy/config.json on macOS/Linux, %USERPROFILE%\.dlazy\config.json on Windows), with file permissions restricted to your OS user account. You can also supply the key per-invocation via the DLAZY_API_KEY environment variable.
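
For scripted use, the per-invocation route can be sketched in Python as follows. This is a minimal illustration that assumes only what is stated above, namely that the CLI reads DLAZY_API_KEY from its environment. The helper name env_with_api_key is hypothetical and is not part of the dLazy CLI.

```python
import os


def env_with_api_key(api_key, base_env=None):
    """Return a copy of the environment with DLAZY_API_KEY set.

    The copy can be passed as `env=` to subprocess.run so the key applies to
    a single CLI invocation; this helper never writes the key to disk.
    """
    if not api_key or not api_key.strip():
        raise ValueError("API key must be a non-empty string")
    # Copy so the caller's (or the process's) environment is left untouched.
    env = dict(os.environ if base_env is None else base_env)
    env["DLAZY_API_KEY"] = api_key.strip()
    return env
```

A caller would then run something like subprocess.run(["dlazy", "videoretalk", "-h"], env=env_with_api_key(key)), which keeps the key out of the saved CLI config.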

Getting Your API Key Manually

  1. Sign in or create an account at dlazy.com
  2. Go to dlazy.com/dashboard/organization/api-key
  3. Copy the key shown in the API Key section

Each key is scoped to your dLazy organization and can be rotated or revoked at any time from the same dashboard.

About & Provenance

You can install on demand without persisting a global binary by running:

npx @dlazy/[email protected] <command>

Or, if you prefer a global install, the skill's metadata.clawdbot.install field declares the exact pinned version (npm install -g @dlazy/[email protected]). Review the GitHub source before installing.

How It Works

This skill is a thin client over the dLazy hosted API. When you invoke it:

  • Prompts and parameters you provide are sent to the dLazy API endpoint (api.dlazy.com) for inference.
  • Any local file paths you pass to image / video / audio fields are uploaded to dLazy's media storage (files.dlazy.com) so the model can read them — the same flow as any cloud-based generation API.
  • Generated output URLs returned by the API are hosted on files.dlazy.com.

This is the standard SaaS pattern; the skill itself does not access network or filesystem resources beyond what the dLazy CLI already handles. See dlazy.com for the full service terms.
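
Because local paths are uploaded while URLs are passed through, it can be useful to check beforehand which inputs would leave the machine. The sketch below is one way to do that in Python. How the CLI itself tells a path from a URL is not documented here, so the http(s) test and both function names are assumptions made for illustration.

```python
from urllib.parse import urlparse


def is_remote_url(value):
    """True when the value is an http(s) URL rather than a local path."""
    parsed = urlparse(value)
    return parsed.scheme in ("http", "https") and bool(parsed.netloc)


def local_inputs(fields):
    """Return the field names whose values look like local file paths.

    Empty values and pipe references such as "-" or "@1" are skipped,
    since they are neither URLs nor paths.
    """
    names = []
    for name, value in fields.items():
        if not value or value == "-" or value.startswith("@"):
            continue
        if not is_remote_url(value):
            names.append(name)
    return names
```

An agent could show the returned field names to the user before invoking the skill, so that any upload to files.dlazy.com is explicit.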

Usage

CRITICAL INSTRUCTION FOR AGENT: Execute dlazy videoretalk to get the result.

dlazy videoretalk -h

Options:
  --video_url [video_url]              Video URL [video: url or local path]
  --audio_url [audio_url]              Audio URL [audio: url or local path]
  --ref_image_url [ref_image_url]      Reference Face Image [image: url or local path]
  --video_extension [video_extension]  Extend Video to Audio Length [default: false] (choices: "true", "false")
  --query_face_threshold [query_face_threshold]  Face Match Threshold [default: 170] [only when ref_image_url non-empty]
  --dry-run                            Print payload + cost estimate without calling API
  --no-wait                            Return generateId immediately for async tasks
  --timeout <seconds>                  Max seconds to wait for async completion (default: "1800")
  -h, --help                           display help for command

Any flag also accepts pipe references — - (auto-pick from upstream stdin), @N (n-th output), @N.path (jsonpath into output), @* (all primary values), @stdin / @stdin:path (whole envelope). See dlazy --help for details.
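
As a rough illustration of how the flags above combine, the following Python sketch assembles an argument list for the command. It uses only flags shown in the help output. The function name, its parameter names, and the choice to pass "true" as the value of --video_extension are assumptions, not documented behavior.

```python
def build_videoretalk_args(video, audio, ref_image=None, extend_video=False,
                           face_threshold=None, dry_run=False):
    """Build an argv list for `dlazy videoretalk` from the documented flags."""
    if face_threshold is not None and not ref_image:
        # The help text says the threshold only applies with a reference image.
        raise ValueError("face_threshold requires ref_image")
    args = ["dlazy", "videoretalk", "--video_url", video, "--audio_url", audio]
    if ref_image:
        args += ["--ref_image_url", ref_image]
    if extend_video:
        args += ["--video_extension", "true"]
    if face_threshold is not None:
        args += ["--query_face_threshold", str(face_threshold)]
    if dry_run:
        # Prints the payload and cost estimate without calling the API.
        args.append("--dry-run")
    return args
```

The list can be handed to subprocess.run(args, capture_output=True, text=True). Starting with dry_run=True is a cheap way to inspect the payload before spending credits.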

Output Format

{
  "ok": true,
  "result": {
    "tool": "videoretalk",
    "modelId": "videoretalk",
    "outputs": [
      {
        "type": "image",
        "id": "o_xxxxxxxx",
        "url": "https://files.dlazy.com/result.png",
        "mimeType": "image/png"
      }
    ]
  }
}

Async tasks (when --no-wait is passed) return outputs: [] and a task: { generateId, status } field instead. Use dlazy status <generateId> --wait to poll.
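
The envelope above can be consumed along these lines. This is a hedged Python sketch: it assumes the CLI prints the JSON envelope alone on stdout and that the task object sits inside result (with a fallback to the top level), neither of which is confirmed by the text. The function names are hypothetical.

```python
import json


def summarize_result(stdout_text):
    """Classify a result envelope as done, pending, or error.

    Returns ("done", [urls]), ("pending", generate_id) or ("error", code).
    """
    envelope = json.loads(stdout_text)
    if not envelope.get("ok"):
        return ("error", envelope.get("code"))
    result = envelope.get("result", {})
    urls = [o["url"] for o in result.get("outputs", []) if "url" in o]
    if urls:
        return ("done", urls)
    # Assumption: the task object lives under "result"; try top level too.
    task = result.get("task") or envelope.get("task") or {}
    return ("pending", task.get("generateId"))


def status_command(generate_id):
    """argv for the polling command named in the text above."""
    return ["dlazy", "status", generate_id, "--wait"]
```

When the state is "pending", the generateId can be passed to status_command to wait for the final outputs.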

Examples

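dlazy videoretalk --video_url ./speaker.mp4 --audio_url ./new_voice.wav

(The file names are placeholders; each flag accepts either a URL or a local path.)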

Error Handling

Code | Error Type | Example Message
401 | Unauthorized (No API Key) | ok: false, code: "unauthorized", message: "API key is missing or invalid"
501 | Missing required parameter | error: required option '--prompt <prompt>' not specified
502 | Local file read error | Error: Image file/Video file not found: C:\path\to\your\file
503 | API request failed (no balance) | ok: false, code: "insufficient_balance"
503 | API request failed (server error) | HTTP status code error (500 server crash)
504 | Asynchronous task execution failed | === Generation Failed === / {Specific error reason returned by backend, for example "Prompt violates safety policy"}

AGENT CRITICAL INSTRUCTION:

  1. If the execution result returns code: "insufficient_balance", you MUST explicitly inform the user that their credits are insufficient and guide them to recharge: https://dlazy.com/dashboard/organization/settings?tab=credits
  2. If the execution result returns code: "unauthorized" or indicates missing API key, you MUST inform the user to get their API key from https://dlazy.com/dashboard/organization/api-key and save it using dlazy auth set <key> and resume the task.
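
The two rules above amount to a small lookup from error code to user guidance. A minimal Python sketch, assuming the failed envelope has already been parsed into a dict with the ok and code fields shown in the error table; the function name and message wording are illustrative only:

```python
CREDITS_URL = "https://dlazy.com/dashboard/organization/settings?tab=credits"
API_KEY_URL = "https://dlazy.com/dashboard/organization/api-key"


def guidance_for(envelope):
    """Return a user-facing message for the two error codes singled out
    above, or None when no special guidance applies."""
    if envelope.get("ok"):
        return None
    code = envelope.get("code")
    if code == "insufficient_balance":
        return f"Your dLazy credits are insufficient. Recharge at {CREDITS_URL}"
    if code == "unauthorized":
        return (f"Your API key is missing or invalid. Get one at {API_KEY_URL}, "
                "save it with `dlazy auth set <key>`, then retry the task.")
    return None
```

Other failures (for example a 504 generation error) return None here, so the caller can surface the backend's own message.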

Tips

Visit https://dlazy.com for more information.

Safe Usage Advice
Install only if you are comfortable using dLazy's third-party CLI and sending prompts plus any selected media files to dLazy's hosted service. Use the npx option if you do not want a global install, and rotate or revoke the API key if your trust or usage changes.
Capability Tags
requires-sensitive-credentials
Capability Assessment
Purpose & Capability
The purpose is lip-sync video generation with Tongyi VideoRetalk, and the documented behavior fits that purpose: parameters and selected media are sent to dLazy endpoints and generated results are returned as hosted URLs.
Instruction Scope
The runtime instruction only tells the agent to use the documented dLazy help command; the stronger operational guidance in the skill is about normal invocation, authentication errors, and balance errors.
Install Mechanism
Installation uses a third-party npm CLI pinned as @dlazy/[email protected], with an npx alternative disclosed for users who do not want a global install.
Credentials
Network access to api.dlazy.com and files.dlazy.com, plus optional upload of user-selected local media, is proportionate for a cloud video-generation service and is disclosed.
Persistence & Privilege
The CLI may persist an API key in ~/.dlazy/config.json and a global npm install may persist the binary, but these behaviors are documented and no hidden background execution or privilege escalation appears in the artifacts.
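How to Use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install dlazy-videoretalk
  3. Once installation finishes, invoke the Skill by name or trigger it with /dlazy-videoretalk
  4. Provide the required inputs as described in the Skill's parameter documentation to get structured output.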
Version History
v1.0.0
dlazy-videoretalk 1.1.1
  • Expanded documentation and usage instructions, including install steps, authentication, and error handling.
  • Added detailed CLI options, output format, and troubleshooting tips.
  • Clarified support for multiple faces with reference image selection.
  • Provided clear guidance for error cases (e.g., insufficient balance, missing API key).
  • Updated skill metadata and links to resources.
Metadata
Slug: dlazy-videoretalk
Version: 1.0.0
License: MIT-0
Total installs: 0
Current installs: 0
Version count: 1
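FAQ

What is Dlazy Videoretalk?

Tongyi VideoRetalk lip sync / lip-sync (mouth sync, dubbing) video model: takes a talking-person video plus a voice audio track and regenerates the video so... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 52 total downloads to date.

How do I install Dlazy Videoretalk?

Run the command "/install dlazy-videoretalk" in the OpenClaw or Claude Code chat box to install it in one step. A dLazy API key is still required before first use (see Authentication).

Is Dlazy Videoretalk free?

The skill itself is free: it is released under the MIT-0 license and can be downloaded, installed, and used freely. Calls to the hosted dLazy API draw on your organization's credits (see Error Handling).

Which platforms does Dlazy Videoretalk support?

Dlazy Videoretalk is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops Dlazy Videoretalk?

It is developed and maintained by dlazy (@dlazyai); the current version is v1.0.0.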
