← Back to Skills Marketplace
156554395

Doubao Image Video

by 156554395 · GitHub ↗ · v0.3.1 · MIT-0
cross-platform ⚠ suspicious
174
Downloads
1
Stars
0
Active Installs
3
Versions
Install in OpenClaw
/install doubao-image-video
Description
豆包图片与视频生成原生技能。适用于用户提到豆包、文生图、图生图、文生视频、图生视频、查询视频生成任务、等待任务完成或下载最终视频时,直接调用火山引擎 Ark 接口,不依赖外部 MCP 服务。
README (SKILL.md)

Doubao Native Media Skill

This is a native OpenClaw skill. Do not spin up the upstream MCP server unless the user explicitly asks for MCP compatibility.

Use this skill for

  • Doubao / 豆包 text-to-image
  • image-to-image or multi-reference image generation
  • Doubao text-to-video or image-to-video
  • querying an async Doubao video task by task_id
  • troubleshooting Volcengine Ark endpoint/model issues

Commands

Generate an image

python3 {baseDir}/scripts/doubao_media.py image \
  --prompt "A cinematic cyberpunk alley in rain" \
  --size 2560x1440

Generate a video

python3 {baseDir}/scripts/doubao_media.py video \
  --prompt "A panda astronaut waves on the moon" \
  --video-duration 5 \
  --fps 24 \
  --resolution 1080p

Query a video task

python3 {baseDir}/scripts/doubao_media.py task --task-id your-task-id

Wait for a video task and optionally download the result

python3 {baseDir}/scripts/doubao_media.py wait \
  --task-id your-task-id \
  --timeout 600 \
  --interval 5 \
  --download-to ./doubao-result.mp4

Input rules

  • Always prefer --endpoint-id when the user has a provisioned Volcengine Ark endpoint.
  • Fall back to model names only when endpoint ids are unavailable.
  • For video generation, this skill mirrors the upstream behavior and appends --dur, --fps, --rs, and --ratio to the prompt when they are not already present.
  • If the user supplies image URLs, pass them through exactly; do not download or re-host unless asked.

Troubleshooting

  • If neither --endpoint-id nor a default endpoint env var exists, the script falls back to the default model env var.
  • If the API returns InvalidEndpointOrModel.NotFound, ask the user to verify the Volcengine Ark endpoint authorization first.
  • Video generation is async. If generation succeeds, capture task_id and query it later with the task subcommand, or use wait for automatic polling.

References

  • Read references/api-notes.md when you need request shapes, defaults, or caveats.
Usage Guidance
This skill's code matches its description: it calls Volcengine Ark endpoints and requires a DOUBAO_API_KEY. Before installing: (1) ensure you are comfortable providing a dedicated Ark API key (prefer least-privilege, revokeable key) and that the key is stored securely; (2) confirm the registry metadata is corrected so the required env var is visible (the current bundle claims no required env vars but the script fails without DOUBAO_API_KEY); (3) review the GitHub source (homepage) to ensure no differences from the provided files; (4) be aware the skill can download generated videos to any path you specify — do not pass sensitive system paths; (5) if you need stronger assurance, ask the maintainer to sign/release the package or to explain why registry metadata omitted the API key requirement. If these checks look good, the skill is proportionate to its purpose.
Capability Analysis
Type: OpenClaw Skill Name: doubao-image-video Version: 0.3.1 The skill provides a legitimate interface for interacting with the Volcengine Ark (Doubao) API for image and video generation. The core logic in `scripts/doubao_media.py` uses standard Python libraries to perform API requests and handle file downloads, with no evidence of data exfiltration, malicious execution, or prompt injection. All behaviors are consistent with the stated purpose of the skill.
Capability Assessment
Purpose & Capability
The skill's name and description state it talks to Volcengine Ark for image/video generation and the script (scripts/doubao_media.py) implements exactly that behavior (image/video generation, task polling, optional download). Requiring an Ark API key is coherent with the stated purpose. However, the top-level registry metadata claims there are no required environment variables while the SKILL.md and script require DOUBAO_API_KEY; this metadata mismatch is inconsistent.
Instruction Scope
SKILL.md instructs running the included python script which only contacts the documented Ark endpoints, polls tasks, and optionally downloads resulting video files to a user-specified path. The instructions avoid reading unrelated local files or environment secrets beyond the declared DOUBAO_* variables and explicitly state not to download user-provided image URLs unless asked. Behavior is within the stated scope.
Install Mechanism
This is an instruction-only skill with one included Python script and no install spec. It requires python3 on PATH — nothing is downloaded or executed from arbitrary URLs. Risk from installation is low.
Credentials
The code requires a single sensitive env var DOUBAO_API_KEY to authenticate to Ark, which is appropriate for this purpose. However, the registry metadata provided with the skill lists 'Required env vars: none' while SKILL.md metadata and the script require DOUBAO_API_KEY (and define optional DOUBAO_* env vars). That discrepancy is a provenance/configuration issue that could lead to silent failures or surprising prompts; it should be resolved before trusting the skill. No unrelated credentials are requested.
Persistence & Privilege
The skill is not always-enabled and does not request elevated or persistent platform privileges. It does write downloaded video files only when the user specifies a --download-to path, which is expected behavior.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doubao-image-video
  3. After installation, invoke the skill by name or use /doubao-image-video
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v0.3.1
Improve Chinese description and trigger wording
v0.3.0
Add wait polling and video download flow
v0.2.0
Reworked as a native OpenClaw skill for Doubao image and video generation
Metadata
Slug doubao-image-video
Version 0.3.1
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 3
Frequently Asked Questions

What is Doubao Image Video?

豆包图片与视频生成原生技能。适用于用户提到豆包、文生图、图生图、文生视频、图生视频、查询视频生成任务、等待任务完成或下载最终视频时,直接调用火山引擎 Ark 接口,不依赖外部 MCP 服务。 It is an AI Agent Skill for Claude Code / OpenClaw, with 174 downloads so far.

How do I install Doubao Image Video?

Run "/install doubao-image-video" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Doubao Image Video free?

Yes, Doubao Image Video is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Doubao Image Video support?

Doubao Image Video is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Doubao Image Video?

It is built and maintained by 156554395 (@156554395); the current version is v0.3.1.

💬 Comments