← Back to Skills Marketplace
systiger

doubao-media

by systiger · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
149
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install doubao-media
Description
Doubao (Volcengine ARK) 文生图、文生视频工具 - 生成后自动发送到对话,无需手动查找文件。Text-to-image and text-to-video with auto-send to chat.
README (SKILL.md)

Doubao Media / 豆包媒体生成

中文: 豆包(字节跳动火山引擎ARK)文生图、文生视频工具。生成后自动发送到对话,无需手动查找文件。

English: Doubao (ByteDance Volcengine ARK) text-to-image and text-to-video tool. Auto-sends generated content to chat, no manual file search needed.


Features / 功能特点

Feature 中文 English
Text-to-Image ✅ 文生图 ✅ Generate images from text
Text-to-Video ✅ 文生视频 ✅ Generate videos from text
Image-to-Video ✅ 图生视频 ✅ Generate videos from images
Auto-Send to Chat ✅ 自动发送到对话 ✅ Auto-send to conversation
Windows Compatible ✅ Windows 兼容 ✅ Windows compatible

Prerequisites / 前置条件

Required / 必需

Install Dependencies / 安装依赖

pip install requests

Set API Key / 设置 API Key

# Windows PowerShell
$env:ARK_API_KEY="your_api_key_here"

# Linux/Mac
export ARK_API_KEY="your_api_key_here"

Usage / 使用方法

Generate Image / 生成图片

python scripts/doubao_media.py img "一只可爱的橘猫在阳光下睡觉"

Result / 结果:

  • Image saved to output/ directory / 图片保存到 output/ 目录
  • Auto-sent to chat / 自动发送到对话

Generate Video / 生成视频

# Sync mode (wait for completion) / 同步模式(等待完成)
python scripts/doubao_media.py vid "一只猫在草地上奔跑" --duration 5

# Async mode (return task ID) / 异步模式(返回任务ID)
python scripts/doubao_media.py vid "一只猫在草地上奔跑" --async

Generate Video from Image / 图生视频

python scripts/doubao_media.py vid "让这只猫动起来" --image "https://example.com/cat.jpg"

Check Video Status / 检查视频状态

python scripts/doubao_media.py status "task_xxxxx"

Parameters / 参数说明

Image Generation / 图片生成

Parameter Default Description
prompt Required Text description / 文本描述
--size 1024x1024 Image size: 1024x1024, 1024x1536, 1536x1024

Video Generation / 视频生成

Parameter Default Description
prompt Required Text description / 文本描述
--duration 5 Duration in seconds (2-12) / 时长(秒)
--ratio 16:9 Aspect ratio: 16:9, 4:3, 1:1, 9:16
--image None Image URL for image-to-video / 图生视频的图片URL
--async False Async mode / 异步模式

Available Models / 可用模型

Model ID Function Description
doubao-seedream-3-0-t2i-250415 Text-to-Image / 文生图 Generate images from text
doubao-seedance-1-0-pro-250528 Text-to-Video / 文生视频 Generate videos from text or images

Output / 输出

Generated files are saved to / 生成的文件保存到:

~/.openclaw/workspace/output/
├── img_YYYYMMDD_HHMMSS.jpeg    # Images / 图片
└── vid_YYYYMMDD_HHMMSS.mp4     # Videos / 视频

Performance / 性能指标

Operation Time Description
Text-to-Image / 文生图 10-30s Depends on complexity
Text-to-Video / 文生视频 1-3min 5-second video

Troubleshooting / 故障排除

Error: ARK_API_KEY not set

Solution / 解决方案:

$env:ARK_API_KEY="your_api_key"  # Windows
export ARK_API_KEY="your_api_key"  # Linux/Mac

Error: InvalidEndpointOrModel.NotFound

Solution / 解决方案: Go to Volcengine Console → Model Management and activate the required models. 前往火山引擎控制台 → 模型管理,开通所需模型。

Required models / 需要开通的模型:

  • Doubao-SeeDream-3.0-T2I (Text-to-Image)
  • Doubao-Seedance-1.0-Pro (Text-to-Video)

Error: UnicodeEncodeError (Windows)

Solution / 解决方案: The script handles this automatically with PYTHONIOENCODING=utf-8.


Security / 安全

  • Never hardcode API keys / 永远不要硬编码 API Key
  • Use environment variables / 使用环境变量存储敏感信息
  • Regularly rotate keys / 定期轮换 API Key

References / 参考资源


Version / 版本: 1.0.0 Last Updated / 最后更新: 2026-03-29

Usage Guidance
Before installing or running this skill: - Expectation vs reality: The registry metadata incorrectly states no env vars required. You must provide ARK_API_KEY (set in environment) or the script will fail. doubao-media.json and SKILL.md correctly mention ARK_API_KEY—update or verify metadata before automated installs. - 'Auto-send to chat' is advertised but not implemented: the script's _send_to_chat only prints a message; it does not actually call any messaging tool or send files. If you rely on auto-send, inspect/implement the messaging integration or treat sending as manual. - Verify the API endpoint: the script calls BASE_URL = https://ark.cn-beijing.volces.com/api/v3. Confirm this is the correct, official Volcengine endpoint for your account and region. - Review network and privacy implications: the skill downloads generated media from URLs the API returns and writes them to ~/.openclaw/workspace/output. Ensure you are comfortable with those files being saved and that your environment permits outbound connections to Volcengine and to the returned file URLs. - Run in a sandbox first: test with a non-privileged or temporary API key and inspect network calls and saved files before using a production key. - If you need the advertised auto-send, request the author to implement a real send (or add instructions for how OpenClaw agents should be invoked to attach/send files). Also ask the publisher to fix registry metadata to declare ARK_API_KEY as required.
Capability Analysis
Type: OpenClaw Skill Name: doubao-media Version: 1.0.0 The skill is a legitimate tool for generating images and videos using the Volcengine ARK (Doubao) API. The Python script `scripts/doubao_media.py` implements standard API interaction, file downloading, and status polling without any signs of malicious behavior, data exfiltration, or unauthorized execution.
Capability Assessment
Purpose & Capability
The description and code align with a Volcengine ARK media generator and the code legitimately requires an ARK_API_KEY. However, the top-level registry metadata declared 'Required env vars: none' while SKILL.md and doubao-media.json both indicate ARK_API_KEY is required. That mismatch is an incoherence that could mislead users or automation that relies on registry metadata.
Instruction Scope
SKILL.md instructs the user to set ARK_API_KEY and run the included Python script (consistent). It also promises 'Auto-Send to Chat' behavior; the script contains a _send_to_chat method but it only prints a message and does not actually invoke any subprocess or API to send the file. The SKILL.md therefore overstates automation: auto-send appears unimplemented (comment only). There are no instructions to read unrelated files or exfiltrate extra data.
Install Mechanism
No install spec (instruction-only) and the only runtime dependency is the 'requests' Python package (SKILL.md tells users to pip install requests). Nothing is downloaded from arbitrary URLs during install. Low install risk.
Credentials
The code correctly requires a single API key (ARK_API_KEY), which is proportionate for a cloud media-generation skill. The problem is that the registry metadata omitted this requirement, while doubao-media.json and SKILL.md declare it—this inconsistency could cause automated systems or users to miss the need for credentials. No other unrelated credentials are requested.
Persistence & Privilege
The skill does not request always:true or any elevated persistence. It writes generated files to ~/.openclaw/workspace/output (consistent with its purpose) and does not modify other skill configs or system-wide settings.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install doubao-media
  3. After installation, invoke the skill by name or use /doubao-media
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
doubao-media 1.0.0 – Initial release - Provides text-to-image, text-to-video, and image-to-video generation via Doubao (Volcengine ARK). - Automatically sends generated content to chat for convenience. - Supports both synchronous and asynchronous video generation modes. - Compatible with Windows; requires Python 3.8+ and the requests library. - Includes detailed usage instructions and troubleshooting tips.
Metadata
Slug doubao-media
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is doubao-media?

Doubao (Volcengine ARK) 文生图、文生视频工具 - 生成后自动发送到对话,无需手动查找文件。Text-to-image and text-to-video with auto-send to chat. It is an AI Agent Skill for Claude Code / OpenClaw, with 149 downloads so far.

How do I install doubao-media?

Run "/install doubao-media" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is doubao-media free?

Yes, doubao-media is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does doubao-media support?

doubao-media is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created doubao-media?

It is built and maintained by systiger (@systiger); the current version is v1.0.0.

💬 Comments