volcengine-skills

Byted Seedance Video Generate

Author: volcengine-skills · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
Total downloads: 92
Favorites: 0
Current installs: 0
Versions: 1
Install in OpenClaw
/install byted-seedance-video-generate
Description
Generate videos using Seedance models. Invoke when user wants to create videos from text prompts, images, or reference materials.
Usage Instructions (SKILL.md)

Video Generate Skill

This skill generates videos using Doubao Seedance 1.0/1.5 models.

Trigger Conditions

  1. User wants to generate videos from text descriptions
  2. User wants to create videos based on images (first/last frame)
  3. User wants to create videos with reference materials (images, videos, audio)
  4. User asks for video generation capabilities

Usage

Environment Variables

Before using this skill, ensure the following environment variables are set:

  • ARK_API_KEY or MODEL_VIDEO_API_KEY or MODEL_AGENT_API_KEY: API key for the video generation service
  • MODEL_VIDEO_API_BASE: API base URL (optional, has default)
  • MODEL_VIDEO_NAME: Model name (optional, has default)
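The key lookup described above can be sketched as follows. This is a minimal illustration, not the code in scripts/video_generate.py: the function name `resolve_api_key` and the lookup order (ARK_API_KEY first) are assumptions, and the error text mirrors the message quoted under Error Handling.

```python
import os
from typing import Mapping, Optional

# Variable names come from the list above; the lookup order is an assumption.
API_KEY_VARS = ("ARK_API_KEY", "MODEL_VIDEO_API_KEY", "MODEL_AGENT_API_KEY")


def resolve_api_key(env: Optional[Mapping[str, str]] = None) -> str:
    """Return the first non-empty API key found, or raise PermissionError."""
    env = os.environ if env is None else env
    for name in API_KEY_VARS:
        value = env.get(name, "").strip()
        if value:
            return value
    raise PermissionError(
        " or ".join(API_KEY_VARS) + " not found in environment variables"
    )
```

Passing a plain dict instead of `os.environ` makes the lookup easy to check without touching the real environment.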

Function Signature

from typing import Dict, Optional

async def video_generate(
    params: list,
    batch_size: int = 10,
    max_wait_seconds: int = 1200,
    model_name: Optional[str] = None,
) -> Dict:
    ...

Parameters

params (list[dict])

A list of video generation requests. Each item is a dict with the following fields:

Required per item:

  • video_name (str): Name/identifier of the output video file
  • prompt (str): Text describing the video to generate. Supports Chinese and English.

Optional per item - Input Materials:

  • first_frame (str): URL for the first frame image
  • last_frame (str): URL for the last frame image
  • reference_images (list[str]): 1-4 reference image URLs for style/content guidance
  • reference_videos (list[str]): 0-3 reference video URLs (mp4/mov, 2-15s each, total ≤15s)
  • reference_audios (list[str]): 0-3 reference audio URLs (mp3/wav, 2-15s each, total ≤15s)

Optional per item - Video Output Parameters:

  • ratio (str): Aspect ratio. Options: "16:9" (default), "9:16", "4:3", "3:4", "1:1", "2:1", "21:9", "adaptive"
  • duration (int): Video length in seconds. Range: 2-12s depending on model
  • resolution (str): Video resolution. Options: "480p", "720p", "1080p"
  • frames (int): Total frame count. Must be in [29, 289] and follow format 25 + 4n
  • camera_fixed (bool): Lock camera movement. Default: false
  • seed (int): Random seed for reproducibility. Range: [-1, 2^32-1]
  • watermark (bool): Whether to add watermark. Default: false
  • generate_audio (bool): Whether to generate audio. Only Seedance 1.5 supports this
  • tools (list[dict]): Tool configuration, e.g., [{"type": "web_search"}]
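The per-item constraints listed above can be checked before a request is submitted. The sketch below is a hypothetical helper (`validate_item` is not part of the skill) that encodes only the limits stated in this section; the actual service may apply further model-specific rules.

```python
from typing import Any, Dict, List

ALLOWED_RATIOS = {"16:9", "9:16", "4:3", "3:4", "1:1", "2:1", "21:9", "adaptive"}
ALLOWED_RESOLUTIONS = {"480p", "720p", "1080p"}


def validate_item(item: Dict[str, Any]) -> List[str]:
    """Return a list of problems; an empty list means the item looks valid."""
    problems = []
    for field in ("video_name", "prompt"):
        if not item.get(field):
            problems.append(f"missing required field: {field}")
    if "ratio" in item and item["ratio"] not in ALLOWED_RATIOS:
        problems.append(f"unsupported ratio: {item['ratio']}")
    if "resolution" in item and item["resolution"] not in ALLOWED_RESOLUTIONS:
        problems.append(f"unsupported resolution: {item['resolution']}")
    if "duration" in item and not 2 <= item["duration"] <= 12:
        problems.append("duration must be between 2 and 12 seconds")
    if "frames" in item:
        frames = item["frames"]
        # Frame counts must lie in [29, 289] and have the form 25 + 4n.
        if not 29 <= frames <= 289 or (frames - 25) % 4 != 0:
            problems.append("frames must be in [29, 289] and equal 25 + 4n")
    if "seed" in item and not -1 <= item["seed"] <= 2**32 - 1:
        problems.append("seed must be in [-1, 2^32-1]")
    images = item.get("reference_images")
    if images is not None and not 1 <= len(images) <= 4:
        problems.append("reference_images must contain 1-4 URLs")
    for key in ("reference_videos", "reference_audios"):
        if len(item.get(key, [])) > 3:
            problems.append(f"{key} must contain at most 3 URLs")
    return problems
```

For example, `frames=29` (25 + 4·1) and `frames=289` (25 + 4·66) pass, while `frames=30` is reported as a problem.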

Input Modes

  1. Text-to-Video: Only provide prompt, no images/videos
  2. First Frame Guidance: Provide first_frame for starting image
  3. First + Last Frame Guidance: Provide both for transition video
  4. Reference Images: Provide reference_images for style/content guidance
  5. Multimodal Reference: Combine reference_images, reference_videos, reference_audios
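As a rough illustration of how the five modes map onto the request fields, the sketch below classifies a single `params` item. The function name, the returned labels, the precedence between modes, and the rejection of a `last_frame` without a `first_frame` are all assumptions; the audio-only rejection follows the Notes section.

```python
from typing import Any, Dict


def input_mode(item: Dict[str, Any]) -> str:
    """Classify a request item into one of the five input modes (labels are illustrative)."""
    has_first = bool(item.get("first_frame"))
    has_last = bool(item.get("last_frame"))
    has_images = bool(item.get("reference_images"))
    has_videos = bool(item.get("reference_videos"))
    has_audios = bool(item.get("reference_audios"))

    if has_videos or has_audios:
        # Multimodal input needs at least one image or video alongside any audio.
        if not (has_images or has_videos):
            raise ValueError("audio-only reference input is not supported")
        return "multimodal_reference"
    if has_images:
        return "reference_images"
    if has_first and has_last:
        return "first_last_frame"
    if has_first:
        return "first_frame"
    if has_last:
        # Assumption: a last frame on its own is treated as invalid input.
        raise ValueError("last_frame was given without first_frame")
    return "text_to_video"
```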

Return Value

Script Return Info

The video_generate.py script returns the following information:

{
    "status": "success" | "partial_success" | "error",
    "success_list": [{"video_name": "video_url"}],
    "error_list": ["video_name"],
    "error_details": [{"video_name": "...", "error": {...}}],
    "pending_list": [{"video_name": "...", "task_id": "cgt-xxx", ...}]
}

Based on the script return info, the final response returned to the user consists of a description of the video generation task and the video URL(s). You may download the video from the URL, but the video URL should still be provided to the user for viewing and downloading.

Note: the URL is the value stored under each video name in the success_list of the script return info. The URL must be returned in the two ways described under Final Return Info.

Final Return Info

You must return two types of information:

  1. File: return both the video file itself (if you have a way to send files) and its local path, for example: /root/.openclaw/workspace/skills/video-generate/xxx.mp4

  2. Video URLs: after generation, present the list of video URLs as embedded video tags in Markdown, for example:

<video src="https://example.com/video1.mp4" width="640" controls>video-1</video>
<video src="https://example.com/video2.mp4" width="640" controls>video-2</video>
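One way to produce these tags from the script's return info is sketched below. `render_video_tags` is a hypothetical helper, and it assumes each `success_list` entry maps a video name to its URL, as shown in the Script Return Info structure.

```python
from html import escape
from typing import Any, Dict, List


def render_video_tags(result: Dict[str, Any], width: int = 640) -> List[str]:
    """Build one <video> tag per successful video in the script's return info."""
    tags = []
    for entry in result.get("success_list", []):
        # Each entry is assumed to look like {"video_name": "video_url"}.
        for name, url in entry.items():
            tags.append(
                f'<video src="{escape(url, quote=True)}" width="{width}" controls>'
                f"{escape(name)}</video>"
            )
    return tags
```

Escaping the name and URL keeps an unusual video name from breaking the surrounding markup.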

Code Implementation

See scripts/video_generate.py for the full implementation.

Example Usage

# Text-to-Video
python scripts/video_generate.py -p "小猫骑着滑板穿过公园" -n cat_park -r 16:9 -d 5 --resolution 720p

# First Frame Guidance
python scripts/video_generate.py -p "小猫跳起来" -n cat_jump -f "https://example.com/cat.png" -r adaptive -d 5

# First + Last Frame Guidance
python scripts/video_generate.py -p "平滑过渡动画" -n transition \
    -f "https://example.com/start.png" \
    -l "https://example.com/end.png" \
    -d 6

# Reference Images (style/content guidance)
python scripts/video_generate.py -p "[图1]戴着眼镜的男生和[图2]柯基小狗坐在草坪上" -n styled \
    --ref-images "https://example.com/boy.png" "https://example.com/dog.png" \
    -r 16:9 -d 5

# Multimodal Reference (video + audio)
python scripts/video_generate.py -p "将视频中的人物换成[图1]中的男孩" -n multimodal \
    --ref-images "https://example.com/boy.png" \
    --ref-videos "https://example.com/source.mp4" \
    --ref-audios "https://example.com/voice.wav" \
    -d 5

# With Audio Generation (Seedance 1.5 only)
python scripts/video_generate.py -p "女孩抱着狐狸,可以听到风声和树叶沙沙声" -n with_audio \
    -f "https://example.com/girl_fox.png" \
    --generate-audio \
    -m doubao-seedance-1-5-pro-251215 \
    -d 6 --resolution 1080p

# Query task status
python scripts/video_generate.py -q "cgt-20260222165751-wsnw8"

# Use specific model
python scripts/video_generate.py -p "A futuristic city" -m doubao-seedance-1-5-pro-251215

# No watermark
python scripts/video_generate.py -p "A beautiful landscape" --no-watermark

Command Line Options

| Option | Short | Description |
| --- | --- | --- |
| --prompt | -p | Text description of the video (required) |
| --name | -n | Video name identifier (default: video) |
| --model | -m | Model name (default: doubao-seedance-1-0-pro-250528) |
| --ratio | -r | Aspect ratio (default: 16:9) |
| --duration | -d | Video duration in seconds (2-12) |
| --resolution | | Video resolution: 480p, 720p, 1080p |
| --first-frame | -f | First frame image URL |
| --last-frame | -l | Last frame image URL |
| --ref-images | | Reference image URLs (space-separated, 1-4 images) |
| --ref-videos | | Reference video URLs (space-separated, 0-3 videos) |
| --ref-audios | | Reference audio URLs (space-separated, 0-3 audios) |
| --generate-audio | | Generate audio (Seedance 1.5 only) |
| --seed | | Random seed for reproducibility |
| --no-watermark | | Disable watermark |
| --timeout | -t | Max wait time in seconds (default: 1200) |
| --query-task | -q | Query task status by task_id |
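For readers who want to see how these options fit together, here is a sketch of an argparse parser that mirrors the documented flags and defaults. It is an illustration, not the parser in scripts/video_generate.py; in particular, `--prompt` is left optional here on the assumption that `--query-task` can be used without it, as in the example above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Illustrative parser mirroring the documented command line options."""
    parser = argparse.ArgumentParser(description="Generate videos with Seedance models")
    # Assumption: not marked required so that -q/--query-task can be used alone.
    parser.add_argument("--prompt", "-p", help="Text description of the video")
    parser.add_argument("--name", "-n", default="video", help="Video name identifier")
    parser.add_argument("--model", "-m", default="doubao-seedance-1-0-pro-250528")
    parser.add_argument("--ratio", "-r", default="16:9", help="Aspect ratio")
    parser.add_argument("--duration", "-d", type=int, help="Duration in seconds (2-12)")
    parser.add_argument("--resolution", choices=["480p", "720p", "1080p"])
    parser.add_argument("--first-frame", "-f", help="First frame image URL")
    parser.add_argument("--last-frame", "-l", help="Last frame image URL")
    parser.add_argument("--ref-images", nargs="+", help="1-4 reference image URLs")
    parser.add_argument("--ref-videos", nargs="*", help="0-3 reference video URLs")
    parser.add_argument("--ref-audios", nargs="*", help="0-3 reference audio URLs")
    parser.add_argument("--generate-audio", action="store_true")
    parser.add_argument("--seed", type=int, help="Random seed")
    parser.add_argument("--no-watermark", action="store_true")
    parser.add_argument("--timeout", "-t", type=int, default=1200)
    parser.add_argument("--query-task", "-q", help="Task ID to query")
    return parser
```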

Model Fallback

If you encounter a model-related error (like ModelNotOpen), you can downgrade to these models:

  • doubao-seedance-1-5-pro-251215
  • doubao-seedance-1-0-pro-250528
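The fallback idea can be sketched as a loop over the two models listed above. Everything except the model names is hypothetical: `ModelNotOpenError` stands in for whatever model-related error the script surfaces, and `generate` is any callable that takes a model name and returns a result.

```python
from typing import Any, Callable, Optional, Sequence

# Model names are taken from the list above.
FALLBACK_MODELS = (
    "doubao-seedance-1-5-pro-251215",
    "doubao-seedance-1-0-pro-250528",
)


class ModelNotOpenError(Exception):
    """Hypothetical stand-in for a model-related error such as ModelNotOpen."""


def generate_with_fallback(
    generate: Callable[[str], Any],
    models: Sequence[str] = FALLBACK_MODELS,
) -> Any:
    """Try each model in order and return the first successful result."""
    last_error: Optional[Exception] = None
    for model in models:
        try:
            return generate(model)
        except ModelNotOpenError as exc:
            last_error = exc  # Remember the failure and try the next model.
    raise RuntimeError("no fallback model is available") from last_error
```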

Error Handling

  • If the script raises "PermissionError: ARK_API_KEY or MODEL_VIDEO_API_KEY or MODEL_AGENT_API_KEY not found in environment variables", tell the user that one of the ARK_API_KEY, MODEL_VIDEO_API_KEY, or MODEL_AGENT_API_KEY environment variables must be provided. Once the user supplies a key, write it to the environment variable file in the workspace, appending it to the end if the file already exists. Check that the entry is correctly formatted, make the variable take effect, and then retry the video generation task that failed.
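The append-and-activate step might look like the sketch below. The helper name and the `NAME=value` line format are assumptions, since the skill does not name the environment file or its syntax; setting `os.environ` only affects the current Python process.

```python
import os
from pathlib import Path


def append_env_var(env_file: Path, name: str, value: str) -> None:
    """Append NAME=value to env_file (creating it if needed) and export it in-process."""
    existing = env_file.read_text(encoding="utf-8") if env_file.exists() else ""
    # Start the new entry on its own line if the file lacks a trailing newline.
    separator = "" if existing == "" or existing.endswith("\n") else "\n"
    with env_file.open("a", encoding="utf-8") as handle:
        handle.write(f"{separator}{name}={value}\n")
    # Make the variable effective for the current process only.
    os.environ[name] = value
```

Because this stores a credential in plain text, it is worth keeping the file out of version control and limiting the key to the video generation service.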

Notes

  • Keep prompt concise (recommended ≤ 500 characters)
  • For first/last frame, ensure aspect ratios match your chosen ratio
  • Reference images: 1-4 images, formats: jpeg/png/webp/bmp/tiff/gif
  • Reference videos: 0-3 videos, formats: mp4/mov, total duration ≤ 15s
  • Reference audios: 0-3 audios, formats: mp3/wav, total duration ≤ 15s
  • Multimodal requires at least one image or video (audio-only not supported)
  • Audio generation is only supported by Seedance 1.5 pro
  • If polling times out, use --query-task with the returned task_id
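To follow up on timed-out tasks, the `pending_list` in the script's return info can be turned into `--query-task` commands. The helper below is a hypothetical sketch that relies only on the `task_id` field and the `-q` flag documented above.

```python
import shlex
from typing import Any, Dict, List


def pending_query_commands(result: Dict[str, Any]) -> List[str]:
    """Build one status-query command per pending task in the return info."""
    commands = []
    for entry in result.get("pending_list", []):
        task_id = entry.get("task_id")
        if task_id:
            # shlex.quote guards against unexpected characters in the ID.
            commands.append(
                f"python scripts/video_generate.py -q {shlex.quote(task_id)}"
            )
    return commands
```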
Security Recommendations
Before installing, verify the remote API and credential expectations: SKILL.md and the Python script require an API key (ARK_API_KEY or MODEL_VIDEO_API_KEY or MODEL_AGENT_API_KEY) even though the registry metadata lists none — do not supply high-privilege or unrelated credentials. Confirm the default API endpoint (https://ark.cn-beijing.volces.com) is a trusted Seedance/Doubao host for your use; if not, set MODEL_VIDEO_API_BASE to a known endpoint. Ensure your runtime can install httpx or provide it in the environment. Understand that any prompts and reference media URLs you provide will be transmitted to the remote service, so avoid sending sensitive content. If you need higher confidence, ask the publisher for provenance (homepage, official docs, or signed release) or run the script in an isolated environment and inspect network traffic to confirm where data is sent.
Functional Analysis
Type: OpenClaw Skill
Name: byted-seedance-video-generate
Version: 1.0.0

The skill provides a standard interface for generating videos using the ByteDance Seedance API via the Volcengine Ark platform (ark.cn-beijing.volces.com). The Python script `scripts/video_generate.py` implements task creation, polling, and error handling using the `httpx` library. While `SKILL.md` contains instructions for the agent to automatically configure missing API keys by writing to a workspace environment file, this behavior is aligned with the skill's functional requirements for setup and does not show evidence of malicious intent, data exfiltration, or unauthorized execution.
Capability Assessment
Purpose & Capability
Name/description (Seedance video generation) match the implementation: the script builds generation tasks and polls a remote API. However, the registry metadata claims no required environment variables while both SKILL.md and the script expect an API key (ARK_API_KEY / MODEL_VIDEO_API_KEY / MODEL_AGENT_API_KEY) and optional API base/model name env vars — this is an inconsistency.
Instruction Scope
SKILL.md instructs the agent to provide API keys and to call/run scripts that submit prompts and reference media URLs to a remote service. The instructions require the agent to return both video URLs and a local path/embed snippet. The script itself only interacts with the remote API and polls task status; it will send provided media URLs and prompts to the remote service (expected for this purpose). SKILL.md references a concrete local path for returned files which the script does not obviously create, so the instruction and implementation are not perfectly aligned.
Install Mechanism
There is no install spec (instruction-only + bundled Python script). That minimizes install-time risk, but the script imports httpx (a non-standard package) and assumes a Python runtime; the skill does not declare or install that dependency. The lack of an explicit dependency/install step is a mismatch you should plan for.
Credentials
The script legitimately needs an API key for the remote video-generation service; that is proportional. But registry metadata lists no required env vars while the SKILL.md and code require API keys. Additionally the code defaults API_BASE to an undocumented host (https://ark.cn-beijing.volces.com/api/v3) and accepts ARK_BASE_URL though SKILL.md does not document ARK_BASE_URL; using an unknown default endpoint increases risk and should be validated.
Persistence & Privilege
The skill does not request always:true and does not modify other skills or system-wide settings. It runs on demand and only needs outbound network access to the configured API.
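How to Use
  1. Make sure OpenClaw is installed (locally or via Docker)
  2. Enter the install command in the chat box: /install byted-seedance-video-generate
  3. Once installed, invoke the skill by name or trigger it with /byted-seedance-video-generate
  4. Provide the required inputs described in the skill's parameter documentation to get structured output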
Version History
v1.0.0
byted-seedance-video-generate 1.0.0

  • Initial release of the video generation skill powered by Doubao Seedance 1.0/1.5 models.
  • Supports generating videos from text prompts, images (first/last frame), reference images, reference videos, and audio.
  • Provides detailed configuration options for aspect ratio, duration, resolution, seed, watermark, and more.
  • Returns both the downloadable video file path and Markdown-embedded video URLs for user access.
  • Includes robust error handling for missing API keys and model fallback mechanisms.
  • Offers command-line usage examples and comprehensive parameter documentation.
Metadata
Slug byted-seedance-video-generate
Version 1.0.0
License MIT-0
Total installs 0
Current installs 0
Version count 1
FAQ

What is Byted Seedance Video Generate?

Generate videos using Seedance models. Invoke when user wants to create videos from text prompts, images, or reference materials. It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 92 total downloads to date.

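How do I install Byted Seedance Video Generate?

Run the command "/install byted-seedance-video-generate" in the OpenClaw or Claude Code chat box to install it in one step. Installation itself needs no extra configuration, but an API key (ARK_API_KEY, MODEL_VIDEO_API_KEY, or MODEL_AGENT_API_KEY) must be set before the skill can generate videos.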

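Is Byted Seedance Video Generate free?

Yes. Byted Seedance Video Generate is completely free and released under the MIT-0 license, so it can be freely downloaded, installed, and used.

Which platforms does Byted Seedance Video Generate support?

Byted Seedance Video Generate is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops Byted Seedance Video Generate?

It is developed and maintained by volcengine-skills (@volcengine-skills); the current version is v1.0.0.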
