The Vidu API supports text-based video, image-based video, reference-based video, and video from the first and last frames.

Name: The Vidu API supports text-based video, image-based video, reference-based video, and video from the first and last frames.
Author: x-jihua

by Vidu AI · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

Downloads: 344

Install in OpenClaw

/install vidu-video-generate

Description

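Vidu AI video generation. Supports text-to-video, image-to-video, reference-to-video, and first-and-last-frame video. Conversational invocation with automatic intent recognition.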

Usage Guidance

Before installing, be aware this skill will upload any images or text you provide to an external Vidu API and requires a VIDU_API_KEY. The registry metadata incorrectly lists no required env vars — verify that you will supply VIDU_API_KEY and that the key is scoped appropriately. Inspect the included script if you care where data is sent: SKILL.md says it should pick api.vidu.cn for simplified-Chinese users and api.vidu.com otherwise, but the visible code sets BASE_URL to api.vidu.cn with no automatic switch — confirm which endpoint will actually be used so you know where your data goes. Also confirm the CLI actually returns or prints the creation URL as SKILL.md demands (the printed fields in the script do not obviously include creations[0].url for video tasks). If you plan to use sensitive images or personally identifiable data, test the skill in a controlled environment and consider using an API key with limited scope and billing limits. If possible, ask the publisher for an authoritative homepage/source and for the metadata to be corrected; lack of an official homepage and the metadata mismatch reduce confidence.

Capability Analysis

Type: OpenClaw Skill
Name: vidu-video-generate
Version: 1.0.0

The skill bundle is a legitimate integration for the Vidu AI video generation platform. It provides a CLI tool (scripts/vidu_cli.py) to interact with the Vidu API for tasks like text-to-video, image-to-video, and voice cloning. The code uses standard Python libraries (urllib) for network requests and file downloads, handles API keys via environment variables, and contains no evidence of data exfiltration, malicious execution, or prompt injection. All instructions in SKILL.md are aligned with the stated purpose of video generation.

Capability Assessment

ℹ Purpose & Capability

The skill's name and description (video generation) match the included CLI and SKILL.md, both of which call an external Vidu API. Requiring an API key (VIDU_API_KEY) is proportionate for this purpose. However, the registry metadata lists no required env vars, while SKILL.md and the script both require VIDU_API_KEY. This mismatch is noteworthy and could cause unexpected failures or confusion.

⚠ Instruction Scope

SKILL.md instructs the agent to run the included Python CLI which will read local files (images/text), base64-encode local images, and POST them to external domains. That's expected for a video-generation skill, but there are two inconsistencies: (1) SKILL.md promises automatic API domain selection based on user language, yet the visible script uses a fixed BASE_URL (https://api.vidu.cn/ent/v2) with no implemented language-detection switch; (2) SKILL.md mandates the agent return creations[0].url, but the CLI's visible print_task_result() does not print creation URLs for video endpoints (it prints task id, state, model, prompt, etc.), so the runtime behavior may not produce the exact output the skill specification requires. The CLI also has the ability to read arbitrary local files (images or text files) and upload them — expected but sensitive.
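The two inconsistencies above can be made concrete with a minimal sketch of what the missing pieces might look like. This is not the bundle's code: `select_base_url` and `extract_creation_url` are hypothetical helpers, the CJK check is a crude stand-in for real language detection (it cannot tell simplified from traditional Chinese), the `/ent/v2` path is confirmed only for api.vidu.cn and assumed for api.vidu.com, and the response shape is inferred from the `creations[0].url` field that SKILL.md names.

```python
from typing import Any, Optional

# "/ent/v2" appears in the script only for api.vidu.cn; reusing it for
# api.vidu.com is an assumption made for this sketch.
BASE_URL_CN = "https://api.vidu.cn/ent/v2"
BASE_URL_GLOBAL = "https://api.vidu.com/ent/v2"


def select_base_url(user_text: str) -> str:
    """Hypothetical helper: use the .cn host when the text contains CJK ideographs."""
    has_cjk = any("\u4e00" <= ch <= "\u9fff" for ch in user_text)
    return BASE_URL_CN if has_cjk else BASE_URL_GLOBAL


def extract_creation_url(task: dict[str, Any]) -> Optional[str]:
    """Return creations[0].url when present, otherwise None.

    The task dict layout is assumed from the field path quoted in SKILL.md.
    """
    creations = task.get("creations") or []
    if creations and isinstance(creations[0], dict):
        return creations[0].get("url")
    return None
```

Returning `None` instead of raising lets a caller distinguish a task that is still processing from one that finished without a usable link.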

✓ Install Mechanism

No install spec — instruction-only plus an included script. Nothing is downloaded from external URLs or installed automatically, so install-time risk is low. The skill will execute a local Python script (scripts/vidu_cli.py) which is part of the bundle.

⚠ Credentials

The skill requires VIDU_API_KEY (declared in SKILL.md and used in the script). That is appropriate for contacting Vidu, but the top-level registry metadata incorrectly lists no required env vars — a mismatch that could mislead users about what secrets are necessary. Also, the script will upload local images/text to api.vidu.cn or api.vidu.com (SKILL.md describes domain switching); you should confirm which domain will actually be used because sending sensitive images to a .cn domain may have regulatory or privacy implications.
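Because the registry metadata does not declare the variable, a pre-flight check can surface a missing key before any file is read or uploaded. The following is a small sketch under that assumption; `require_api_key` and `mask_key` are hypothetical names, not functions from scripts/vidu_cli.py.

```python
import os
from typing import Mapping


def require_api_key(env: Mapping[str, str] = os.environ) -> str:
    """Return the trimmed VIDU_API_KEY, or raise if it is missing or blank."""
    key = env.get("VIDU_API_KEY", "").strip()
    if not key:
        raise RuntimeError("VIDU_API_KEY is not set; export it before running the skill.")
    return key


def mask_key(key: str) -> str:
    """Hide all but the last four characters so the key can be logged safely."""
    if len(key) <= 4:
        return "*" * len(key)
    return "*" * (len(key) - 4) + key[-4:]
```

Passing the environment as a parameter keeps the check testable without touching the real process environment.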

✓ Persistence & Privilege

The skill does not request always:true, does not modify other skills or system-wide config in the visible files, and does not request elevated system persistence. It will, however, run as an autonomous-invocable skill by default (platform normal), which increases blast radius only if combined with other red flags.

How to Use

1. Make sure OpenClaw is installed (local or Docker).
2. Run the install command in chat: /install vidu-video-generate
3. After installation, invoke the skill by name or use /vidu-video-generate
4. Provide the required inputs per the skill's parameter spec and get structured output.

Version History

v1.0.0

Initial release of the vidu-video-generate skill:

- Provides video generation via Vidu AI, supporting text-to-video, image-to-video, reference-image-to-video, and start-end frame video.
- Detects user intent automatically based on input and selects the appropriate API/model.
- Includes a detailed model selection strategy for each video type.
- Supports language-aware API domain selection (api.vidu.cn vs. api.vidu.com) based on user language.
- Delivers results in Markdown file reference format with a direct video link.
- Handles error cases such as an invalid API key, large images, or generation failure.

Metadata

Slug vidu-video-generate

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is vidu-video-generate?

Vidu AI video generation. It supports text-to-video, image-to-video, reference-to-video, and first-and-last-frame video, with conversational invocation and automatic intent recognition. It is an AI Agent Skill for Claude Code / OpenClaw, with 344 downloads so far.

How do I install vidu-video-generate?

Run "/install vidu-video-generate" in the OpenClaw or Claude Code chat to install it in one step. To use it, you must also supply a VIDU_API_KEY environment variable.

Is vidu-video-generate free?

Yes. vidu-video-generate is licensed under MIT-0, so you can download, install, and use the skill itself at no cost. Calls to the external Vidu API require your own VIDU_API_KEY.

Which platforms does vidu-video-generate support?

vidu-video-generate is cross-platform and runs anywhere OpenClaw / Claude Code is available.

Who created vidu-video-generate?

It is built and maintained by Vidu AI (@x-jihua); the current version is v1.0.0.
