xixihhhh

Kling 3.0 — AI Video Generation by Kuaishou

Author: MikeWang · GitHub · v1.0.4 · MIT-0
cross-platform ✓ Security check passed
Total downloads: 767
Favorites: 0
Current installs: 0
Versions: 5
Install in OpenClaw
/install kling-video
Description
Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality, physics simulation, reference-based...
Usage instructions (SKILL.md)

Kling 3.0 & O3 — AI Video Generation by Kuaishou

Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality, realistic physics simulation, reference-based generation, and natural-language video editing.

Kling 3.0 excels at creating cinematic short clips with realistic motion, complex camera movements, and faithful prompt adherence. Kling Video O3 adds MVL (Multi-modal Visual Language) technology with reference-based generation and video editing capabilities. All models support optional synchronized sound generation.

Data usage note: This skill sends text prompts, image URLs, and video URLs to the Atlas Cloud API (api.atlascloud.ai) for video generation and editing. No data is stored locally beyond the downloaded output files. API usage incurs charges per second based on the model selected.


Key Capabilities

  • Text-to-Video — Generate video clips from text descriptions
  • Image-to-Video — Animate still images into dynamic video with first/last frame control
  • Reference-to-Video — Generate videos using character, prop, or scene reference images (O3)
  • Video Editing — Natural-language video editing: remove/replace objects, change backgrounds, add effects (O3)
  • Sound Generation — Optional synchronized sound effects and audio
  • Pro & Standard Tiers — Pro for highest quality, Standard for cost-effective production
  • Multiple Aspect Ratios — 16:9, 9:16, 1:1
  • Flexible Duration — V3: 5 or 10 seconds; O3: 3-15 seconds
  • Negative Prompts — Specify what to exclude from generated video (V3)

Setup

  1. Sign up at https://www.atlascloud.ai
  2. Console → API Keys → Create new key
  3. Set env: export ATLASCLOUD_API_KEY="your-key"

Script Usage

This skill includes a Python script for video generation. Zero external dependencies required.

List available video models

python scripts/generate_video.py list-models

Generate a video (text-to-video)

python scripts/generate_video.py generate \
  --model "MODEL_ID" \
  --prompt "Your prompt here" \
  --output ./output \
  duration=5 resolution=720p

Generate a video (image-to-video)

python scripts/generate_video.py generate \
  --model "MODEL_ID" \
  --image "https://example.com/photo.jpg" \
  --prompt "Animate this scene" \
  --output ./output

Upload a local file

python scripts/generate_video.py upload ./local-file.jpg

Run python scripts/generate_video.py generate --help for all options. Extra model params can be passed as key=value (e.g. duration=10 shot_type=multi_camera).
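
The key=value convention mentioned above could be parsed roughly as follows. This is only a sketch: `parse_extra_params` is a hypothetical helper, and the parsing inside scripts/generate_video.py may differ, including how it coerces types.

```python
import json


def parse_extra_params(pairs):
    """Turn ['duration=10', 'sound=true'] into {'duration': 10, 'sound': True}.

    Hypothetical helper; the bundled script may coerce values differently.
    """
    params = {}
    for pair in pairs:
        key, sep, raw = pair.partition("=")
        if not sep or not key:
            raise ValueError(f"expected key=value, got {pair!r}")
        try:
            # JSON decoding handles ints, floats, true/false and null.
            params[key] = json.loads(raw)
        except json.JSONDecodeError:
            # Anything that is not valid JSON is kept as a plain string.
            params[key] = raw
    return params


print(parse_extra_params(["duration=10", "shot_type=multi_camera"]))
```

Decoding each value as JSON first keeps numbers and booleans typed, so the resulting dict can be merged directly into a request body.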


Pricing

All prices are per second of video generated. Atlas Cloud offers 15% off compared to standard API pricing.

Kling V3.0

| Model | Tier | Original Price | Atlas Cloud | Best For |
| --- | --- | --- | --- | --- |
| kwaivgi/kling-v3.0-std/text-to-video | Standard | $0.18/s | $0.153/s | Cost-effective text-to-video |
| kwaivgi/kling-v3.0-std/image-to-video | Standard | $0.18/s | $0.153/s | Cost-effective image animation |
| kwaivgi/kling-v3.0-pro/text-to-video | Pro | $0.24/s | $0.204/s | High-quality text-to-video |
| kwaivgi/kling-v3.0-pro/image-to-video | Pro | $0.24/s | $0.204/s | High-quality image animation |

Kling Video O3 Pro

| Model | Original Price | Atlas Cloud | Best For |
| --- | --- | --- | --- |
| kwaivgi/kling-video-o3-pro/text-to-video | $0.24/s | $0.204/s | MVL-enhanced text-to-video |
| kwaivgi/kling-video-o3-pro/image-to-video | $0.24/s | $0.204/s | MVL-enhanced image animation |
| kwaivgi/kling-video-o3-pro/reference-to-video | $0.24/s | $0.204/s | Reference-based video generation |
| kwaivgi/kling-video-o3-pro/video-edit | $0.36/s | $0.306/s | Professional video editing |

Kling Video O3 Standard

| Model | Original Price | Atlas Cloud | Best For |
| --- | --- | --- | --- |
| kwaivgi/kling-video-o3-std/text-to-video | - | $0.153/s | Cost-effective MVL text-to-video |
| kwaivgi/kling-video-o3-std/image-to-video | - | $0.153/s | Cost-effective MVL image animation |
| kwaivgi/kling-video-o3-std/reference-to-video | - | $0.085/s | Cost-effective reference-based generation |
| kwaivgi/kling-video-o3-std/video-edit | - | $0.238/s | Budget video editing |
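
Because billing is per second, a rough cost estimate is the per-second price times the clip length. The sketch below copies the Atlas Cloud prices from the tables above. It assumes the charge is exactly price × duration with no rounding or minimum fee, which the source does not confirm, and `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Atlas Cloud per-second prices in USD, copied from the pricing tables.
ATLAS_PRICE_PER_SECOND = {
    "kwaivgi/kling-v3.0-std/text-to-video": Decimal("0.153"),
    "kwaivgi/kling-v3.0-std/image-to-video": Decimal("0.153"),
    "kwaivgi/kling-v3.0-pro/text-to-video": Decimal("0.204"),
    "kwaivgi/kling-v3.0-pro/image-to-video": Decimal("0.204"),
    "kwaivgi/kling-video-o3-pro/text-to-video": Decimal("0.204"),
    "kwaivgi/kling-video-o3-pro/image-to-video": Decimal("0.204"),
    "kwaivgi/kling-video-o3-pro/reference-to-video": Decimal("0.204"),
    "kwaivgi/kling-video-o3-pro/video-edit": Decimal("0.306"),
    "kwaivgi/kling-video-o3-std/text-to-video": Decimal("0.153"),
    "kwaivgi/kling-video-o3-std/image-to-video": Decimal("0.153"),
    "kwaivgi/kling-video-o3-std/reference-to-video": Decimal("0.085"),
    "kwaivgi/kling-video-o3-std/video-edit": Decimal("0.238"),
}


def estimate_cost(model, duration_seconds):
    """Estimated charge in USD, assuming cost = price per second x seconds."""
    return ATLAS_PRICE_PER_SECOND[model] * duration_seconds


# The listed 15% discount: $0.24/s x 0.85 gives the $0.204/s Atlas Cloud price.
assert Decimal("0.24") * Decimal("0.85") == Decimal("0.204")

print(estimate_cost("kwaivgi/kling-v3.0-pro/text-to-video", 5))
```

Under that assumption, a 5-second V3.0 Pro clip comes to 5 × $0.204 = $1.02. `Decimal` is used so the arithmetic stays exact to the cent.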

Parameters

Kling V3.0 — Text-to-Video

| Parameter | Type | Required | Default | Options |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | - | Video description |
| negative_prompt | string | No | - | What to exclude from the video |
| duration | integer | No | 5 | 5, 10 seconds |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16, 1:1 |
| cfg_scale | number | No | 0.5 | 0-1, controls prompt adherence |
| sound | boolean | No | false | Generate synchronized audio |
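
The constraints in this table can be checked client-side before a request is sent. The following is a minimal sketch, not part of the skill: `build_v3_text_payload` is a hypothetical helper that validates the documented ranges and returns a request body.

```python
V3_DURATIONS = {5, 10}
ASPECT_RATIOS = {"16:9", "9:16", "1:1"}


def build_v3_text_payload(model, prompt, negative_prompt=None, duration=5,
                          aspect_ratio="16:9", cfg_scale=0.5, sound=False):
    """Validate V3.0 text-to-video parameters and return a request body dict."""
    if not prompt:
        raise ValueError("prompt is required")
    if duration not in V3_DURATIONS:
        raise ValueError("duration must be 5 or 10 seconds")
    if aspect_ratio not in ASPECT_RATIOS:
        raise ValueError("aspect_ratio must be 16:9, 9:16, or 1:1")
    if not 0 <= cfg_scale <= 1:
        raise ValueError("cfg_scale must be between 0 and 1")
    payload = {
        "model": model,
        "prompt": prompt,
        "duration": duration,
        "aspect_ratio": aspect_ratio,
        "cfg_scale": cfg_scale,
        "sound": sound,
    }
    # negative_prompt is optional, so it is only sent when provided.
    if negative_prompt:
        payload["negative_prompt"] = negative_prompt
    return payload


print(build_v3_text_payload("kwaivgi/kling-v3.0-std/text-to-video",
                            "Ocean waves at sunset",
                            negative_prompt="blurry, watermark"))
```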

Kling V3.0 — Image-to-Video

Same as V3.0 text-to-video, plus:

| Parameter | Type | Required | Description |
| --- | --- | --- | --- |
| image | string | Yes | URL of the source image (jpg/jpeg/png, max 10MB, min 300px, aspect ratio 1:2.5 to 2.5:1) |
| end_image | string | No | URL of the target end frame (for guided motion) |

Kling Video O3 — Text-to-Video

| Parameter | Type | Required | Default | Options |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | - | Video description |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16, 1:1 |
| duration | integer | No | 5 | 3-15 seconds |
| sound | boolean | No | false | Generate synchronized audio |

Kling Video O3 — Image-to-Video

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | - | Video description |
| image | string | Yes | - | First frame image URL |
| end_image | string | No | - | Last frame image URL |
| duration | integer | No | 5 | 3-15 seconds |
| generate_audio | boolean | No | false | Auto-add audio to video |

Kling Video O3 — Reference-to-Video

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | - | Video description |
| images | array | No | - | Reference images (up to 7 without video, up to 4 with video) |
| video | string | No | - | Reference video URL |
| keep_original_sound | boolean | No | true | Keep original sound from reference video |
| sound | boolean | No | false | Generate new audio |
| aspect_ratio | string | No | 16:9 | 16:9, 9:16, 1:1 |
| duration | integer | No | 5 | 3-15 seconds |
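
The image limit for reference-to-video depends on whether a reference video is also supplied. This small sketch shows one way to check that rule before submitting. `check_reference_inputs` is a hypothetical helper, and only the limits stated in the table are enforced.

```python
def check_reference_inputs(images=(), video=None, duration=5):
    """Raise ValueError if the inputs exceed the documented O3 reference limits.

    Returns the image limit that applied, for convenience.
    """
    # Up to 7 reference images on their own, but only 4 alongside a video.
    max_images = 4 if video else 7
    if len(images) > max_images:
        raise ValueError(
            f"at most {max_images} reference images allowed, got {len(images)}"
        )
    if not 3 <= duration <= 15:
        raise ValueError("duration must be between 3 and 15 seconds")
    return max_images


refs = [f"https://example.com/ref-{i}.jpg" for i in range(5)]
print(check_reference_inputs(refs))  # five images without a video is allowed
```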

Kling Video O3 — Video Editing

| Parameter | Type | Required | Default | Description |
| --- | --- | --- | --- | --- |
| prompt | string | Yes | - | Editing instruction in natural language |
| video | string | Yes | - | Source video URL (max 10s duration) |
| images | array | No | - | Reference images for element, scene, or style (max 4) |
| keep_original_sound | boolean | No | true | Keep original audio from the video |

Workflow: Submit → Poll → Download

Text-to-Video Example (V3.0 Pro)

# Step 1: Submit
curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-pro/text-to-video",
    "prompt": "A golden retriever running through a sunlit meadow, camera tracking alongside, wildflowers swaying in the breeze",
    "aspect_ratio": "16:9",
    "duration": 5,
    "cfg_scale": 0.5,
    "sound": true
  }'
# Returns: { "code": 200, "data": { "id": "prediction-id" } }

# Step 2: Poll (every 5 seconds until "completed" or "succeeded")
curl -s "https://api.atlascloud.ai/api/v1/model/prediction/{prediction-id}" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY"
# Returns: { "code": 200, "data": { "status": "completed", "outputs": ["https://...video-url..."] } }

# Step 3: Download
curl -o output.mp4 "VIDEO_URL_FROM_OUTPUTS"
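
For callers who prefer Python to curl, Step 1 can be sketched with the standard library alone. The endpoint, headers, and response shape are taken from the curl example above. The function names are hypothetical and are not necessarily how scripts/generate_video.py is organised.

```python
import json
import urllib.request

API_BASE = "https://api.atlascloud.ai/api/v1/model"


def build_submit_request(payload, api_key):
    """Build (but do not send) the POST request for the generateVideo endpoint."""
    return urllib.request.Request(
        f"{API_BASE}/generateVideo",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def extract_prediction_id(response_body):
    """Read data.id from a response shaped like the 'Returns' comment in Step 1."""
    return json.loads(response_body)["data"]["id"]


def submit(payload, api_key):
    """Send the request and return the prediction id. Calling this incurs charges."""
    with urllib.request.urlopen(build_submit_request(payload, api_key)) as response:
        return extract_prediction_id(response.read())
```

Keeping request construction separate from sending makes it possible to inspect or test the request without spending API seconds. A call would look like `submit({"model": "...", "prompt": "..."}, os.environ["ATLASCLOUD_API_KEY"])`.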

Image-to-Video Example (V3.0 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-pro/image-to-video",
    "image": "https://example.com/landscape.jpg",
    "prompt": "The camera slowly pans across the landscape as clouds drift by and trees sway gently",
    "aspect_ratio": "16:9",
    "duration": 5,
    "sound": false
  }'

Reference-to-Video Example (O3 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-video-o3-pro/reference-to-video",
    "prompt": "A young woman walks through a cherry blossom garden, camera follows from behind",
    "images": ["https://example.com/character-ref.jpg"],
    "aspect_ratio": "16:9",
    "duration": 5,
    "sound": false
  }'

Video Editing Example (O3 Pro)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-video-o3-pro/video-edit",
    "video": "https://example.com/original-video.mp4",
    "prompt": "Remove the person in the background and replace with a blooming cherry tree",
    "keep_original_sound": true
  }'

Standard Tier Example (Cost-Effective)

curl -s -X POST "https://api.atlascloud.ai/api/v1/model/generateVideo" \
  -H "Authorization: Bearer $ATLASCLOUD_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "kwaivgi/kling-v3.0-std/text-to-video",
    "prompt": "Ocean waves crashing on a rocky shore at sunset, seagulls flying overhead",
    "aspect_ratio": "16:9",
    "duration": 5,
    "cfg_scale": 0.5
  }'

Polling Logic

  • processing / starting / running → wait 5s, retry (typically takes ~60-120s)
  • completed / succeeded → done, get URL from data.outputs[]
  • failed → error, read data.error
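
The status handling above can be sketched as a small loop. This is an illustrative sketch rather than the skill's actual code: `wait_for_outputs` and its `max_attempts` cap are assumptions, and any status other than the finished or failed ones is treated as still in progress.

```python
import time

DONE_STATUSES = {"completed", "succeeded"}


def wait_for_outputs(fetch_prediction, interval=5.0, max_attempts=60,
                     sleep=time.sleep):
    """Poll until the prediction finishes, then return data.outputs.

    fetch_prediction is any zero-argument callable that returns the decoded
    JSON body of the prediction endpoint, e.g. {"data": {"status": ...}}.
    """
    for _ in range(max_attempts):
        data = fetch_prediction()["data"]
        status = data["status"]
        if status in DONE_STATUSES:
            return data["outputs"]
        if status == "failed":
            raise RuntimeError(data.get("error", "generation failed"))
        # processing / starting / running (or anything else): wait and retry.
        sleep(interval)
    raise TimeoutError("prediction did not finish within the attempt limit")
```

Passing the fetch and sleep functions in as arguments keeps the loop independent of the HTTP client and lets it be exercised with canned responses. With the defaults it gives up after 60 attempts at 5-second intervals, comfortably above the typical 60-120s generation time.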

Atlas Cloud MCP Tools (if available)

If the Atlas Cloud MCP server is configured, use built-in tools:

atlas_generate_video(model="kwaivgi/kling-v3.0-pro/text-to-video", params={...})
atlas_get_prediction(prediction_id="...")

Implementation Guide

  1. Determine task type:

    • Text-to-video: user describes a scene/action in text
    • Image-to-video: user provides an image to animate
    • Reference-to-video: user wants to generate video using character/prop/scene references
    • Video editing: user wants to modify an existing video
  2. Choose model family:

    • Kling V3.0 for standard text-to-video and image-to-video with negative prompts and cfg_scale control
    • Kling Video O3 for MVL-enhanced generation, reference-based video, video editing, and longer durations (3-15s)
  3. Choose tier:

    • Pro for final output, client-facing content, or quality-critical use
    • Standard for most production use, cost-effective generation
  4. Extract parameters:

    • Prompt: describe scene, action, camera movement, and visual details
    • Negative prompt (V3 only): specify undesired elements (e.g., "blurry, distorted faces, watermark")
    • Aspect ratio: infer from context (social reel→9:16, YouTube→16:9, square→1:1)
    • Duration: V3 supports 5 or 10s; O3 supports 3-15s
    • cfg_scale (V3 only): 0.5 default; increase toward 1.0 for stricter prompt adherence
    • Sound: enable if user wants audio; disabled by default
  5. Execute: POST to generateVideo API → poll result → download MP4

  6. Present result: show file path, offer to play
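
Steps 1 to 3 amount to picking a model ID from the task, the tier, and the requested duration. One possible encoding of that decision is sketched below. `choose_model` is a hypothetical helper, and defaulting to V3.0 for text-to-video and image-to-video whenever the duration allows is an assumption, not a rule stated by the skill.

```python
O3_ONLY_TASKS = {"reference-to-video", "video-edit"}
ALL_TASKS = {"text-to-video", "image-to-video"} | O3_ONLY_TASKS
TIERS = {"std", "pro"}


def choose_model(task, tier="std", duration=5):
    """Map a task, tier, and duration to one of the documented model IDs."""
    if task not in ALL_TASKS:
        raise ValueError(f"unknown task: {task!r}")
    if tier not in TIERS:
        raise ValueError("tier must be 'std' or 'pro'")
    # V3.0 only offers 5 s or 10 s clips and has no reference or edit models,
    # so those cases fall through to Kling Video O3.
    use_o3 = task in O3_ONLY_TASKS or duration not in (5, 10)
    family = "kling-video-o3" if use_o3 else "kling-v3.0"
    return f"kwaivgi/{family}-{tier}/{task}"


print(choose_model("text-to-video", "pro"))
print(choose_model("image-to-video", "std", duration=12))
```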

Prompt Tips

Kling produces best results with detailed, descriptive prompts:

  • Scene + Action: "A chef flips a pancake in a busy kitchen, steam rising from the pan"
  • Camera direction: "Camera slowly pans left to reveal...", "Close-up tracking shot of...", "Aerial view sweeping over..."
  • Style: "cinematic", "documentary style", "slow motion", "timelapse", "anime style"
  • Negative prompts (V3): Use to avoid common issues — "blurry, low quality, distorted, watermark, text overlay"
  • cfg_scale tuning (V3): Lower values (0.3-0.5) give more creative freedom; higher values (0.7-1.0) follow the prompt more strictly
  • Reference-to-video (O3): Provide clear character/prop reference images for consistent results
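
As a small illustration of the tips above, a prompt can be assembled from separate scene, camera, and style fragments. `compose_prompt` is a hypothetical convenience function; Kling only sees the final string, and this is just one way to build it.

```python
def compose_prompt(scene_action, camera=None, style=None):
    """Join scene/action, camera direction, and style into one prompt string."""
    parts = [scene_action, camera, style]
    # Drop missing fragments and trailing punctuation before joining.
    cleaned = [part.strip().rstrip(".,") for part in parts if part and part.strip()]
    return ", ".join(cleaned)


print(compose_prompt(
    "A chef flips a pancake in a busy kitchen, steam rising from the pan",
    camera="close-up tracking shot",
    style="cinematic",
))
```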

Image Requirements for Image-to-Video

When using image-to-video models, the source image must meet these requirements:

  • Format: JPG, JPEG, or PNG
  • Size: Maximum 10MB
  • Dimensions: Minimum 300px on shortest side
  • Aspect ratio: Between 1:2.5 and 2.5:1
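
These requirements can be checked locally before uploading. The sketch below assumes the caller already knows the file size and pixel dimensions (reading them is outside the standard library's scope for JPEG), interprets "10MB" as 10 × 1024 × 1024 bytes, and uses a hypothetical helper name, `image_problems`.

```python
import os

ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png"}
MAX_BYTES = 10 * 1024 * 1024  # assumption: "10MB" means 10 MiB


def image_problems(filename, size_bytes, width, height):
    """Return a list of requirement violations (empty if the image passes)."""
    problems = []
    extension = os.path.splitext(filename)[1].lower()
    if extension not in ALLOWED_EXTENSIONS:
        problems.append("format must be JPG, JPEG, or PNG")
    if size_bytes > MAX_BYTES:
        problems.append("file is larger than 10MB")
    if min(width, height) < 300:
        problems.append("shortest side is under 300px")
    # width:height must lie between 1:2.5 and 2.5:1; integer math avoids
    # floating-point edge cases (1/2.5 == 2/5).
    if 5 * width < 2 * height or 2 * width > 5 * height:
        problems.append("aspect ratio is outside 1:2.5 to 2.5:1")
    return problems


print(image_problems("photo.jpg", 2_000_000, 1920, 1080))
```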
Security recommendations
This skill appears to do precisely what it says: call Atlas Cloud to generate or edit short videos. Before installing, consider:

  1. Only provide ATLASCLOUD_API_KEY if you trust Atlas Cloud and the skill source; this key grants the skill billing-capable access to your Atlas Cloud account.
  2. The script will upload any local file you choose and will download generated videos to disk, so do not upload sensitive files.
  3. Using the skill consumes paid API seconds (pricing is documented in SKILL.md); verify pricing and your account limits.
  4. To limit risk, create a scoped API key on Atlas Cloud (if supported) with minimal permissions, and monitor API and key usage after enabling the skill.
Functional analysis
Type: OpenClaw Skill
Name: kling-video
Version: 1.0.4
The skill provides a legitimate interface for AI video generation using Kling models via the Atlas Cloud API. The Python script (scripts/generate_video.py) uses standard libraries to interact with api.atlascloud.ai, correctly handling environment-based credentials and implementing basic path safety for downloads, with no evidence of malicious intent or data exfiltration.
Capability assessment
Purpose & Capability
Name and description (Kling video generation/editing) match the included Python script and SKILL.md. The single required env var (ATLASCLOUD_API_KEY) is the API credential needed to call the Atlas Cloud endpoints documented in the skill.
Instruction Scope
SKILL.md and scripts confine activity to communicating with api.atlascloud.ai (model listing, submit generation, poll results, upload media, download outputs). The script only reads ATLASCLOUD_API_KEY, local files explicitly provided by the user (with a confirmation prompt), and writes downloaded outputs to disk. It does not reference unrelated system paths or other environment variables.
Install Mechanism
No install spec — instruction-only with one Python script using only the standard library (urllib, json, etc.). Nothing is downloaded or installed automatically by the skill.
Credentials
Only ATLASCLOUD_API_KEY is required and is used appropriately (sent in Authorization header to Atlas Cloud over HTTPS). No other secrets or platform credentials are requested.
Persistence & Privilege
Skill is not always-on and does not request elevated platform privileges or modify other skills/config. The agent may invoke the skill autonomously by default (platform normal), but that is consistent with the skill's purpose.
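How to use
  1. Make sure OpenClaw is installed (locally or via Docker).
  2. Enter the install command in the chat box: /install kling-video
  3. Once installed, invoke the skill by name or trigger it with /kling-video.
  4. Provide the inputs described in the skill's parameter documentation to receive structured output.
Version history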
v1.0.4
  • Added a new Python script: `scripts/generate_video.py` for video generation and management.
  • Users can now list available models, generate videos (text-to-video or image-to-video), and upload local files using command-line commands.
  • No external dependencies required; extra model parameters can be passed as key=value.
  • Updated documentation to include script usage instructions and command examples.
  • No other changes to API or functionality.
v1.0.3
kling-video v1.0.3: Initial Release
  • First public release of the Kling Video skill and CLI for Atlas Cloud.
  • Supports text-to-video, image-to-video, multi-model selection, 4K resolution, multi-aspect ratios, audio option, and output file control.
  • Includes detailed usage instructions, CLI command format, supported models, and parameter options.
  • Provides real-time video generation via Atlas Cloud API with easy CLI integration.
v1.0.2
kling-video 1.0.2
  • Major documentation update: expanded feature descriptions in SKILL.md for Kling 3.0 and O3.
  • Added support information for reference-to-video generation and MVL (Multi-modal Visual Language) features.
  • Updated available models, parameters, and clarified Pro/Standard/O3 model options.
  • Pricing section revised for per-second billing and Atlas Cloud discounts.
  • Adjusted supported durations: Kling V3 (5/10s), O3 (3-15s).
  • Improved trigger and usage guidelines in the description.
v1.0.1
Adds AI-powered video editing alongside generation and animation.
  • Introduced Kling Video O3 support for natural-language video editing: remove/replace objects, change backgrounds, and add visual effects.
  • Updated skill description and triggers to cover both video generation and editing use cases.
  • Expanded documentation with new model, parameters, and workflow for video editing.
  • Added updated pricing and model comparison, highlighting cost advantages and new capabilities.
  • No functional code changes detected; documentation and feature scope significantly improved.
v1.0.0
Initial release of kling-video: generate AI videos with Kuaishou's Kling 3.0
  • Supports text-to-video and image-to-video generation with cinematic motion, physics simulation, and creative control
  • Pro and Standard tiers available for quality/cost trade-offs; up to 1080p and 5-10 second videos
  • Optional synchronized sound generation
  • Simple workflow: submit task → poll for result → download video (with example curl commands)
  • Detailed documentation for model options, parameters, aspect ratios, and pricing
  • Designed for content creation needs including product demos, social reels, short films, and more
Metadata
Slug: kling-video
Version: 1.0.4
License: MIT-0
Total installs: 0
Current installs: 0
Number of versions: 5
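FAQ

What is Kling 3.0 — AI Video Generation by Kuaishou?

Generate, animate, and edit AI videos using Kuaishou's Kling 3.0 and Kling Video O3 — featuring cinematic motion quality, physics simulation, reference-based... It is an AI Agent Skill plugin for Claude Code / OpenClaw and has been downloaded 767 times to date.

How do I install Kling 3.0 — AI Video Generation by Kuaishou?

Run the command "/install kling-video" in the OpenClaw or Claude Code chat box for a one-step install. The install itself needs no further configuration, although the ATLASCLOUD_API_KEY environment variable described under Setup must be set before the skill can call the API.

Is Kling 3.0 — AI Video Generation by Kuaishou free?

Yes. The skill is free and released under the MIT-0 license, so it can be freely downloaded, installed, and used. Atlas Cloud API usage is billed separately, per second of video, as listed under Pricing.

Which platforms does Kling 3.0 — AI Video Generation by Kuaishou support?

Kling 3.0 — AI Video Generation by Kuaishou is cross-platform and runs in any environment where OpenClaw / Claude Code is deployed.

Who develops Kling 3.0 — AI Video Generation by Kuaishou?

It is developed and maintained by MikeWang (@xixihhhh). The current version is v1.0.4.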
