← 返回 Skills 市场

ComfyUI Video

Name: ComfyUI Video
Author: a3165458

作者 smeb y · GitHub ↗ · v1.0.0 · MIT-0

cross-platform ⚠ suspicious

729

总下载

当前安装

版本数

在 OpenClaw 中安装

/install comfyui-video

功能描述

Automate AI video generation with ComfyUI and LTX-2.3. Supports text-to-video (T2V), image-to-video (I2V), batch scene rendering for music videos, and multi-...

使用说明 (SKILL.md)

ComfyUI Video Generation

Automate AI video generation using ComfyUI + LTX-2.3 model. Ideal for music video (MV) production, multi-scene batch rendering, and AI video content creation.

Requirements

Item	Spec
GPU	≥24GB VRAM (Turing/Ampere/Ada)
ComfyUI	0.17+
PyTorch	2.6+cu124
Access	SSH tunnel forwarding port 18188

Model Setup

Model	Size	Path
LTX-2.3 dev (bf16)	43GB	`models/checkpoints/ltx-2.3-22b-dev.safetensors`
Gemma 3 12B	23GB	`models/text_encoders/comfy_gemma_3_12B_it.safetensors`
Distilled LoRA	7.1GB	`models/loras/ltxv/ltx2/ltx-2.3-22b-distilled-lora-384.safetensors`
Video VAE (bf16)	-	`models/vae/LTX23_video_vae_bf16.safetensors`

Turing GPUs (e.g., Quadro RTX 8000) do NOT support fp8_e4m3fn. Use bf16/fp16 models only.

Performance Baseline

Per-step time: ~221s (constant, regardless of frame count!)
15 steps: ~57 min
25 steps: ~1h45m
Frames: 72=3s, 121=5s, 480=20s (24fps)

Key insight: Frame count does NOT affect total time. Bottleneck is model forward pass.

Workflow Node Reference

Node	ID	Purpose
LoadImage	2004	I2V reference input
CLIPTextEncode (positive)	2483	Positive prompt
CLIPTextEncode (negative)	2612	Negative prompt
EmptyLTXVLatentVideo	3059	Empty latent
LTXVScheduler	4966	Steps/length params
LoraLoaderModelOnly	4922+	LoRA loader
SaveVideo	4823/4852	Output mp4

Quick Start

Generate a Single Video (I2V)

Load workflow: /workspace/ComfyUI/custom_nodes/ComfyUI-LTXVideo/example_workflows/2.3/LTX-2.3_T2V_I2V_Single_Stage_Distilled_Full.json
Set params using scripts/batch_scenes.js
Click Run
Wait ~1 hour
Download from /workspace/ComfyUI/output/

Batch Scene Generation

Use scripts/batch_scenes.js for automation:

// Load script first, then configure each scene:
await comfyui_batch.configureScene({
  name: "scene_01",
  prompt: "A lonely girl running through rain at night, neon reflections",
  image: "unified_ref.png",
  steps: 15,
  frames: 72
});
// Click Run, repeat for next scene

Step Count Guide

Steps	Quality	Time/Scene	Use Case
8	Rough	~30min	Quick preview
15	Good	~57min	Recommended sweet spot
25	Best	~1h45m	Final quality output

I2V + LoRA at 15 steps achieves ~90% of 25-step quality with 40% less time.

Troubleshooting

VAEDecode Validation Failed

Error: Exception when validating node: 'VAEDecode' Cause: VAE load timing or insufficient VRAM Fix: Reload the entire workflow (fetch + loadGraphData), wait for models to fully load, then run. Never reload during execution.

Browser Tab Lost

Cause: SSH tunnel disconnected Fix:

Rebuild tunnel: ssh -f -N -L 18188:localhost:18188 user@host -p port
Navigate to ComfyUI
Reload workflow

Inconsistent Characters Across Scenes

Cause: Different reference images per scene Fix: Use the SAME reference image for all scenes. Extract a clear frame from an existing video if needed. The I2V input image dictates the visual baseline.

Output Video Not Saved

Check: ssh -p PORT root@HOST "ls -lht /workspace/ComfyUI/output/*.mp4" Fix: Check for VAEDecode errors in log, then re-run.

Monitoring Progress

# Current sampling progress
ssh -p PORT root@HOST "grep 'it/s' /tmp/comfy.log | tail -1"

# Completion check
ssh -p PORT root@HOST "grep 'Prompt executed' /tmp/comfy.log | tail -1"

# Output files
ssh -p PORT root@HOST "ls -lht /workspace/ComfyUI/output/*.mp4"

Best Practices

15 steps is the sweet spot — I2V converges at 15-20 steps, 25 has diminishing returns
Unified reference image — Same input image for all scenes ensures character consistency
Reload workflow every time — Avoids VAEDecode validation failures
Never reload during execution — Current run will fail
Frame selection — 72 frames (3s) for testing, 480 frames (20s) for final output
VRAM management — Wait for each generation to complete before starting next

T2V vs I2V Comparison

Mode	Steps	Quality	Notes
T2V (no LoRA)	15	❌ Very blurry	Not recommended
I2V + LoRA	25	✅ Excellent	Major quality improvement
I2V + LoRA	15	✅ Very good	Best time/quality ratio

Conclusion: I2V + LoRA is the recommended combination.

Resources

scripts/batch_scenes.js — Batch scene automation
references/workflow_nodes.md — Full node ID mapping
references/tips.md — Prompt tips, VRAM management, optimization

安全使用建议

This skill is internally consistent for automating ComfyUI workflows, but take these precautions before using it: 1) Inspect and run the scripts only in a trusted browser session — avoid pasting unknown JS into consoles on machines you don't control. 2) The skill expects SSH access to a host running ComfyUI and large local model files (paths like /workspace/ComfyUI); make sure you trust that remote host and have the necessary credentials/permissions. 3) No secrets are embedded in the skill, but using it requires you to supply SSH credentials yourself; do not expose keys or reuse credentials you cannot trust. 4) Verify model licensing and content policy compliance for any images/people used. If you want a stricter risk posture, test on an isolated VM or a non-production host first.

功能分析

Type: OpenClaw Skill Name: comfyui-video Version: 1.0.0 The skill bundle provides automation for ComfyUI video generation but includes instructions in SKILL.md for the AI agent to execute high-privilege shell commands via SSH (e.g., `ssh -p PORT root@HOST`) to monitor logs and list files. While these actions are plausibly related to managing a remote GPU workspace, the inclusion of root-level SSH commands represents a significant attack surface and risky capability. Additionally, scripts/batch_scenes.js performs direct manipulation of the ComfyUI web API and application state, which could be leveraged for unintended actions if the agent's context is compromised.

能力评估

✓ Purpose & Capability

The name/description match the contents: guidance for ComfyUI + LTX-2.3, a browser-side automation script, workflow node mappings, and SSH-based monitoring. The large-model and GPU requirements are appropriate for the stated task. There are no unrelated credentials, binaries, or external services requested.

✓ Instruction Scope

SKILL.md stays on-topic: it instructs how to load workflows, tune nodes, run batch scenes using the included browser JS helper, and how to check progress via SSH on a host running ComfyUI. It does not instruct reading unrelated local/system files or sending data to unknown endpoints. It does assume you have SSH access to the ComfyUI host and filesystem.

✓ Install Mechanism

No install spec; this is instruction-only with a small browser script. No downloads or archive extraction are requested. The included script runs in the ComfyUI web UI context and contains no obfuscated code or external network calls.

ℹ Credentials

The skill does not declare any required environment variables or credentials (metadata shows none), which is consistent with the browser-console + SSH usage model. However, the runtime guidance expects SSH access and local model files in /workspace/ComfyUI; users will need appropriate SSH keys/credentials and filesystem access to the remote host — these are not provided by the skill and must be supplied by the user.

✓ Persistence & Privilege

The skill is not force-included (always: false) and does not request persistent privileges or modify other skills. It only exposes a helper on window.comfyui_batch when run in browser context, which is normal for a client-side utility.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install comfyui-video
安装完成后，直接呼叫该 Skill 的名称或使用 /comfyui-video 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.0.0

Initial release: LTX-2.3 video generation workflow, batch scene automation, troubleshooting guide

元数据

Slug comfyui-video

版本 1.0.0

许可证 MIT-0

累计安装 3

当前安装数 3

历史版本数 1

常见问题

ComfyUI Video 是什么？

Automate AI video generation with ComfyUI and LTX-2.3. Supports text-to-video (T2V), image-to-video (I2V), batch scene rendering for music videos, and multi-... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 729 次。

如何安装 ComfyUI Video？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install comfyui-video」即可一键安装，无需额外配置。

ComfyUI Video 是免费的吗？

是的，ComfyUI Video 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

ComfyUI Video 支持哪些平台？

ComfyUI Video 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（cross-platform）。

谁开发了 ComfyUI Video？

由 smeb y（@a3165458）开发并维护，当前版本 v1.0.0。