← 返回 Skills 市场
brajendrak00068

AI Agentic Video Editor

作者 brajendrak00068 · GitHub ↗ · v1.0.17 · MIT-0
cross-platform ✓ 安全检测通过
302
总下载
0
收藏
0
当前安装
18
版本数
在 OpenClaw 中安装
/install ai-agentic-video-editor
功能描述
AI agentic video editor. An autonomous AI editor for video that decomposes natural-language prompts into a planned, verified, executed sequence of edits — vi...
使用说明 (SKILL.md)

Levea Agentic Video Editor for OpenClaw

A full agentic video editing surface. Send a prompt, get a planned, verified, executed edit — or call a known canonical action directly with structured params. Both modes flow through the same brain + safety gates as the in-product editor.

Use this skill when the user asks for OpenClaw video editing, AI video editing, natural-language video edits, viral clips, TikTok videos, Instagram Reels, YouTube Shorts, auto captions, subtitles, chroma key, green screen removal, background removal, B-roll, motion tracking, silence removal, audio cleanup, voiceover, music generation, vertical video, multi-platform export, or MP4 export.

Beta: This skill is in beta and outputs can be wrong. Before executing any mutating edit on user content, describe the planned edit and request explicit confirmation from the user. For destructive or irreversible workflows, pass requirePlanApproval: true so the editor halts after planning and the user can approve before execution.

Endpoint

POST {ADSCENE_API_URL}/api/v1/misc/openclaw/v1/execute

Auth: Authorization: Bearer {ADSCENE_API_KEY}

Create an account and generate the OpenClaw API key in Studio: https://studio.livecor.ai/. Use https://api.livecore.ai for ADSCENE_API_URL; Studio is only for signup, login, and key management.

Accepts either single-shot JSON (default) or SSE (Accept: text/event-stream or ?stream=true).

Request body:

{
  "tool": "autonomous_edit" | "\x3Callowlisted-tool>",
  "params": { ... },
  "project_id": "optional-project-id",
  "scene": { /* optional client scene; server-side committed scene wins if newer */ }
}

How you use it

Every edit goes through one tool: autonomous_edit. Pass a natural-language description in params.prompt. The agent classifies the intent, decomposes into atomic steps, plans, executes through safety gates, and verifies the result. There are no other tools the caller needs to learn.

curl -sS -X POST "$ADSCENE_API_URL/api/v1/misc/openclaw/v1/execute" \
  -H "Authorization: Bearer $ADSCENE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "tool": "autonomous_edit",
    "params": {
      "prompt": "Make this a TikTok-ready viral clip: vertical reframe, add bold captions, remove silences, and apply motion tracking to the speaker."
    },
    "project_id": "my-project"
  }'

Behind a single autonomous_edit call the agent can compose any of:

Read / inspect

  • Inspect the timeline structure, layer properties, and current scene state
  • Run computer-vision analysis on frames (object/face/scene detection)
  • Query the transcript with keyword, semantic, or timestamp-window search
  • Pull video intelligence (narrative peaks, speaker diarization, sentiment, pacing)
  • Search the asset gallery (videos, images, audio) by type, duration, name
  • Poll async job status; introspect property schemas

Structural editing

  • Insert / update / replace / delete layers (video, audio, text, image, shape, group, adjustment)
  • Trim, split, retime layers (slow-mo 0.5×, fast-forward 2×, freeze-frame)
  • Reposition on the timeline, sequence layers, snap to transcript
  • Heal timeline gaps, normalize audio, reconcile durations (pre-export safety pass)
  • Multi-step undo / redo

Visual editing

  • Color grading (brightness, contrast, saturation, hue, lift/gamma/gain, RGB curves)
  • Procedural VFX shaders: smoke, dust, fire, explosion, lightning, snow, glitch, scanlines, grain, glassmorphism, bokeh, lava, telomere/corrosion, portal
  • Chroma key (green / blue screen) with similarity, smoothness, spill controls; luma / alpha / depth masks
  • Geometric clip shapes (circle, dome, star, hexagon, …)
  • Crop (absolute or edge-based), 3D rotation + perspective
  • Glow, shadow, inner shadow, gradient fills, text gradients
  • Vertical reframe (9:16) and vertical-reframe montage
  • Split screen (top/bottom, left/right, picture-in-picture)
  • Branding overlays (logo / watermark) from gallery or AI-generated
  • Motion / face tracking with dynamic zoom-follow

Captions & text

  • Auto-generate captions from transcript
  • Style captions with built-in templates or an AI director that picks/generates a custom template at runtime
  • Curved text paths (circle, wave, custom SVG)
  • Per-word entrance / exit animations (typewriter, slide, fade, scale, rotate, bounce, flip, swing, elastic, blur, glitch, wave, plus matching exits)
  • Lottie animation playback control

Audio

  • Clean audio: remove silences, breaths, filler words; word-level mute or cut
  • Auto-ducking on speech detection (sidechain music vs voice)
  • Mix / normalize / denoise / EQ (bass boost, vocal clarity, warm, bright)
  • Sync external master audio to video (offset, mute camera audio)
  • Beat / kick-drum-synced cuts (provide beat_times or bpm)
  • Add SFX, generate music (mood / genre / BPM), generate voiceover (TTS or cloned voice)
  • Render waveform visualizers (bars, wave, circular)

Async generation

  • AI video / B-roll — duration + aspect ratio
  • AI images — single or batch at timestamps
  • AI music — prompt + duration + mood + genre + BPM
  • AI voiceover — TTS or cloned voice library
  • Auto-thumbnail extraction
  • Face blur (all faces or background-only)
  • Image edit (generative instruction-based)

High-level kits (each is a single canonical action that orchestrates many underlying edits)

  • APPLY_VIRAL_KIT — vertical reframe + captions + silence removal + motion tracking + emphasis
  • APPLY_CINEMATIC_DIRECTOR — energy analysis + dynamic zooms + cinematic color grade + mood-based camera moves
  • APPLY_EMPHASIS_SYSTEM — keyword detection + scaling / glow / pulse coordinated with captions
  • OPTIMIZE_PACING — filler-word + silence + low-energy segment removal for retention

Export

  • EXPORT_VIDEO — render to MP4 (resolution / codec / quality tier)
  • GENERATE_VIRAL_CLIPS — auto-segment short-form clips packaged as ZIP
  • GENERATE_MULTI_PLATFORM — TikTok + Reels + Shorts + YouTube + Instagram aspect ratios in one pass

Auto-export follow-up

After any mutating autonomous_edit call, if the scene was actually changed and the agent did not already queue an export, the route fires one automatically as a second run. The final response carries videoUrl (when ready) or jobId (for polling). Read-only and conversational autonomous_edit calls do NOT trigger auto-export.

Optional input parameters (parity with the in-product editor)

Pass any of these inside params (or at the top level of the body) to drive advanced features:

  • prompt — required on every call (the natural-language edit description)
  • workingMemory — durable working-memory snapshot. Re-send to resume after awaiting_approval
  • requirePlanApprovaltrue makes the agent stop after planning and emit awaiting_approval; resume with the same workingMemory + an approval prompt ("yes", "approve", "do it", …)
  • attachedImages — array of base64 screenshots / reference images
  • flaggedIssues — array of strings describing specific problems the user wants fixed
  • captionTemplatePreset, captionTemplateMode — style preset routing for caption generation
  • core_only (also accepted via ?core_only=true) — return a minimal scene shape (rendering-only, no debug fields)
  • assets — additional asset descriptors to make available to the agent

Response shapes

JSON (default)

{
  "type": "success" | "partial_success",
  "tool": "\x3Ctool-name>",
  "success": true,
  "status": "completed" | "failed" | "awaiting_approval",
  "scene": { /* updated scene */ },
  "reply": "Human-readable summary of what changed",
  "videoUrl": "https://.../output.mp4",
  "jobId": "task_...",
  "viral_clips": [ /* clip metadata if generated */ ],
  "zip_url": "https://.../clips.zip",
  "activeTasks": [ /* queued background jobs */ ],
  "pendingAsyncJobs": [ /* in-flight job status */ ],
  "workflowStepsDetailed": [ /* every executed step */ ],
  "workflowSummary": { "title": "...", "summary": "..." },
  "verificationPassed": true,
  "verificationIssues": [],
  "committedToProjectScene": true,
  "processingTime": 12.3,
  "message": "Same as reply",
  "workingMemory": { /* return this in the next call to resume approval-paused runs */ }
}

Failure response (HTTP 4xx/5xx):

{ "success": false, "error": "...", "code": "UNKNOWN_TOOL" | "MISSING_TOOL" | "EXECUTION_ERROR" }

SSE (Accept: text/event-stream or ?stream=true)

The same event stream the in-product editor uses. Notable event types:

  • heartbeat — every 15s, keeps the connection alive
  • status — phase transitions (request_received, runtime_start, …)
  • mode_select{ mode: "qa" | "action" }
  • thinking, tool_call, tool_result — per-step reasoning visibility
  • background_job_completed — async job finished (B-roll, viral clips, …)
  • workflow_completed — main brain loop done, verification may continue
  • success / partial_success — final terminal payload (same shape as JSON above)
  • error — terminal failure

Async job lifecycle

Generation actions (generate_*, EXPORT_VIDEO) return immediately with a jobId in activeTasks / pendingAsyncJobs. Poll status with:

GET {ADSCENE_API_URL}/api/v1/misc/openclaw/v1/jobs/{jobId}
Authorization: Bearer {ADSCENE_API_KEY}

Response:

{
  "success": true,
  "jobId": "task_xxx",
  "status": "queued" | "processing" | "completed" | "failed",
  "progress": 0.74,
  "message": "Rendering frame 142 of 192",
  "result": { /* artifact URLs / clip metadata on completion */ },
  "error": null,
  "createdAt": "...",
  "updatedAt": "..."
}

To pull async-generated content into the timeline once jobs settle, the agent uses APPLY_PENDING internally — autonomous_edit callers don't need to manage this, but direct callers can issue an autonomous_edit prompt like "apply any pending generated content" to harvest.

Plan approval flow

If you pass requirePlanApproval: true, the agent stops after planning and the response carries status: "awaiting_approval" + a populated workingMemory. To proceed, call again with:

{
  "tool": "autonomous_edit",
  "params": {
    "prompt": "yes",
    "workingMemory": { /* the workingMemory from the previous response */ }
  }
}

Accepted approval phrases: yes, y, approve, approved, go, proceed, go ahead, do it, confirm.

Safety, verification, and limits

Every run flows through three deterministic gates (ActionPermissionGate, ArchitectureControlPlane, EditorSafetyPolicy). Destructive actions (CLEAR, mass deletes) require explicit confirmation params. Verification runs after execution and may trigger up to 2 repair loops; failures surface in verificationPassed: false + verificationIssues[]. Concurrent identical requests for the same (user, project, prompt, scene fingerprint) are deduplicated server-side.

Rate-limited per API key. Processing times vary: read-only ~1–3s, structural edits ~3–10s, async generation 30s–5min per artifact, viral-clip / multi-platform exports several minutes.

Supported formats

  • Video in: MP4, MOV, WebM (HTTP/HTTPS URLs, YouTube URLs, gallery IDs)
  • Image in: JPG, PNG, WebP
  • Audio in: MP3, WAV, M4A, AAC (or extracted from video)
  • Output: MP4 (export), ZIP (viral clips / multi-platform bundles)
  • Max video length: depends on plan; soft limit ~30 min for synchronous edits, async generation handles longer
  • Recommended resolution: 1080p or 4K; canvas is configurable per project

Example: end-to-end viral-clip generation

# 1) Kick off the viral-clip pipeline (auto-export follow-up queues rendering)
curl -sS -X POST "$ADSCENE_API_URL/api/v1/misc/openclaw/v1/execute" \
  -H "Authorization: Bearer $ADSCENE_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "tool": "autonomous_edit",
    "params": {
      "prompt": "Generate 5 viral clips, 15-30 seconds each, focused on the most engaging moments. Add bold captions, vertical reframe, remove silences."
    },
    "project_id": "my-project"
  }' | tee /tmp/result.json | jq -r '.jobId // .activeTasks[0].intent.job_id'

# 2) Poll job status until done
JOB_ID=$(jq -r '.jobId // .activeTasks[0].intent.job_id' /tmp/result.json)
while true; do
  STATUS=$(curl -sS "$ADSCENE_API_URL/api/v1/misc/openclaw/v1/jobs/$JOB_ID" \
    -H "Authorization: Bearer $ADSCENE_API_KEY" | jq -r '.status')
  echo "Status: $STATUS"
  [ "$STATUS" = "completed" ] || [ "$STATUS" = "failed" ] && break
  sleep 5
done

# 3) Fetch the final artifact URL(s)
curl -sS "$ADSCENE_API_URL/api/v1/misc/openclaw/v1/jobs/$JOB_ID" \
  -H "Authorization: Bearer $ADSCENE_API_KEY" | jq '.result'
安全使用建议
Before installing, confirm you trust the Levea/Livecore service with the videos you plan to edit. Use a dedicated API key, preview outputs before publishing, and require plan approval for destructive or high-stakes edits.
功能分析
Type: OpenClaw Skill Name: ai-agentic-video-editor Version: 1.0.17 The skill acts as a wrapper for a remote AI video editing service ('Levea') but exhibits a significant red flag: it directs users to two nearly identical domains—`livecor.ai` for account management and `livecore.ai` for the API endpoint. This pattern is highly characteristic of typosquatting or phishing. Additionally, the support contact is a personal Gmail address (`[email protected]`), which is unusual for a service claiming such advanced capabilities. While the `SKILL.md` and `README.md` contain standard API integration logic using `curl` and include safety features like a plan approval flow, the domain discrepancy makes the destination service untrustworthy.
能力标签
cryptorequires-sensitive-credentials
能力评估
Purpose & Capability
The skill’s broad autonomous editing capabilities, including deleting or changing layers and exporting videos, are consistent with its stated purpose as an AI video editor.
Instruction Scope
The instructions explicitly tell the agent to describe planned mutating edits and request user confirmation, and to use requirePlanApproval for destructive or irreversible workflows.
Install Mechanism
No install script or code package is present; the skill is instruction-only and uses documented curl/jq commands.
Credentials
The required API URL and API key are declared and purpose-aligned for a remote video-editing provider integration.
Persistence & Privilege
The skill requires a provider API key that can affect user video projects, but the artifacts do not show hidden persistence, background execution, or unrelated credential use.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install ai-agentic-video-editor
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /ai-agentic-video-editor 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.17
Narrow translation claim to 30+ Latin-script languages. Caption rendering currently ships Latin-script font coverage only; CJK / Arabic / Devanagari rendering is on the roadmap. Avoids over-promising in the README.
v1.0.16
Surface translation features (previously undocumented): caption translation in 100+ languages with timing preserved, and AI dubbing — auto-read transcript, translate, synthesize voiceover in target language with auto-calibrated speaking rate.
v1.0.15
Expand capability surface to cover all shipped features. New in README: transitions (fade/slide/wipe/iris/zoom/whip pan/cross dissolve), video stabilization, reverse playback, title cards, profanity cleanup, audio crossfade, slideshow generation, semantic background replacement, color preset library, blend modes, video noise reduction, markers (chapters/ads/censorship), and expanded VFX/shape/word-animation lists.
v1.0.14
Object removal is shipped, not roadmap. Promote it from a roadmap line to a supported capability (AI segmentation + temporally consistent video inpainting) in both the prompt-recipe table and the capability list.
v1.0.13
Restore the full-brief example to a fenced text code block — rendering as plain prose blurred it into the surrounding paragraphs and lost the 'this is the prompt' callout effect.
v1.0.12
Brief example renders as paragraph
v1.0.11
Full-brief example shows only the prompt (curl removed)
v1.0.10
Beyond one-liners section promoted to top
v1.0.9
Added goals/briefs section: direct commands, watch-and-propose mode, full multi-step briefs
v1.0.8
Added OCR (caption-safe reframe, on-screen text), AI background removal, background blur, object removal roadmap
v1.0.7
Corrected URLs: API api.livecore.ai, Studio studio.livecor.ai
v1.0.6
Beta disclaimers across README and SKILL.md; agent confirms before mutating edits
v1.0.5
Surface only autonomous_edit; one-tool UX
v1.0.4
Expanded features: 19 prompt examples, 8 use-case verticals; 3-hour video support
v1.0.3
Trimmed README to features-only; removed internal architecture sections
v1.0.2
Tightened IP exposure: redacted specific model names, internal class names, dimensions; brand rename Livecore -> Levea
v1.0.1
Comprehensive README: 4-tier intelligence stack, 6 perception models, agent loop, full capability surface
v1.0.0
Initial release: agentic AI video editor
元数据
Slug ai-agentic-video-editor
版本 1.0.17
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 18
常见问题

AI Agentic Video Editor 是什么?

AI agentic video editor. An autonomous AI editor for video that decomposes natural-language prompts into a planned, verified, executed sequence of edits — vi... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 302 次。

如何安装 AI Agentic Video Editor?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install ai-agentic-video-editor」即可一键安装,无需额外配置。

AI Agentic Video Editor 是免费的吗?

是的,AI Agentic Video Editor 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

AI Agentic Video Editor 支持哪些平台?

AI Agentic Video Editor 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 AI Agentic Video Editor?

由 brajendrak00068(@brajendrak00068)开发并维护,当前版本 v1.0.17。

💬 留言讨论