← 返回 Skills 市场
dodge1218

Batch Executor

作者 KairoKid · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
100
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install batch-executor
功能描述
Full batch processor for corpus-scale task execution. Handles Google Drive dumps, ChatGPT exports, Apple Notes, or any large collection of mixed content (ide...
使用说明 (SKILL.md)

Batch Executor

Corpus-scale processing: classify → prioritize → spawn → checkpoint → reconcile.

Unlike task-extractor (for 3-12 inline tasks) or batch-cognition (for idea analysis), this skill EXECUTES at scale with sub-agent parallelism.

When to Use

  • Google Drive folder dump (mixed docs, notes, spreadsheets)
  • ChatGPT conversation export (3K+ prompts)
  • Apple Notes dump (years of ideas)
  • Any input > 20 items or > 10K tokens of raw content
  • File-based input (not inline chat messages — use task-extractor for those)

Architecture

INPUT (any scale)
    ↓
PHASE 1: INGEST — save raw to disk, never lose
    ↓
PHASE 2: CLASSIFY — type each item, estimate effort
    ↓
PHASE 3: TRIAGE — score by value, group by dependency
    ↓
PHASE 4: EXECUTE — spawn sub-agents (max 3 concurrent)
    ↓
PHASE 5: RECONCILE — verify completions, retry failures
    ↓
PHASE 6: REPORT — value stack, patterns, action items

Phase 1: INGEST

Save ALL raw input to systems/batch-executor/corpus/YYYY-MM-DD-SOURCE.md BEFORE any processing.

For file inputs:

  • PDF → extract text via pdf tool
  • CSV/JSON → parse, one item per row/object
  • Markdown → split on ## headers or --- separators
  • ChatGPT export → parse conversations.json, group by chain_id
  • Google Drive → process each file, flatten into items

Create the manifest:

# Corpus Manifest: [source] [date]
# Total items: [N]
# Raw file: [path]
# Status: INGESTED

| # | First 80 chars | Type | Effort | Status |
|---|---------------|------|--------|--------|
| 1 | ... | ? | ? | INGESTED |

Phase 2: CLASSIFY

For each item, assign:

Type Description Action
TASK Has a clear action verb + deliverable EXECUTE
IDEA Speculative, "what if", product concept SCORE (ICE)
REFERENCE Link, citation, spec, documentation CATALOG
DECISION "We decided X", "going with Y" RECORD
HALF_THOUGHT Fragment, incomplete, trails off COMPLETE then re-classify
MODEL_OUTPUT AI-generated, assistant voice EXTRACT core idea, discard wrapper
DUPLICATE Same as item #X MERGE
NOISE Test, filler, meta-commentary SKIP

Effort per item:

  • TRIVIAL (\x3C 1 min): file rename, note capture, config change
  • QUICK (1-5 min): web search, small edit, API call
  • MEDIUM (5-30 min): build a page, write a doc, research topic
  • HEAVY (30+ min): full app build, deep research, multi-step workflow
  • BLOCKED: needs human input, credentials, or external dependency

Update manifest with Type + Effort columns.

Phase 3: TRIAGE

Score each TASK and IDEA using quick ICE:

  • I (Impact): 1-5 — how much does this move the needle?
  • C (Cost): 1-5 — how cheap/fast to do? (inverted: 5 = trivial)
  • E (Exploit): 1-5 — how quickly does this produce value?
  • Score = I × C × E (max 125)

Sort by score descending. Group by dependency chains.

Create execution plan:

# Execution Plan

## Wave 1 (parallel, no dependencies)
- Item #14 (ICE: 100) — HEAVY → sub-agent
- Item #3 (ICE: 80) — MEDIUM → sub-agent
- Item #7 (ICE: 75) — QUICK → inline

## Wave 2 (depends on Wave 1)
- Item #9 (depends on #14) — MEDIUM → after #14 completes

## Skip (NOISE/DUPLICATE)
- Items #2, #5, #11 — reason: [...]

## Blocked (needs human)
- Item #8 — needs API key from Ryan

Phase 4: EXECUTE

Rules:

  1. Max 3 sub-agents concurrent. Wait for one to complete before spawning another.
  2. QUICK items: execute inline (no sub-agent overhead for \x3C 5 min tasks).
  3. MEDIUM/HEAVY items: spawn sub-agent with clear task description + acceptance criteria.
  4. Each sub-agent gets: the item content, relevant context from other items, and the target artifact path.
  5. Track in manifest: status → EXECUTING, then ✅ DONE / ❌ FAILED / ⚠️ PARTIAL.

Sub-agent spawn template:

Task: [item summary]
Context: [relevant items from this corpus]
Deliverable: [specific file/artifact expected]
Acceptance: [how to verify it's done]
Workspace: [path]

Checkpoint every 5 completed items:

  • Update manifest
  • Report to user: "[X]/[N] done. [Y] in progress. Top findings so far: [...]"
  • If user is idle (no response in 30s), continue
  • Commit progress to git

Phase 5: RECONCILE

After all waves complete (or all sub-agents return):

  1. Re-read manifest
  2. For each ❌ FAILED: log reason, decide retry or escalate
  3. For each 🔄 sub-agent still running: check status, kill if stale (> 30 min no progress)
  4. For each ⚠️ PARTIAL: note what's left
  5. Retry failed items once (different approach if possible)

Phase 6: REPORT

Generate final report at systems/batch-executor/reports/YYYY-MM-DD-SOURCE-report.md:

# Corpus Report: [source]
# Processed: [date]
# Total: [N] items
# Results: [done] ✅ | [failed] ❌ | [partial] ⚠️ | [skipped] ⏭️ | [blocked] 🔒

## Value Stack (top items by impact)
1. [item] — [outcome] — [next step]
2. ...

## Patterns Discovered
- [theme or connection across items]

## Action Items (immediate)
- [ ] [task from corpus that needs follow-up]

## Parked (valuable but not now)
- [item] — reason: [why later]

## Blocked (needs human)
- [item] — needs: [what]

## Statistics
- Items by type: TASK [x], IDEA [x], REFERENCE [x], NOISE [x]
- Items by effort: TRIVIAL [x], QUICK [x], MEDIUM [x], HEAVY [x]
- Sub-agents spawned: [x]
- Total execution time: [x min]

Append to systems/batch-cognition/value-stack.md (shared with batch-cognition skill). Log learnings to .learnings/LEARNINGS.md.

Commands

status — show manifest progress pause — stop spawning, let running agents finish resume — continue from where we left off (re-read manifest) skip [#] — skip item number retry [#] — retry failed item block [#] [reason] — mark as blocked priority [#] — move item to top of queue done — trigger report even if items remain

Key Rules

  1. INGEST FIRST. Raw content hits disk before ANY processing.
  2. Max 3 concurrent sub-agents. More = chaos, dropped results, context confusion.
  3. Checkpoint every 5. Git commit progress. User update.
  4. Never mark ✅ without artifact evidence. File exists, build passes, URL responds.
  5. NOISE is not failure. Skipping noise is correct behavior. Report it transparently.
  6. Corpus items cross-reference. Item #14 may be context for item #27. Pass relevant context to sub-agents.
  7. Resume is first-class. If session dies, resume re-reads manifest and continues from last checkpoint.
  8. ICE scoring is fast. 30 seconds per item max. Don't overthink triage — execute.

Integration with Other Skills

  • task-extractor: For inline chat messages (3-12 items). Batch-executor is for file/corpus scale (20+).
  • batch-cognition: For idea analysis (THINK-heavy). Batch-executor is for execution (PLAY-heavy).
  • orchestrator: Batch-executor can be invoked BY the orchestrator when it detects a corpus dump.
  • recorder: After batch-executor completes, route to recorder to update STATUS.md.
安全使用建议
This skill appears to do what it says (bulk corpus processing), but review these issues before installing or running it: 1) It will write raw inputs to disk (systems/batch-executor/...) and commit progress to git — if your agent has a configured git remote this could leak data. Run it in an isolated workspace or ensure no remote git pushes are possible. 2) The SKILL.md references external tools (a `pdf` extractor) and expects to parse Drive/ChatGPT exports but declares no credentials — confirm where Google/Drive/ChatGPT access will come from and never supply credentials unless you trust the runtime. 3) It appends to systems/batch-cognition/value-stack.md (another skill's file) — if you use batch-cognition, expect side-effects; consider sandboxing or editing that line. 4) The skill spawns sub-agents and will pass full item content and related context to them — ensure sub-agents are allowed to see that data. 5) If you want safer use: run on a copy of your corpus in an isolated container, disable autonomous invocation or lower concurrency, verify available binaries (pdf, git) and their behavior, and confirm no external remotes or credentials are accessible. If anything is unclear, ask the skill author for explicit declarations of required tools, expected file paths, and credential handling before use.
能力评估
Purpose & Capability
The SKILL.md describes a corpus-scale executor and the steps it will take (ingest, classify, triage, spawn sub-agents, checkpoint, report). That high-level purpose aligns with the instructions. However the instructions reference external tools and operations (e.g., a `pdf` tool, parsing Google Drive exports, committing to git, and writing into systems/batch-cognition/value-stack.md) that are not declared in the registry metadata (no required binaries, env vars, or config paths). The absence of declared requirements is a mismatch but could be explained by assuming a pre-provisioned agent environment.
Instruction Scope
The instructions direct the agent to save ALL raw input to disk under systems/batch-executor, parse many file types, spawn sub-agents and pass them corpus content and related context, checkpoint progress and commit to git every N items, and append results to a shared file used by another skill (systems/batch-cognition/value-stack.md). Writing raw inputs to disk and sharing item content across spawned sub-agents is expected for batch processing, but: (1) committing progress to git could leak data if Git remotes are configured; (2) appending to a path owned by another skill is cross-skill modification and could overwrite or leak aggregated data; (3) the instructions reference a `pdf` tool and killing/stopping sub-agents without declaring what runtime has permissions to do those operations. The instruction 'If user is idle (no response in 30s), continue' grants the skill substantial autonomy to proceed without explicit human confirmation.
Install Mechanism
This is an instruction-only skill with no install spec and no code files. That minimizes install-time risk because nothing is downloaded or written by an installer. The runtime risks come from the actions the instructions direct the agent to perform, not from an installer.
Credentials
The skill declares no required environment variables or credentials, yet it expects to process Google Drive dumps and potentially interact with git. To legitimately handle Google Drive programmatically you'd normally expect Google API credentials; for git commits/pushes you'd expect git config/credentials. The skill's silence on credentials is a proportionality concern: either it requires the agent runtime to already provide these capabilities (not explicit), or it will prompt for them at runtime (also not described). Additionally, the practice of giving sub-agents full item content and 'relevant context from other items' can cause broad exposure of sensitive data across tasks.
Persistence & Privilege
The skill is not force-enabled (always: false) and allows autonomous invocation by default (platform normal). The concerning element is file and cross-skill persistence: it writes raw inputs and reports to systems/batch-executor paths and explicitly appends to systems/batch-cognition/value-stack.md (a file outside its own namespace). That level of write access to shared skill files increases blast radius and could lead to data being mixed with other skill state.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install batch-executor
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /batch-executor 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
First release. Corpus-scale task execution with sub-agent waves, ICE triage, and checkpointing. Handles Google Drive, ChatGPT exports, Apple Notes dumps.
元数据
Slug batch-executor
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Batch Executor 是什么?

Full batch processor for corpus-scale task execution. Handles Google Drive dumps, ChatGPT exports, Apple Notes, or any large collection of mixed content (ide... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 100 次。

如何安装 Batch Executor?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install batch-executor」即可一键安装,无需额外配置。

Batch Executor 是免费的吗?

是的,Batch Executor 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Batch Executor 支持哪些平台?

Batch Executor 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Batch Executor?

由 KairoKid(@dodge1218)开发并维护,当前版本 v1.0.0。

💬 留言讨论