Description

Full batch processor for corpus-scale task execution. Handles Google Drive dumps, ChatGPT exports, Apple Notes, or any large collection of mixed content (ide...

README (SKILL.md)

Batch Executor

Name: Batch Executor
Author: dodge1218

Corpus-scale processing: classify → prioritize → spawn → checkpoint → reconcile.

Unlike task-extractor (for 3-12 inline tasks) or batch-cognition (for idea analysis), this skill EXECUTES at scale with sub-agent parallelism.

When to Use

Google Drive folder dump (mixed docs, notes, spreadsheets)
ChatGPT conversation export (3K+ prompts)
Apple Notes dump (years of ideas)
Any input > 20 items or > 10K tokens of raw content
File-based input (not inline chat messages — use task-extractor for those)

Architecture

INPUT (any scale)
    ↓
PHASE 1: INGEST — save raw to disk, never lose
    ↓
PHASE 2: CLASSIFY — type each item, estimate effort
    ↓
PHASE 3: TRIAGE — score by value, group by dependency
    ↓
PHASE 4: EXECUTE — spawn sub-agents (max 3 concurrent)
    ↓
PHASE 5: RECONCILE — verify completions, retry failures
    ↓
PHASE 6: REPORT — value stack, patterns, action items

Phase 1: INGEST

Save ALL raw input to systems/batch-executor/corpus/YYYY-MM-DD-SOURCE.md BEFORE any processing.

For file inputs:

PDF → extract text via pdf tool
CSV/JSON → parse, one item per row/object
Markdown → split on ## headers or --- separators
ChatGPT export → parse conversations.json, group by chain_id
Google Drive → process each file, flatten into items

Create the manifest:

# Corpus Manifest: [source] [date]
# Total items: [N]
# Raw file: [path]
# Status: INGESTED

| # | First 80 chars | Type | Effort | Status |
|---|---------------|------|--------|--------|
| 1 | ... | ? | ? | INGESTED |

Phase 2: CLASSIFY

For each item, assign:

Type	Description	Action
TASK	Has a clear action verb + deliverable	EXECUTE
IDEA	Speculative, "what if", product concept	SCORE (ICE)
REFERENCE	Link, citation, spec, documentation	CATALOG
DECISION	"We decided X", "going with Y"	RECORD
HALF_THOUGHT	Fragment, incomplete, trails off	COMPLETE then re-classify
MODEL_OUTPUT	AI-generated, assistant voice	EXTRACT core idea, discard wrapper
DUPLICATE	Same as item #X	MERGE
NOISE	Test, filler, meta-commentary	SKIP

Effort per item:

TRIVIAL (\x3C 1 min): file rename, note capture, config change
QUICK (1-5 min): web search, small edit, API call
MEDIUM (5-30 min): build a page, write a doc, research topic
HEAVY (30+ min): full app build, deep research, multi-step workflow
BLOCKED: needs human input, credentials, or external dependency

Update manifest with Type + Effort columns.

Phase 3: TRIAGE

Score each TASK and IDEA using quick ICE:

I (Impact): 1-5 — how much does this move the needle?
C (Cost): 1-5 — how cheap/fast to do? (inverted: 5 = trivial)
E (Exploit): 1-5 — how quickly does this produce value?
Score = I × C × E (max 125)

Sort by score descending. Group by dependency chains.

Create execution plan:

# Execution Plan

## Wave 1 (parallel, no dependencies)
- Item #14 (ICE: 100) — HEAVY → sub-agent
- Item #3 (ICE: 80) — MEDIUM → sub-agent
- Item #7 (ICE: 75) — QUICK → inline

## Wave 2 (depends on Wave 1)
- Item #9 (depends on #14) — MEDIUM → after #14 completes

## Skip (NOISE/DUPLICATE)
- Items #2, #5, #11 — reason: [...]

## Blocked (needs human)
- Item #8 — needs API key from Ryan

Phase 4: EXECUTE

Rules:

Max 3 sub-agents concurrent. Wait for one to complete before spawning another.
QUICK items: execute inline (no sub-agent overhead for \x3C 5 min tasks).
MEDIUM/HEAVY items: spawn sub-agent with clear task description + acceptance criteria.
Each sub-agent gets: the item content, relevant context from other items, and the target artifact path.
Track in manifest: status → EXECUTING, then ✅ DONE / ❌ FAILED / ⚠️ PARTIAL.

Sub-agent spawn template:

Task: [item summary]
Context: [relevant items from this corpus]
Deliverable: [specific file/artifact expected]
Acceptance: [how to verify it's done]
Workspace: [path]

Checkpoint every 5 completed items:

Update manifest
Report to user: "[X]/[N] done. [Y] in progress. Top findings so far: [...]"
If user is idle (no response in 30s), continue
Commit progress to git

Phase 5: RECONCILE

After all waves complete (or all sub-agents return):

Re-read manifest
For each ❌ FAILED: log reason, decide retry or escalate
For each 🔄 sub-agent still running: check status, kill if stale (> 30 min no progress)
For each ⚠️ PARTIAL: note what's left
Retry failed items once (different approach if possible)

Phase 6: REPORT

Generate final report at systems/batch-executor/reports/YYYY-MM-DD-SOURCE-report.md:

# Corpus Report: [source]
# Processed: [date]
# Total: [N] items
# Results: [done] ✅ | [failed] ❌ | [partial] ⚠️ | [skipped] ⏭️ | [blocked] 🔒

## Value Stack (top items by impact)
1. [item] — [outcome] — [next step]
2. ...

## Patterns Discovered
- [theme or connection across items]

## Action Items (immediate)
- [ ] [task from corpus that needs follow-up]

## Parked (valuable but not now)
- [item] — reason: [why later]

## Blocked (needs human)
- [item] — needs: [what]

## Statistics
- Items by type: TASK [x], IDEA [x], REFERENCE [x], NOISE [x]
- Items by effort: TRIVIAL [x], QUICK [x], MEDIUM [x], HEAVY [x]
- Sub-agents spawned: [x]
- Total execution time: [x min]

Append to systems/batch-cognition/value-stack.md (shared with batch-cognition skill). Log learnings to .learnings/LEARNINGS.md.

Commands

status — show manifest progress pause — stop spawning, let running agents finish resume — continue from where we left off (re-read manifest) skip [#] — skip item number retry [#] — retry failed item block [#] [reason] — mark as blocked priority [#] — move item to top of queue done — trigger report even if items remain

Key Rules

INGEST FIRST. Raw content hits disk before ANY processing.
Max 3 concurrent sub-agents. More = chaos, dropped results, context confusion.
Checkpoint every 5. Git commit progress. User update.
Never mark ✅ without artifact evidence. File exists, build passes, URL responds.
NOISE is not failure. Skipping noise is correct behavior. Report it transparently.
Corpus items cross-reference. Item #14 may be context for item #27. Pass relevant context to sub-agents.
Resume is first-class. If session dies, resume re-reads manifest and continues from last checkpoint.
ICE scoring is fast. 30 seconds per item max. Don't overthink triage — execute.

Integration with Other Skills

task-extractor: For inline chat messages (3-12 items). Batch-executor is for file/corpus scale (20+).
batch-cognition: For idea analysis (THINK-heavy). Batch-executor is for execution (PLAY-heavy).
orchestrator: Batch-executor can be invoked BY the orchestrator when it detects a corpus dump.
recorder: After batch-executor completes, route to recorder to update STATUS.md.

Usage Guidance

This skill appears to do what it says (bulk corpus processing), but review these issues before installing or running it: 1) It will write raw inputs to disk (systems/batch-executor/...) and commit progress to git — if your agent has a configured git remote this could leak data. Run it in an isolated workspace or ensure no remote git pushes are possible. 2) The SKILL.md references external tools (a `pdf` extractor) and expects to parse Drive/ChatGPT exports but declares no credentials — confirm where Google/Drive/ChatGPT access will come from and never supply credentials unless you trust the runtime. 3) It appends to systems/batch-cognition/value-stack.md (another skill's file) — if you use batch-cognition, expect side-effects; consider sandboxing or editing that line. 4) The skill spawns sub-agents and will pass full item content and related context to them — ensure sub-agents are allowed to see that data. 5) If you want safer use: run on a copy of your corpus in an isolated container, disable autonomous invocation or lower concurrency, verify available binaries (pdf, git) and their behavior, and confirm no external remotes or credentials are accessible. If anything is unclear, ask the skill author for explicit declarations of required tools, expected file paths, and credential handling before use.

Capability Assessment

ℹ Purpose & Capability

The SKILL.md describes a corpus-scale executor and the steps it will take (ingest, classify, triage, spawn sub-agents, checkpoint, report). That high-level purpose aligns with the instructions. However the instructions reference external tools and operations (e.g., a `pdf` tool, parsing Google Drive exports, committing to git, and writing into systems/batch-cognition/value-stack.md) that are not declared in the registry metadata (no required binaries, env vars, or config paths). The absence of declared requirements is a mismatch but could be explained by assuming a pre-provisioned agent environment.

⚠ Instruction Scope

The instructions direct the agent to save ALL raw input to disk under systems/batch-executor, parse many file types, spawn sub-agents and pass them corpus content and related context, checkpoint progress and commit to git every N items, and append results to a shared file used by another skill (systems/batch-cognition/value-stack.md). Writing raw inputs to disk and sharing item content across spawned sub-agents is expected for batch processing, but: (1) committing progress to git could leak data if Git remotes are configured; (2) appending to a path owned by another skill is cross-skill modification and could overwrite or leak aggregated data; (3) the instructions reference a `pdf` tool and killing/stopping sub-agents without declaring what runtime has permissions to do those operations. The instruction 'If user is idle (no response in 30s), continue' grants the skill substantial autonomy to proceed without explicit human confirmation.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no code files. That minimizes install-time risk because nothing is downloaded or written by an installer. The runtime risks come from the actions the instructions direct the agent to perform, not from an installer.

⚠ Credentials

The skill declares no required environment variables or credentials, yet it expects to process Google Drive dumps and potentially interact with git. To legitimately handle Google Drive programmatically you'd normally expect Google API credentials; for git commits/pushes you'd expect git config/credentials. The skill's silence on credentials is a proportionality concern: either it requires the agent runtime to already provide these capabilities (not explicit), or it will prompt for them at runtime (also not described). Additionally, the practice of giving sub-agents full item content and 'relevant context from other items' can cause broad exposure of sensitive data across tasks.

ℹ Persistence & Privilege

The skill is not force-enabled (always: false) and allows autonomous invocation by default (platform normal). The concerning element is file and cross-skill persistence: it writes raw inputs and reports to systems/batch-executor paths and explicitly appends to systems/batch-cognition/value-stack.md (a file outside its own namespace). That level of write access to shared skill files increases blast radius and could lead to data being mixed with other skill state.

Version History

v1.0.0

First release. Corpus-scale task execution with sub-agent waves, ICE triage, and checkpointing. Handles Google Drive, ChatGPT exports, Apple Notes dumps.

Metadata

Slug batch-executor

Version 1.0.0

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 1

Frequently Asked Questions

What is Batch Executor?

Full batch processor for corpus-scale task execution. Handles Google Drive dumps, ChatGPT exports, Apple Notes, or any large collection of mixed content (ide... It is an AI Agent Skill for Claude Code / OpenClaw, with 100 downloads so far.

How do I install Batch Executor?

Run "/install batch-executor" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Batch Executor free?

Yes, Batch Executor is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Batch Executor support?

Batch Executor is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Batch Executor?

It is built and maintained by KairoKid (@dodge1218); the current version is v1.0.0.

More Skills

Batch Executor

Batch Executor

When to Use

Architecture

Phase 1: INGEST

Phase 2: CLASSIFY

Phase 3: TRIAGE

Phase 4: EXECUTE

Phase 5: RECONCILE

Phase 6: REPORT

Commands

Key Rules

Integration with Other Skills

What is Batch Executor?

How do I install Batch Executor?

Is Batch Executor free?

Which platforms does Batch Executor support?

Who created Batch Executor?

💬 Comments