Description

Production playbook for running parallel AI coding agents (Claude, Codex, Gemini) with automatic model selection via duty table, token-limit auto-fallback, h...

README (SKILL.md)

Epic AI Swarm Orchestration v3.2.1

Name: Epic AI Swarm Orchestration
Author: linkbag

Production system for running parallel AI coding agents with dynamic model selection, automatic token-limit failover, and quality gates.

Prerequisites

Required CLIs (on PATH)

tmux — agent sandboxing (each agent in its own session)
git — worktree creation, branching, commits, push
gh — GitHub CLI (authenticated via gh auth login)
python3 — JSON manipulation (no pip packages)
openclaw — notification delivery (Telegram/other)

Model CLIs (at least one, authenticated)

claude — Anthropic CLI (OAuth or API key)
codex — OpenAI Codex CLI (optional)
gemini — Google Gemini CLI (optional)

Scripts use host-authenticated CLIs — they do not store credentials.

Quick Start

Copy scripts/ to ~/workspace/swarm/
Edit scripts/swarm.conf with notification target
Run scripts/assess-models.sh to initialize the duty table
Read references/workflow.md for the 3-phase workflow
Read references/duty-table.md for model rotation system
Read references/tools.md for spawn commands

v3.2.1 Bookkeeping Fix

This release fixes a critical integration-watcher bookkeeping bug: spawn-batch.sh and queue-watcher.sh now record the actual tmux session emitted by spawn-agent.sh after duty-table/fallback resolution, instead of predicting names from the requested agent. integration-watcher.sh also refuses to treat unknown/misspelled expected sessions as complete.

Architecture Overview

┌─────────────────────────────────────────────────────────┐
│                    DUTY TABLE                           │
│  assess-models.sh → duty-table.json (daily cron)       │
│  architect=claude/opus, builder=codex, reviewer=gemini  │
└───────────┬─────────────────────────────┬───────────────┘
            │                             │
    ┌───────▼───────┐           ┌─────────▼────────┐
    │ spawn-agent.sh│           │ spawn-batch.sh   │
    │ (single task) │           │ (parallel tasks)  │
    └───────┬───────┘           └────────┬─────────┘
            │  Reads role → agent/model  │
            │  from duty-table.json      │
    ┌───────▼────────────────────────────▼───────────┐
    │              RUNNER (in tmux)                    │
    │  On token limit → model-fallback.sh             │
    │  Auto-retry up to 2x with next available model  │
    │  Updates duty table for future spawns            │
    └───────┬─────────────────────────────────────────┘
            │
    ┌───────▼───────────────────────┐
    │  notify-on-complete.sh        │
    │  → auto-spawns reviewer       │
    │  → integration-watcher.sh     │
    │  → ESR + work log persistence │
    └───────────────────────────────┘

Duty Table System

The duty table (duty-table.json) maps roles to agents/models:

Role	Purpose	Default Assignment
architect	Planning, design	Claude Opus (best reasoning)
builder	Implementation	Codex or Claude Sonnet (fast)
reviewer	Code review + fixes	Gemini Flash or Sonnet
integrator	Branch merging	Claude Opus (deep thinking)

Auto-Assessment

assess-models.sh runs daily (or on-demand) to:

Test all models across all 3 vendors (45s timeout each)
Assign optimal 3-vendor spread to roles
If both Codex + Gemini down → fallback to all-Claude table

Mid-Run Token Failover

When an agent hits a token/rate limit during execution:

Runner detects the error pattern in output
Calls model-fallback.sh with the role + failed model
Gets the next available model from the per-role fallback chain
Retries the task (up to 2 attempts)
Updates duty table so future spawns use the working model
Logs the switch to pending-notifications.txt

See references/duty-table.md for full details.

Core Scripts

Script	Purpose
`spawn-agent.sh`	Spawn single agent (resolves role from duty table)
`spawn-batch.sh`	Spawn parallel agents with auto-queuing
`assess-models.sh`	Test models, update duty table
`model-fallback.sh`	Find next available model for a role
`fallback-swap.sh`	Pre-spawn primary/fallback test
`try-model.sh`	Quick model health check
`notify-on-complete.sh`	Watcher: auto-review + integration
`integration-watcher.sh`	Merge all branches after batch
`queue-watcher.sh`	Auto-spawn queued overflow tasks
`pulse-check.sh`	Detect stuck agents, auto-kill
`check-agents.sh`	Monitor all active agents
`endorse-task.sh`	Human endorsement gate
`esr-log.sh`	Engineering Status Report logging
`daily-standup.sh`	Daily status summary
`cleanup.sh`	Remove old worktrees + logs

Workflow

Phase 1: PLAN (Architect)

Read project context, ESR, codebase
Break work into parallel tasks with prompts
Present plan to human → HOLD until endorsed

Phase 2: BUILD + REVIEW (Builder + Reviewer)

spawn-batch.sh deploys agents in tmux + worktrees
Each agent codes autonomously with structured work log
notify-on-complete.sh auto-spawns reviewer (max 3 fix loops)
Token limits trigger automatic model switch mid-run

Phase 3: SHIP (Integrator)

integration-watcher.sh merges all branches sequentially
Conflict resolution, build verification
ESR + work log persisted to project history
Telegram notification with shipped summary

Configuration

swarm.conf:

SWARM_NOTIFY_TARGET="\x3Ctelegram-user-id>"
SWARM_NOTIFY_CHANNEL="telegram"
SWARM_MAX_CONCURRENT=8

Endorsement System

Every task requires human approval before agents spawn:

endorse-task.sh \x3Ctask-id>           # Single task
spawn-batch.sh ... \x3Ctasks.json>     # Batch endorsement (auto per-task)

30-second cooldown between endorsement and spawn prevents accidental double-runs.

Usage Guidance

This package appears to be a legitimate multi-agent orchestration toolkit, but several gaps deserve attention before you run it: - Inspect the scripts locally before running. Pay special attention to spawn-agent.sh, integration-watcher.sh, notify-on-complete.sh, and any file that performs git operations or network calls. Ensure there are no unexpected curl/wget endpoints, accidental credentials printing, or destructive rm commands targeting broad paths. - Expect to give the environment authenticated CLIs (gh, and whichever model CLIs you want to use). Those CLIs will run with whatever access the host account has — including the ability to create branches, commit, and push. If you do not want remote pushes/merges, do not authenticate gh, or run the scripts in a repo without a remote (or a disposable repo). - Configure swarm.conf carefully. The SKILL.md mentions SWARM_NOTIFY_TARGET and SWARM_NOTIFY_CHANNEL; verify openclaw's configuration and where notifications go (Telegram/other) so you do not leak worklogs or secrets to an external channel. - Run assessment scripts in dry-run mode first (assess-models.sh --dry-run) and exercise spawn scripts in a sandboxed project with no remote to observe behavior and logs. - If possible, run under a least-privileged local account and inside a disposable VM/container so merges/pushes cannot affect real repos until you have audited behavior. What would increase my confidence: explicit registry metadata listing required env vars/credentials (GitHub, model API tokens or confirmation that host CLIs must be pre-authenticated), an explicit statement about whether integration-watcher will automatically push/merge to remote or only create local commits/branches, and any DNS/endpoint list used by notification tooling (openclaw). If you want, I can scan specific script files (spawn-agent.sh, integration-watcher.sh, notify-on-complete.sh, assess-models.sh, model-fallback.sh) and summarize exact git and network calls they perform.

Capability Analysis

Type: OpenClaw Skill Name: epic-ai-swarm-orchestration Version: 3.2.1 The bundle implements a complex multi-agent orchestration system that contains significant security vulnerabilities. Key risks include the use of the `--permission-mode bypassPermissions` flag for the Claude CLI in `spawn-agent.sh`, which explicitly disables safety rails for autonomous agents, and the construction of AI prompts from untrusted work log files in `integration-watcher.sh` and `notify-on-complete.sh`, creating a high risk of cross-agent prompt injection. While these capabilities are aligned with the stated purpose of 'autonomous swarms,' the lack of input sanitization and the broad execution permissions make the bundle highly risky for use in sensitive environments.

Capability Tags

cryptorequires-oauth-tokenrequires-sensitive-credentials

Capability Assessment

ℹ Purpose & Capability

The skill name and description (orchestrating parallel AI agents, model rotation, review loops) align with the included scripts (spawn-agent, spawn-batch, model-fallback, integration-watcher, etc.) and docs. Required CLIs listed in SKILL.md (tmux, git, gh, python3, openclaw, model CLIs) are coherent with the stated purpose. However, registry metadata declared no required env vars/credentials while the SKILL.md and scripts clearly expect host-authenticated CLIs and a configured swarm.conf (SWARM_NOTIFY_TARGET, SWARM_NOTIFY_CHANNEL). That discrepancy (manifest vs runtime instructions) is noteworthy.

⚠ Instruction Scope

The runtime instructions and scripts perform broad system operations: create git worktrees/branches, launch tmux sessions, write work logs (in /tmp), update duty-table.json, auto-merge branches (integration-watcher), and persist logs/docs into the repository. They also scan /tmp for blocker files and maintain queue files. These behaviors are consistent with orchestration, but they mean the skill will modify your repository and local filesystem and (if gh is authenticated) could push/merge changes remotely. The SKILL.md does not clearly limit what repos or remotes are used, nor does it enumerate the exact git operations (push/pr creation policy). Notification delivery uses 'openclaw' (Telegram/other) which will transmit session summaries off-host. All of this grants the skill broad I/O and network side-effects that are not fully declared in the registry metadata.

✓ Install Mechanism

There is no install spec — the skill is instruction + included scripts. No external archive downloads or package installs are requested by the registry entry. This reduces supply-chain risk from the registry itself, but the included scripts will be copied to disk when the user follows the Quick Start steps (the SKILL.md instructs copying scripts/ to ~/workspace/swarm).

⚠ Credentials

The registry lists no required environment variables or primary credential, but SKILL.md and scripts rely on authenticated CLIs (gh, model CLIs) and a swarm.conf with SWARM_NOTIFY_TARGET / SWARM_NOTIFY_CHANNEL and SWARM_MAX_CONCURRENT. The scripts will operate using host-authenticated credentials for GitHub and model vendor CLIs; these effectively grant repo write/push capability and access to model APIs. The skill does not declare these credential requirements in the metadata, making the credential scope under-specified and raising a proportionality concern.

ℹ Persistence & Privilege

always is false (ok). The skill will run long-lived background helpers (queue-watcher starts integration-watcher) and will write persistent state in the project (duty-table.json, docs/history, pending-notifications.txt, queue files), and manipulate git branches/commits. It does not declare modifications to other skills or system-wide settings. Because autonomous invocation is allowed by default on the platform, the fact that the skill can be invoked by an agent combined with its ability to modify repos and send notifications increases blast radius — recommend limiting use to isolated/sandboxed repos unless you trust it.

Version History

v3.2.1

Fix integration watcher bookkeeping: record actual tmux session names after duty-table/fallback resolution, include deepseek in queue active counts, and prevent unknown expected sessions from being treated as complete.

v3.2.0

**Major update with expanded documentation and improved batch/task management.** - Added detailed documentation: ESR, decision logs, role/leadership definitions, and integration history. - Introduced role-specific docs and duty table template for agent assignment. - Enhanced batch-task workflow: explicit endorsement, spawn, monitoring, and integration merging procedures. - Expanded setup and dependency instructions, including platform-specific (macOS) notes. - New supporting scripts for task queueing, deployment notifications, and work history logging. - Enforced hard rules: mandatory task endorsement, batch spawning protocol, and strict tmux notification handling.

v3.1.1

v3.1.1: Full audit — spawn-batch + queue-watcher now duty-table driven, role-based task JSON, dynamic model assignment across all scripts, auto-fallback on token limits, daily cron assessment

v3.1.0

v3.1.0 — Endorsement cooldown: spawn-agent.sh now enforces 30s minimum between endorsement and spawn, preventing orchestrator from endorsing+spawning in the same turn without human approval. spawn-batch.sh uses --batch flag to legitimately bypass cooldown for pre-approved batches.

v3.0.1

v3.0.1 — Scan report fixes: declared CLI dependencies (tmux/git/gh/python3/openclaw + model CLIs) in metadata, documented credential usage and auto-merge policy, clarified endorsement flow (spawn-batch creates files AFTER human verbal approval, not bypassing it), added Testing Safely section, added --dry-run note for cleanup.sh. Zero script logic changes.

v3.0.0

v3.0 — Full swarm system with structured handoffs, task state machine, inbox queue, escalation protocol, decision logging, maxConcurrent auto-queue, daily standup, and cleanup. Battle-tested across 300+ agent runs.

Metadata

Slug epic-ai-swarm-orchestration

Version 3.2.1

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 6

Frequently Asked Questions

What is Epic AI Swarm Orchestration?

Production playbook for running parallel AI coding agents (Claude, Codex, Gemini) with automatic model selection via duty table, token-limit auto-fallback, h... It is an AI Agent Skill for Claude Code / OpenClaw, with 179 downloads so far.

How do I install Epic AI Swarm Orchestration?

Run "/install epic-ai-swarm-orchestration" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is Epic AI Swarm Orchestration free?

Yes, Epic AI Swarm Orchestration is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does Epic AI Swarm Orchestration support?

Epic AI Swarm Orchestration is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created Epic AI Swarm Orchestration?

It is built and maintained by linkbag (@linkbag); the current version is v3.2.1.

More Skills

Epic AI Swarm Orchestration