Battle-Tested Agent
/install battle-tested-agent
19 production-hardened patterns for AI agents. Every one earned from failure.
Use this skill when you are:
- hardening an agent that will run repeatedly or autonomously
- tightening memory, verification, or anti-hallucination behavior
- reducing compaction failures, weak handoffs, or orchestration drift
- reviewing an agent workspace for missing production patterns
- debugging why an agent keeps losing context, guessing, or dropping work
Do not use this skill for:
- persona writing or onboarding polish
- one-off prompt tweaks with no reusable pattern behind them
- adding new tools, servers, or runtime capabilities
- turning a simple workspace into process theater
Default workflow
- Audit first: run bash scripts/audit.sh <workspace> to see which patterns are present. The script checks for all 19 patterns and tells you what to fix first.
- Start with the smallest tier that fits: implement starter patterns first, then intermediate, then advanced. Do not cargo-cult every pattern into every agent.
- Patch the actual failure mode: change the mechanism, not just the wording. "ALWAYS check X" is not a fix; a verification gate is a fix.
- Keep patterns lightweight: add only the pieces that materially reduce failures or operator burden.
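To make the difference between a wording change and a mechanism change concrete, here is a minimal sketch of a verification gate in bash. It is an illustration under assumptions, not code shipped with this skill: the function name verify_gate and its two-argument interface are hypothetical. The idea is that an action only counts as done once a separate check confirms the resulting state.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this skill): run an action, then confirm
# the resulting state with a separate check before reporting success.
# Usage: verify_gate "<action command>" "<check command>"
verify_gate() {
  local action="$1" check="$2"
  if ! bash -c "$action"; then
    echo "FAILED: action exited non-zero" >&2
    return 1
  fi
  if ! bash -c "$check"; then
    echo "UNVERIFIED: action ran but the check did not pass" >&2
    return 2
  fi
  echo "VERIFIED"
}

# Example: writing a file only counts as done once the file is non-empty.
tmp="$(mktemp -d)"
verify_gate "echo note > '$tmp/out.txt'" "test -s '$tmp/out.txt'"
rm -rf "$tmp"
```

The distinct return codes let a caller tell "the action failed" apart from "the action ran but could not be verified", which is the case a prompt-only rule tends to miss.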
Pattern tiers
- Starter (5): baseline reliability for almost every agent
- Intermediate (5): daily-driver patterns for briefs, heartbeats, and recurring work
- Advanced (6): multi-agent orchestration, handoffs, and self-improvement discipline
Pattern clusters
Some patterns reinforce each other naturally. Adopt them together when the failure mode calls for it:
- Trust chain: WAL Protocol + Anti-Hallucination + Agent Verification — ensures data is captured, sourced, and measured before reporting
- Handoff loop: Delegation Rules + Completion Contract + Acceptance Gate + Task State Tracking — prevents work from disappearing between agents or being certified without proof
- Survival kit: Working Buffer + Compaction Injection Hardening + Silent Worker Recovery — keeps context alive across long sessions and prevents silent delegated drift
- Quality gate: QA Gates + Verify Implementation + Decision Logs — ensures output quality and traceable reasoning
- Delegation hardening: Brief Quality Gate + Scoped Verifier Gate — keeps delegation tight without turning the whole system into bureaucracy
When patterns conflict
If two patterns seem to give contradictory advice:
- Safety patterns win over speed patterns. Ambiguity Gate overrides Simple Path First when the request is ambiguous. Verify before acting, even if the simple path is obvious.
- Evidence patterns win over action patterns. Anti-Hallucination overrides "just try it" when reporting data. Never guess a number to move faster.
Assets — how to use them
The assets/ folder contains starter files you copy into your workspace and customize.
They are templates, not drop-in replacements.
# Merge delegation and decision log rules into your existing AGENTS.md
cp assets/AGENTS-additions.md ~/workspace/ # Review, then merge
# Add QA gates
cp assets/QA-gates.md ~/workspace/QA.md
# Set up self-improvement tracking
mkdir -p ~/workspace/.learnings
cp assets/learnings-template.md ~/workspace/.learnings/LEARNINGS.md
cp assets/errors-template.md ~/workspace/.learnings/ERRORS.md
cp assets/features-template.md ~/workspace/.learnings/FEATURE_REQUESTS.md
Read references/audit-usage.md for the full rollout order and bootstrap workflow.
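Because the assets are templates rather than drop-in replacements, a plain cp can overwrite a file you have already customized. One way to guard against that is a copy helper that skips existing destinations. The sketch below is an assumption, not part of the skill; install_template is a hypothetical name.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of this skill): copy a template only when
# the destination does not already exist, so a customized workspace file
# is never overwritten.
install_template() {
  local src="$1" dest="$2"
  if [ -e "$dest" ]; then
    echo "skip: $dest exists; merge $src by hand"
    return 0
  fi
  mkdir -p "$(dirname "$dest")"
  cp "$src" "$dest"
  echo "copied: $src -> $dest"
}

# Example call, using paths from the commands above:
#   install_template assets/QA-gates.md ~/workspace/QA.md
```

When the destination already exists, the helper leaves it untouched and prints a reminder to merge by hand, which matches the "review, then merge" intent of the copy commands.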
References
- references/starter-patterns.md: WAL, anti-hallucination, ambiguity, simple-path-first, unblock-before-shelve
- references/intermediate-patterns.md: verification, working buffer, QA gates, decision logs, verify implementation
- references/advanced-patterns.md: delegation, brief quality, proof-based handoffs, acceptance gates, orchestration, stale-worker recovery, compaction hardening, recurrence tracking
- references/audit-usage.md: audit script usage, install/copy snippets, and expected outcomes
Included scripts
- scripts/audit.sh: workspace audit for all 19 patterns (supports AGENTS.md, CLAUDE.md, SOUL.md, and system.md)
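For a sense of what a single audit check might look like, the sketch below searches the four supported instruction files for a keyword. It is illustrative only and does not reproduce scripts/audit.sh: the function name has_pattern and the keyword-matching approach are assumptions, and the real script may detect patterns quite differently.

```shell
#!/usr/bin/env bash
# Illustrative only: not the contents of scripts/audit.sh.
# Reports whether a keyword appears in any of the instruction files the
# audit script is described as supporting. Matching is case-insensitive
# and treats the keyword as a fixed string, not a regular expression.
has_pattern() {
  local workspace="$1" keyword="$2" f
  for f in AGENTS.md CLAUDE.md SOUL.md system.md; do
    if [ -f "$workspace/$f" ] && grep -qiF -- "$keyword" "$workspace/$f"; then
      echo "present: $keyword ($f)"
      return 0
    fi
  done
  echo "missing: $keyword"
  return 1
}

# Example call (hypothetical workspace path and keyword):
#   has_pattern ~/workspace "Decision Logs"
```

A keyword match only shows that a pattern is mentioned, not that it is enforced, so a result like this is best read as a prompt for manual review.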
Rules of thumb
- Audit before expanding
- Prefer progressive disclosure over giant core files
- Silence is better than hallucination
- Ambiguity is a stop sign, not permission
- The orchestrator should preserve oversight, not sink into implementation
- Mechanism changes beat wording changes
- After acting, verify the new state before declaring success
- Partial progress is not success; recovery steps matter as much as first-attempt steps
Outcome
A leaner, more resilient agent that survives compaction, hands work off cleanly, reports only what is verified, and improves without spiraling into bureaucracy.
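Installation
- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install battle-tested-agent
- Once installation finishes, invoke the Skill by name or trigger it with /battle-tested-agent
- Provide the inputs described in the Skill's parameter notes to get structured output.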
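What is Battle-Tested Agent?
19 production-hardened patterns for AI agents: memory, verification, ambiguity handling, compaction survival, delegation, proof-based handoffs, stale-worker... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 551 cumulative downloads to date.
How do I install Battle-Tested Agent?
Run the command "/install battle-tested-agent" in the OpenClaw or Claude Code chat box for a one-step install; no additional configuration is required.
Is Battle-Tested Agent free?
Yes. Battle-Tested Agent is completely free and released under the MIT-0 license; you may download, install, and use it freely.
Which platforms does Battle-Tested Agent support?
Battle-Tested Agent is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who develops Battle-Tested Agent?
It is developed and maintained by Don Zurbrick (@zurbrick); the current version is v1.5.0.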