← 返回 Skills 市场
zurbrick

Battle-Tested Agent

作者 Don Zurbrick · GitHub ↗ · v1.5.0 · MIT-0
cross-platform ✓ 安全检测通过
551
总下载
0
收藏
4
当前安装
10
版本数
在 OpenClaw 中安装
/install battle-tested-agent
功能描述
19 production-hardened patterns for AI agents — memory, verification, ambiguity handling, compaction survival, delegation, proof-based handoffs, stale-worker...
使用说明 (SKILL.md)

Battle-Tested Agent

19 production-hardened patterns for AI agents. Every one earned from failure.

Use this skill when you are:

  • hardening an agent that will run repeatedly or autonomously
  • tightening memory, verification, or anti-hallucination behavior
  • reducing compaction failures, weak handoffs, or orchestration drift
  • reviewing an agent workspace for missing production patterns
  • debugging why an agent keeps losing context, guessing, or dropping work

Do not use this skill for:

  • persona writing or onboarding polish
  • one-off prompt tweaks with no reusable pattern behind them
  • adding new tools, servers, or runtime capabilities
  • turning a simple workspace into process theater

Default workflow

  1. Audit first Run bash scripts/audit.sh \x3Cworkspace> to see which patterns are present. The script checks for all 16 patterns and tells you what to fix first.

  2. Start with the smallest tier that fits Implement starter patterns first, then intermediate, then advanced. Do not cargo-cult every pattern into every agent.

  3. Patch the actual failure mode Change the mechanism, not just the wording. "ALWAYS check X" is not a fix — a verification gate is a fix.

  4. Keep patterns lightweight Add only the pieces that materially reduce failures or operator burden.

Pattern tiers

  • Starter (5): baseline reliability for almost every agent
  • Intermediate (5): daily-driver patterns for briefs, heartbeats, and recurring work
  • Advanced (6): multi-agent orchestration, handoffs, and self-improvement discipline

Pattern clusters

Some patterns reinforce each other naturally. Adopt them together when the failure mode calls for it:

  • Trust chain: WAL Protocol + Anti-Hallucination + Agent Verification — ensures data is captured, sourced, and measured before reporting
  • Handoff loop: Delegation Rules + Completion Contract + Acceptance Gate + Task State Tracking — prevents work from disappearing between agents or being certified without proof
  • Survival kit: Working Buffer + Compaction Injection Hardening + Silent Worker Recovery — keeps context alive across long sessions and prevents silent delegated drift
  • Quality gate: QA Gates + Verify Implementation + Decision Logs — ensures output quality and traceable reasoning
  • Delegation hardening: Brief Quality Gate + Scoped Verifier Gate — keeps delegation tight without turning the whole system into bureaucracy

When patterns conflict

If two patterns seem to give contradictory advice:

  • Safety patterns win over speed patterns. Ambiguity Gate overrides Simple Path First when the request is ambiguous. Verify before acting, even if the simple path is obvious.
  • Evidence patterns win over action patterns. Anti-Hallucination overrides "just try it" when reporting data. Never guess a number to move faster.

Assets — how to use them

The assets/ folder contains starter files you copy into your workspace and customize. They are templates, not drop-in replacements.

# Merge delegation and decision log rules into your existing AGENTS.md
cp assets/AGENTS-additions.md ~/workspace/ # Review, then merge

# Add QA gates
cp assets/QA-gates.md ~/workspace/QA.md

# Set up self-improvement tracking
mkdir -p ~/workspace/.learnings
cp assets/learnings-template.md ~/workspace/.learnings/LEARNINGS.md
cp assets/errors-template.md ~/workspace/.learnings/ERRORS.md
cp assets/features-template.md ~/workspace/.learnings/FEATURE_REQUESTS.md

Read references/audit-usage.md for the full rollout order and bootstrap workflow.

References

  • references/starter-patterns.md — WAL, anti-hallucination, ambiguity, simple-path-first, unblock-before-shelve
  • references/intermediate-patterns.md — verification, working buffer, QA gates, decision logs, verify implementation
  • references/advanced-patterns.md — delegation, brief quality, proof-based handoffs, acceptance gates, orchestration, stale-worker recovery, compaction hardening, recurrence tracking
  • references/audit-usage.md — audit script usage, install/copy snippets, and expected outcomes

Included scripts

  • scripts/audit.sh — workspace audit for all 19 patterns (supports AGENTS.md, CLAUDE.md, SOUL.md, and system.md)

Rules of thumb

  • Audit before expanding
  • Prefer progressive disclosure over giant core files
  • Silence is better than hallucination
  • Ambiguity is a stop sign, not permission
  • The orchestrator should preserve oversight, not sink into implementation
  • Mechanism changes beat wording changes
  • After acting, verify the new state before declaring success
  • Partial progress is not success; recovery steps matter as much as first-attempt steps

Outcome

A leaner, more resilient agent that survives compaction, hands work off cleanly, reports only what is verified, and improves without spiraling into bureaucracy.

安全使用建议
This skill appears internally consistent and low-risk, but take normal precautions before running anything that writes to your files: 1) Inspect scripts/audit.sh locally to confirm behavior (it only greps and checks files). 2) Run the audit against a safe/copy workspace first (e.g., a small test directory) to observe output. 3) Do not blindly run the cp snippets — review and merge templates manually or use cp -n/interactive mode to avoid overwriting important files. 4) Running the audit on very large workspaces may be slow because it recurses with grep; run it against targeted paths if needed. If you need higher confidence, ask the author for a signed release or run the script in a sandboxed environment before applying templates to production workspaces.
功能分析
Type: OpenClaw Skill Name: battle-tested-agent Version: 1.5.0 The bundle provides a set of 'production-hardened' reliability patterns and templates designed to reduce AI hallucinations and improve agent orchestration. It includes a workspace audit script (scripts/audit.sh) that checks for the implementation of these patterns and various markdown templates for logging decisions and errors. The content is focused on defensive prompt engineering, such as 'Compaction Injection Hardening' and 'Anti-Hallucination Rules,' with no evidence of malicious intent, data exfiltration, or unauthorized execution.
能力评估
Purpose & Capability
Name/description match the contained assets and the included audit script. Required binaries (bash, grep, find, wc) are exactly what the audit.sh script uses. No extraneous credentials, hosts, or unrelated tools are requested.
Instruction Scope
SKILL.md tells the agent to run scripts/audit.sh against a workspace and optionally copy template assets into the user's workspace. The audit script scans local workspace files (grep/find) for patterns; it does not call external endpoints or attempt to read system-wide secrets. Note: the provided cp snippets will write into the user's workspace and could overwrite files if run without review — review/copy carefully.
Install Mechanism
No install spec — instruction-only with one small shell script. Nothing is downloaded or written to system directories by the skill itself.
Credentials
No environment variables, credentials, or config paths are required. The skill's operations are limited to user-specified workspace directories.
Persistence & Privilege
always:false (normal). The skill does not request permanent agent inclusion or modify other skills; autonomous invocation is allowed by default but this skill's content is passive and agent-initiated.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install battle-tested-agent
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /battle-tested-agent 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.5.0
Add delegation hardening patterns: brief quality gate, completion contract, acceptance gate, silent-worker recovery, scoped verifier use; expand audit to 19 patterns
v1.4.0
Add Pattern 2a: Isolated Agent Fabrication Guard. Crons/sub-agents that report real-world data without orchestrator review must include a verification rule in their prompt. Born from a production incident where a Lodgify cron fabricated guest names, dates, and cancellation statuses. Silence is always safer than fabrication.
v1.3.0
Improved public messaging, added stronger trigger language and pattern-cluster guidance, clarified conflict rules, and updated the audit workflow and intermediate patterns.
v1.2.1
Restructured the skill into a leaner SKILL.md with starter/intermediate/advanced references, preserved assets and audit script, and improved runtime discoverability.
v1.2.0
v1.2.0: Added pattern #3 Ambiguity Gate (inspired by OpenClaw safety audit arxiv.org/pdf/2602.14364 — 0% pass rate on intent misunderstanding). 16 patterns total. Audit script updated.
v1.1.1
v1.1.1: Tiered UX (Starter/Intermediate/Advanced), 3 new patterns (Handoff Template, Orchestrator Rule, Task State Tracking), recurrence tracking, audit checks 15 patterns, Medium link
v1.1.0
**Expanded from 7 to 15 battle-tested patterns for OpenClaw agents, including a tiered checklist and audit script.** - Introduces pattern tiers: Starter, Intermediate, Advanced, guiding users on implementation priority. - Major SKILL.md overhaul with concise step-by-step audit, pattern checklist, and tier descriptions. - New patterns: Simple Path First, Unblock Before Shelve, Recurrence Tracking, Multi-Agent Delegation, Handoff Template, Orchestrator Rule, and more. - Audit script usage now highlighted; shows which patterns are missing from a workspace. - Streamlined install and setup instructions for faster onboarding. - Skill description, examples, and failure stories condensed and updated for clarity.
v1.0.2
Fix: Corrected timeline from '5 months' to '3 weeks'. Truth matters more than credibility.
v1.0.1
Fix: Added missing asset templates (AGENTS-additions, QA-gates, learnings, errors, features). Fixed undefined {baseDir} placeholder — now uses relative paths per OpenClaw skill conventions.
v1.0.0
Initial release: 12 production-hardened patterns, audit script, born from 5 months of real failures.
元数据
Slug battle-tested-agent
版本 1.5.0
许可证 MIT-0
累计安装 4
当前安装数 4
历史版本数 10
常见问题

Battle-Tested Agent 是什么?

19 production-hardened patterns for AI agents — memory, verification, ambiguity handling, compaction survival, delegation, proof-based handoffs, stale-worker... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 551 次。

如何安装 Battle-Tested Agent?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install battle-tested-agent」即可一键安装,无需额外配置。

Battle-Tested Agent 是免费的吗?

是的,Battle-Tested Agent 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Battle-Tested Agent 支持哪些平台?

Battle-Tested Agent 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Battle-Tested Agent?

由 Don Zurbrick(@zurbrick)开发并维护,当前版本 v1.5.0。

💬 留言讨论