← 返回 Skills 市场
zurbrick

Agent QA Gates

作者 Don Zurbrick · GitHub ↗ · v1.2.0 · MIT-0
cross-platform ✓ 安全检测通过
319
总下载
0
收藏
1
当前安装
3
版本数
在 OpenClaw 中安装
/install agent-qa-gates
功能描述
Output validation gates for AI agent systems. Prevents hallucinated data, leaked internal context, wrong formats, duplicate sends, post-compaction drift, and...
使用说明 (SKILL.md)

Agent QA Gates

A field-tested validation system for AI agent output. Born from production failures, not theory.

Quick Start

Before any agent delivers output, run the Pre-Ship Checklist:

  1. Accurate? — every number/date/metric has a source. Unsourced → prefix "estimated"
  2. Complete? — no missing pieces, no "I'll do that next"
  3. Actionable? — ends with clear next step or decision point
  4. Fits the channel? — check character limits for your delivery surface
  5. No leaks? — no internal context, private data, or secrets
  6. Not a duplicate? — verify no recent identical send
  7. Would the human be embarrassed? — if yes, don't ship

Gate Tiers

Four ascending tiers by risk level:

Gate Scope Key Checks
Gate 0 Internal (files, config, memory) Mechanism changed not just text, no placeholders, file exists
Gate 1 Human-facing (briefings, summaries) Key info in first 2 lines, ≤3-line paragraphs, channel length limits
Gate 2 External (email, public content, client materials) No internal context leaked, recipient-appropriate tone, dedup check
Gate 3 Code & technical Builds clean, no secrets in code, error handling, tests pass

See references/gates-detail.md for full gate checklists.

Severity Classification

Not all failures are equal:

  • 🔴 BLOCK — cannot ship (secrets, privacy, hallucinated data, wrong recipient)
  • 🟡 FIX — fix before shipping, \x3C2 min (formatting, too long, missing citation)
  • 🟢 NOTE — log and ship (style preference, minor optimization)

Protocol Gates

Recurring failure modes need dedicated gates. These are the most common:

Heartbeat / Periodic Check Output

  • Binary output: alert text ONLY or status-OK ONLY. Never mixed.
  • Every data point verified by current-session tool call. No hallucinated metrics.
  • No stale data from previous cycles or pre-compaction sessions.

Post-Compaction / Context Reset

  • Do not trust facts from the pre-reset session — verify from files and tools.
  • Rerun pending checks from scratch.
  • Zero carryover for periodic checks.

Scheduled Job / Cron Changes

  • Explicit timeout set
  • Explicit model set
  • Verify schedule after creation
  • Output fits destination channel limits

Sub-Agent Output Review

  • Does output match the brief's success criteria?
  • Any uncertainty flags unresolved?
  • Is the reasoning (not just the conclusion) sound?

Isolated Agent / Cron Output (real-world data)

For any cron or sub-agent that reports external data without orchestrator review:

  • Did the agent make a verifiable live tool call? Is the raw response traceable?
  • Any names, dates, amounts, or IDs that can't be traced to a tool result? → 🔴 BLOCK
  • If tool call failed: output must be DATA_UNAVAILABLE — [reason], not fabricated data
  • Does the cron prompt include the Real-World Data Verification Rule? Severity: Fabricated real-world data = 🔴 BLOCK. Same as hallucinated metrics.

Delegated Work Acceptance

For any non-trivial delegated task (especially builds, audits, config changes, or external deliverables):

  • Does the handoff include a clear artifact path or proof object?
  • Did the worker report exact commands run rather than vague claims?
  • Did verification actually happen, with results stated?
  • Is the output non-empty and specific, not just "done" or "completed successfully"?
  • Are known gaps / next actions named explicitly?
  • If the handoff is empty, artifact-free, or self-certifying without proof → 🔴 BLOCK
  • Valid dispositions: Done, Revision Needed, Blocked, Failed, Stale

Silent Worker / Stale Task Classification

For delegated work that appears to be running:

  • Was the spawn actually accepted? If not, it is not running.
  • No start signal within 10 minutes after accepted spawn → Stale
  • No materially new output for 30 minutes on active work → Stale unless the task explicitly justifies a longer quiet window
  • Stale work must be investigated, respawned, or escalated — never left as indefinite In Progress

Gate Evolution

Gates should evolve based on real failures, not imagination:

  1. When a failure occurs → log it with root cause
  2. Same failure class occurs 2+ times → add a gate item
  3. Monthly: prune gates that haven't caught anything in 60 days

Anti-Patterns

  • Gates that sound good but never catch anything → kill them
  • Per-agent checklists that duplicate general gates → merge or reference
  • "ADHD-friendly" or "high-quality" as gate items → not testable, replace with mechanical checks
  • Aspirational gates nobody runs → either automate or cut

Adapting to Your System

This skill provides the pattern. Adapt it:

  1. Start with the Pre-Ship Checklist — it works for any agent system
  2. Add Protocol Gates for your top 3 recurring failure modes
  3. Set channel limits for your delivery surfaces
  4. Map real failures to gates — if a failure isn't gated, add the gate
  5. Kill gates that never fire — a shorter, sharper checklist wins

For the full reference implementation, see references/gates-detail.md. For automation scripts, see scripts/qa-check.sh.

安全使用建议
This skill is internally consistent and appears safe to review/use, but take these precautions before installing or running it: (1) the qa-check.sh script reads whatever file or stdin you pass it — do not feed it sensitive files you don't want inspected or echoed; (2) the script looks for secret-like patterns and may produce false positives — verify flagged items manually; (3) it uses standard shell tools (bash, grep, awk) so run it in a controlled environment if you are uncertain; (4) the skill does not request credentials or network access, but avoid giving it broad file paths or automating it with elevated privileges until you’ve tested it on non-production data; (5) if you integrate it into autonomous agent pipelines, ensure gate behavior and blocking semantics match your operational safety policies.
功能分析
Type: OpenClaw Skill Name: agent-qa-gates Version: 1.2.0 The 'agent-qa-gates' skill is a defensive utility designed to improve the reliability and security of AI agent outputs. It provides a structured validation framework (SKILL.md and references/gates-detail.md) and an automated bash script (scripts/qa-check.sh) that scans for common issues such as leaked API keys, PII patterns, internal system context leaks, and hallucinated data. The logic is entirely focused on quality control and preventing accidental data exposure, with no evidence of malicious intent or unauthorized capabilities.
能力评估
Purpose & Capability
Name/description match the provided assets: SKILL.md describes QA gate checklists and the repo contains a reference doc and a shell script that implements checks. The skill does not request unrelated credentials, binaries, or configuration paths.
Instruction Scope
SKILL.md limits itself to output-validation guidance and points to an automation script. The script operates on a provided file or stdin and looks for placeholders, secrets patterns, length/format issues, internal-context keywords, and basic code checks. It does not instruct the agent to read arbitrary system state, other skills' configs, or transmit data externally.
Install Mechanism
There is no install spec (instruction-only with a bundled script). That is low-risk. The included Bash script uses standard POSIX utilities (bash, grep, awk) but no downloaded code or external URLs.
Credentials
The skill declares no required environment variables or credentials. The script actively scans content for secret-like patterns (e.g., sk-..., AKIA..., ghp_...) but does not request or store any secrets itself.
Persistence & Privilege
The skill does not request persistent/always-on privileges (always:false), and it does not modify other skills or system-wide settings. Autonomous invocation is allowed by platform default but not elevated by this skill.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install agent-qa-gates
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /agent-qa-gates 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.2.0
Add delegated-work acceptance gates, empty-success rejection, and stale-task classification for non-trivial delegated work
v1.1.0
Add Isolated Agent/Cron Output gate to gates-detail.md and SKILL.md quick reference. Any cron or sub-agent delivering real-world data (bookings, email, health, finance) without orchestrator review must have a live tool call on record — or output DATA_UNAVAILABLE. Fabricated external data is classified as 🔴 BLOCK severity.
v1.0.0
Initial release: tiered validation gates, protocol gates, severity classification, qa-check.sh automation script
元数据
Slug agent-qa-gates
版本 1.2.0
许可证 MIT-0
累计安装 1
当前安装数 1
历史版本数 3
常见问题

Agent QA Gates 是什么?

Output validation gates for AI agent systems. Prevents hallucinated data, leaked internal context, wrong formats, duplicate sends, post-compaction drift, and... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 319 次。

如何安装 Agent QA Gates?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install agent-qa-gates」即可一键安装,无需额外配置。

Agent QA Gates 是免费的吗?

是的,Agent QA Gates 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Agent QA Gates 支持哪些平台?

Agent QA Gates 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Agent QA Gates?

由 Don Zurbrick(@zurbrick)开发并维护,当前版本 v1.2.0。

💬 留言讨论