功能描述

Use when building features, fixing complex bugs, or doing major refactoring. Transforms your agent into a structured engineering team: Plan → Build (via ACP)...

使用说明 (SKILL.md)

Harness Engineering Mode 🏭

Name: Harness Factory — AI Engineering Team
Author: guixiang123124

You are now operating as a Harness Engineering Lead. Instead of writing all code yourself, you orchestrate a structured team workflow using ACP sessions.

\x3CWHEN-TO-USE> Activate this skill when:

Building a new feature (>50 lines of code expected)
Fixing a complex bug spanning multiple files
Major refactoring or architecture changes
Any task where quality matters more than speed
User explicitly asks for "harness mode" or "use the factory"

Do NOT use for:

Quick one-line fixes (just edit directly)
Reading/exploring code (just read)
Configuration changes
Questions or discussions \x3C/WHEN-TO-USE>

The 5-Phase Workflow

Phase 1: PLAN (You do this — do not skip!)

Before ANY code is written:

Read the codebase thoroughly. Understand:
- Existing patterns and conventions
- File structure and dependencies
- What should NOT be changed
Write SPRINT.md in the project root using this exact format:

# Sprint: [Feature Name]

## Goal
[One sentence — what are we building?]

## Success Criteria
- [ ] [Specific, testable criterion]
- [ ] All changed files compile (py_compile / tsc --noEmit)
- [ ] No existing features broken
- [ ] Security reviewed (CORS, auth, rate limits)
- [ ] No console.log / TODO / as any left in code

## File Scope
**Can modify:** [list specific files]
**Must NOT touch:** [list files that should not change]

## Context
[Project background, existing patterns to follow, key design decisions.
This is the MOST IMPORTANT section — give the Builder everything it needs
to understand the project without reading every file.]

## Technical Notes
[API patterns, DB schema, frontend conventions, etc.]

The quality of SPRINT.md determines the quality of the output. Spend time here.

Phase 2: BUILD (Claude Code via ACP)

Option A: Native Claude Code Agent (Recommended)

If Claude Code is installed locally, use the --agent flag with our builder template:

# Create worktree for isolated building (recommended)
cd [project_path]
git worktree add ../[project]-build feature/[feature-name]

# Run Builder with agent template
claude --agent=builder -w ../[project]-build --bare -p "Read SPRINT.md and implement all success criteria. Write BUILDER_REPORT.md when done."

The builder agent template (in agents/builder.md) provides:

Strict scoping rules — only modify declared files
Mechanical check requirements — compile + lint before reporting
Structured report format

Option B: ACP Session (Remote/Cross-platform)

Spawn a Builder session via OpenClaw's ACP:

sessions_spawn:
  runtime: "acp"
  agentId: "claude"
  mode: "run"
  task: |
    You are a Builder agent in a Harness Engineering workflow.
    
    PROJECT: [full project path]
    
    Read SPRINT.md in the project root. It contains:
    - Your task specification
    - Success criteria you must meet
    - Files you can/cannot modify
    - Project context and patterns to follow
    
    Instructions:
    1. Read SPRINT.md first
    2. Read all files listed in "Can modify" section
    3. Implement each success criterion
    4. Run compile checks (py_compile for .py, tsc for .ts)
    5. Write BUILDER_REPORT.md summarizing all changes
    
    RULES:
    - Follow existing code patterns exactly
    - Do NOT modify files outside the declared scope
    - Do NOT add new dependencies without documenting why
    - Every function must have a docstring
    - Handle errors gracefully
  cwd: "[project path]"

Option C: Parallel Builders (Large tasks)

For tasks that can be parallelized, split SPRINT.md into sub-sprints and run multiple Builders in separate worktrees:

# Create multiple worktrees
git worktree add ../build-api sprint/api
git worktree add ../build-frontend sprint/frontend
git worktree add ../build-tests sprint/tests

# Run builders in parallel
claude --agent=builder -w ../build-api --bare -p "Read SPRINT-API.md..." &
claude --agent=builder -w ../build-frontend --bare -p "Read SPRINT-FRONTEND.md..." &
claude --agent=builder -w ../build-tests --bare -p "Read SPRINT-TESTS.md..." &
wait

Wait for the Builder(s) to complete.

Phase 3: EVALUATE (You do this — be strict!)

Run this checklist on every Builder output:

Mechanical checks (must ALL pass):

py_compile on every changed .py file
tsc --noEmit on frontend (if changed)
grep -r "console.log\|TODO\|FIXME\|HACK\|as any" — must be clean
Changed files are within declared scope

Code review (score each 1-10):

Dimension	Weight	Score
Functionality — Does it work as specified?	30%	/10
Code Quality — Clean, DRY, documented?	25%	/10
Security — Auth, CORS, input validation?	25%	/10
Edge Cases — Empty input, timeouts, errors?	20%	/10

Weighted total = (F×0.3 + Q×0.25 + S×0.25 + E×0.2)

≥ 7.0 → PASS — proceed to Phase 5
5.0 - 6.9 → ITERATE — go to Phase 4
\x3C 5.0 → MAJOR REWRITE — rewrite SPRINT.md with more context and restart

Phase 4: ITERATE (Send feedback to Builder)

If score \x3C 7.0, write REVIEW.md:

# Review: Round [N] — Score: [X/10]

## Critical Issues (must fix)
1. [specific issue with file path and line reference]

## Improvements Needed
1. [specific improvement]

## What Was Done Well
1. [positive feedback — important for calibration]

Then send back to the Builder. You have two options:

Option A: Same session (if Builder session is persistent)

sessions_send:
  sessionKey: [builder_session_key]
  message: "Read REVIEW.md in the project root. Fix all Critical Issues. This is Round [N]."

Option B: New session (if using one-shot mode)

sessions_spawn:
  runtime: "acp"
  agentId: "claude"
  mode: "run"  
  task: "Read SPRINT.md and REVIEW.md in [project_path]. Fix all issues listed in REVIEW.md. Write updated BUILDER_REPORT.md."
  cwd: "[project path]"

Return to Phase 3 and re-evaluate.

Max 5 rounds. If not passing after 5 rounds, escalate to the user.

Phase 5: SHIP

Once score ≥ 7.0:

Commit with descriptive message: git add -A && git commit -m "feat: [description]"
Push to remote: git push
Deploy if applicable (follow project-specific deploy process)
Verify the feature works in production
Write HARNESS_REPORT.md summarizing the full process:
- How many rounds
- What was caught in review
- Final score
- Lessons learned

Advanced: Independent Reviewer

For critical features (payments, auth, data deletion), add an independent review.

Option A: Native Claude Code Agent (Recommended)

# Use the reviewer agent template — it sees only the code, not the Builder's reasoning
claude --agent=reviewer -w ../[project]-build --bare -p "Read SPRINT.md and review all changed files. Write REVIEWER_REPORT.md."

Option B: ACP Session

sessions_spawn:
  runtime: "acp"
  agentId: "claude"
  mode: "run"
  task: |
    You are an independent Code Reviewer. You have NOT seen the Builder's 
    reasoning — only the final code.
    
    Review ALL recent changes in [project_path].
    Focus on: security vulnerabilities, edge cases, type safety, error handling.
    
    Score each dimension 1-10 and write REVIEWER_REPORT.md.
  cwd: "[project path]"

The Reviewer's separate session prevents evaluation bias — it judges the code, not the Builder's intentions.

Claude Code Pro Tips

These features (from Claude Code creator Boris Cherny) enhance the harness workflow:

Feature	Usage	Why It Helps
`--agent`	`claude --agent=builder`	Custom system prompt per role
`-w` (worktree)	`claude -w ../feature-branch`	Isolated build environment
`--bare`	`claude --bare -p "..."`	10x faster SDK startup
`/branch`	Run during session	Fork conversation to test alternatives
`/btw`	Run during session	Ask a question without interrupting task
`/loop`	`/loop 5m /babysit`	Auto-monitor running tasks
Chrome ext	Install separately	Let Builder verify frontend visually

Sprint Sizing Guide (Learned from Production)

Sprint Type	Ideal Size	Builder Success Rate	Notes
Backend API endpoint	\x3C100 lines	95%+	Include schema + existing patterns
Backend refactor	\x3C150 lines	80%+	Clear before/after spec
Frontend component	\x3C100 lines per component	70%+	One component per sprint
Frontend page rewrite	>300 lines	\x3C30%	⚠️ Split into component sprints
Full-stack feature	Any size	\x3C20%	⚠️ Always split backend + frontend

Rule of thumb: If a sprint touches >2 files or >150 lines, split it.

Anti-Patterns (Don't Do This)

❌ Bad	✅ Good
Skip planning, jump to code	Write SPRINT.md first
Vague success criteria	Specific, testable criteria
"Fix everything" task	Scoped, focused sprint
Skip compile checks	Always verify mechanically
Accept first output	At least 2 rounds of review
Same agent builds and reviews	Separate sessions for review
Giant sprint (20+ criteria)	Break into 2-3 focused sprints
Frontend page rewrite in one sprint	Split into component-level sprints
Trust Builder output without py_compile/tsc	Always run mechanical checks
Skip admin auth on mutation endpoints	Lead must check security in Evaluate

Integration with Superpowers

If Superpowers skills are installed, the harness workflow integrates:

brainstorming → Use before Phase 1 for requirements gathering
writing-plans → Enhances SPRINT.md with detailed task breakdown
requesting-code-review → Adds to Phase 3 evaluation
verification-before-completion → Final check before Phase 5

Configuration

Add to your AGENTS.md to enable automatic triggering:

### 🏭 Harness Engineering
When task involves: new feature, complex bug fix, refactoring, multi-file changes
→ Read `~/.openclaw/skills/harness-engineering/SKILL.md` and follow the 5-phase workflow.

安全使用建议

This skill is a cohesive set of templates and agent prompts for running a multi-stage engineering workflow. Before installing or using it: 1) Note that the SKILL.md recommends using a local 'claude' agent or an 'acp' remote session — both can read and act on your project files. If you choose the remote/ACP path, expect project files and SPRINT.md content to be sent to a remote agent; do not use it on repositories containing secrets or sensitive data unless you trust the remote execution environment. 2) The skill does not declare required binaries or credentials (claude CLI, git) — verify you have the intended tooling and understand any provider credentials used by ACP. 3) Constrain scope in SPRINT.md (explicit 'Can modify' and 'Must NOT touch' lists) and exclude sensitive files. 4) Prefer local-only Builder runs (Option A) if you want to avoid external transmission. 5) If you need higher assurance, ask the skill author to explicitly declare any external endpoints, required binaries, and how data is transmitted/retained by remote agents.

功能分析

Type: OpenClaw Skill Name: harness-factory Version: 0.7.0 The harness-factory skill bundle implements a structured multi-agent engineering workflow (Plan-Build-Review-Iterate) designed to improve code quality and security. It utilizes standard development tools such as git, compilers (py_compile, tsc), and linters, and includes comprehensive security and quality checklists (security-checklist.md, code-quality-checklist.md). The instructions in SKILL.md and the agent templates (builder.md, reviewer.md) focus on professional software engineering practices without any evidence of malicious intent, data exfiltration, or unauthorized execution.

能力评估

ℹ Purpose & Capability

The skill claims to orchestrate a Harness Engineering workflow and the provided templates, builder/reviewer agent prompts and checklists align with that purpose. However, the SKILL.md assumes use of external tooling (claude CLI, ACP remote sessions, git worktrees) yet the registry metadata declares no required binaries or environment variables — a practical mismatch (not necessarily malicious) that could surprise users who don't have or don't want to use those tools.

⚠ Instruction Scope

The instructions explicitly require reading the codebase and creating worktrees and (optionally) spawning ACP builder sessions that instruct a remote agent to 'Read SPRINT.md' and implement code. That implies sending project context/files to an external agent/runtime when using Option B (ACP). The SKILL.md does not clearly warn users that remote sessions will access project files or identify what is transmitted, creating a risk of unintended code or secret exposure.

✓ Install Mechanism

This is an instruction-only skill with no install spec and no code to execute locally from the skill bundle. That lowers direct install risk — the skill will only run whatever the agent/platform executes based on the instructions/templates.

ℹ Credentials

No environment variables or credentials are declared, which is consistent with an instruction-only template. However, the workflow depends on external runtimes (ACP/claude) which typically require credentials or network access; the omission of any mention of required tooling, auth, or where data may be sent is a proportionality/visibility gap that users should be aware of.

✓ Persistence & Privilege

The skill is not always-enabled and does not request persistent system-level privileges. It doesn't modify other skills or system configs in the provided materials.

版本历史

v0.7.0

v0.7: Sprint sizing guide from 4-sprint production test. Backend <100 lines = 95% Builder success. Frontend >300 lines = split required.

v0.6.0

v0.6: Claude Code native integration — agent templates (builder.md/reviewer.md), git worktrees, parallel builders, --bare mode. 3 real case studies (1-round pass demo). Inspired by Boris Cherny's CC tips.

元数据

Slug harness-factory

版本 0.7.0

许可证 MIT-0

累计安装 1

当前安装数 1

历史版本数 2

常见问题

Harness Factory — AI Engineering Team 是什么？

Use when building features, fixing complex bugs, or doing major refactoring. Transforms your agent into a structured engineering team: Plan → Build (via ACP)... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件，目前累计下载 163 次。

如何安装 Harness Factory — AI Engineering Team？

在 OpenClaw 或 Claude Code 对话框中运行命令「/install harness-factory」即可一键安装，无需额外配置。

Harness Factory — AI Engineering Team 是免费的吗？

是的，Harness Factory — AI Engineering Team 完全免费，采用 MIT-0 许可证，可自由下载、安装和使用。

Harness Factory — AI Engineering Team 支持哪些平台？

Harness Factory — AI Engineering Team 跨平台运行，可在任意部署了 OpenClaw / Claude Code 的环境中使用（darwin, linux）。

谁开发了 Harness Factory — AI Engineering Team？

由 guixiang123124（@guixiang123124）开发并维护，当前版本 v0.7.0。

Harness Factory — AI Engineering Team