Description

Autonomous development workflow with multi-agent Claw Architecture. Single-agent mode for simple features; multi-agent mode decomposes cross-domain work into...

README (SKILL.md)

Buildwright

Name: buildwright
Author: raunakkathuria

Spec-driven autonomous development. Humans approve intent; agents handle everything else.

What this skill does

When activated, Buildwright directs the agent to:

Read your codebase and steering documents
Write a one-page spec (docs/specs/[feature]/spec.md)
Stop for human approval — unless BUILDWRIGHT_AUTO_APPROVE=true
Implement the feature with TDD
Run quality gates: typecheck, lint, test, build
Run optional security scans (if semgrep / gitleaks / trufflehog are installed)
Run a Staff Engineer prompt-based code review
Commit, push, and open a PR via gh

Requirements

Credentials (required)

Credential	Purpose	Scope	How to provide
`GITHUB_TOKEN`	Push commits and open PRs via `gh`	`repo` scope (read/write)	`export GITHUB_TOKEN=ghp_...` or configure in OpenClaw config under `skills.entries.buildwright.apiKey`

The token must have repo scope to push branches and create pull requests. For minimal privilege, use a fine-grained personal access token scoped to a single repository with "Contents: Read and write" and "Pull requests: Read and write" permissions.

Alternatively, if you use SSH for git push, the GITHUB_TOKEN is still needed for gh pr create. You can use gh auth login to authenticate the GitHub CLI separately.

Binaries (required)

Binary	Purpose
`git`	Commits and pushes
`gh`	Opens PRs via GitHub CLI

Optional tools

Binary	Purpose
`semgrep`	SAST security scan
`gitleaks` / `trufflehog`	Secrets detection

Agent Personas (prompt-based, no binaries)

Staff Engineer and Security Engineer are prompt-engineering personas — instructions loaded from .buildwright/agents/ files in the workspace. They are not external tools or binaries. The agent adopts these personas to review specs and code using defined criteria and confidence thresholds. These files contain only prompt instructions and review checklists — no secrets or credentials.

Configuration

BUILDWRIGHT_AUTO_APPROVE (optional, not a credential)

This is an optional boolean flag that controls whether the agent waits for human approval at the spec stage. It is not a secret and not declared in requires.env because it is not required to run the skill.

Value	Behavior
Not set	Interactive (default) — stops and waits for "approved" before building
`false`	Interactive — same as default
`true`	Autonomous — commits spec to git (audit trail) and proceeds without waiting

Recommendation for first use: Leave BUILDWRIGHT_AUTO_APPROVE unset until you have reviewed a few specs and are comfortable with the workflow. Start with interactive mode in a sandbox repository to observe behavior before enabling autonomous commits and PRs.

Commands

/bw-new-feature \x3Cdescription>

Full pipeline for new features. Auto-detects greenfield vs existing projects.

/bw-new-feature "Add OAuth2 login"

Flow: Detect (greenfield or existing?) → Research → Spec → Staff Engineer validates → Human approves → TDD build → Verify → Security scan → Code review → PR

Artifacts produced:

docs/specs/[feature]/research.md — what the agent found in your codebase
docs/specs/[feature]/spec.md — implementation plan with approaches considered

/bw-claw \x3Cfeature>

Multi-agent pipeline using the Claw Architecture. Architect decomposes the feature into domain-specific claw tasks (UI, API, DB), defines interface contracts, and coordinates execution.

/bw-claw "Add profile photo upload for team members"

Flow: Architect analyzes → Decomposes into claw tasks → Defines interface contract → Claws execute per domain (TDD) → Architect integrates → Buildwright quality gates → PR

Best for: Features that cross domain boundaries (e.g., need DB schema + API endpoint + UI component).

Artifacts produced:

docs/specs/[feature]/claw-plan.md — decomposition plan with interface contracts
docs/specs/[feature]/claw-[domain].md — per-claw execution report

/bw-quick \x3Ctask>

Fast path for bug fixes and small tasks (\x3C2 hrs). No spec, no approval step. Runs security scan and code review on the changed diff before committing.

Flow: Understand → Research → TDD → Verify → Security scan → Code review → Commit

/bw-quick "Fix the login timeout bug"

/bw-ship [message]

Quality pipeline for existing work: verify → security → review → PR.

/bw-ship "feat(auth): add OAuth2 support"

/bw-verify

Quick checks only: typecheck → lint → test → build.

/bw-analyse

Analyse an existing codebase and write structured docs to .buildwright/codebase/. Creates docs from scratch if missing; auto-refreshes existing docs when BUILDWRIGHT_AUTO_APPROVE=true (only asks in interactive mode). Creates tech.md from template if it doesn't exist, then populates it with the discovered stack and commands. Run this first on any brownfield project to give every subsequent session real context.

/bw-analyse

Produces: STACK.md, ARCHITECTURE.md, CONVENTIONS.md, CONCERNS.md under .buildwright/codebase/.

/bw-plan \x3Cquestion or task file>

Research a question or topic and produce a written deliverable — no implementation, no commits. Use when someone asks a question or needs an analysis, plan, or report before (or instead of) writing code.

/bw-plan "what are the performance risks in this Flutter app?"
/bw-plan "plan a migration from monolith to microservices"
/bw-plan tasks/flutter-perf-review.md

Flow: Understand question/task → Clarify if needed → Research (read code + run read-only tools) → Synthesize findings → Write deliverable → Summarize

Accepts two invocation styles:

Inline question — describe the question or topic directly; the agent infers scope and writes to docs/plans/\x3Cslug>/\x3Cdate>/plan.md
Task file — a structured .md file with Inputs, Rules, Research Areas, and Outputs blocks; the agent parses and executes it exactly

Use /bw-plan when you want: a performance review, architecture decision record, migration plan, technology evaluation, static analysis report, or any "research this and give me a written output" task.

Use /bw-new-feature instead when you want the plan executed (research + spec + implement + ship).

Artifacts produced: whatever the task specifies; at minimum plan.md in the output directory.

Hard constraints: never modifies source files, never commits or creates PRs, every finding must cite evidence.

/bw-help

Show all available commands.

Failure Behavior

If any gate fails after retries, the agent commits completed work, pushes, and opens a PR with a structured failure report. It does not leave orphaned branches or silent failures.

Retry Policy

Gate	Retries	Rationale
Verify (typecheck, lint, test, build)	2x	Fixable by the agent
Security scan	None	Requires human judgment
Code review	None	Architectural decisions need humans

Security Considerations

This skill performs autonomous code changes, commits, and pull requests. Understand what it does before enabling it on repositories with sensitive or production code.

What the skill reads: Your repository source code, .buildwright/agents/ persona files (prompt instructions only, no secrets), and .buildwright/steering/ context files.

What the skill writes: Spec files under docs/specs/, source code changes, git commits on feature branches, and pull requests via gh.

What the skill does NOT do: It does not modify .env files, access secrets stores, run destructive git operations (force push, reset), or merge PRs. All changes go to feature branches with PRs for human review.

Recommended setup for first use:

Start with a fork or sandbox repository, not production code
Leave BUILDWRIGHT_AUTO_APPROVE unset (interactive mode) to review specs before builds
Use a fine-grained GitHub token scoped to a single repository with minimal permissions
Rotate tokens regularly and revoke when no longer needed
Review generated PRs before merging — the skill creates PRs, it does not merge them

More Information

Full documentation, source code, and setup instructions: https://github.com/raunakkathuria/buildwright

Usage Guidance

This skill appears coherent, but it will be able to read and modify whatever repository you run it in and use the provided GITHUB_TOKEN to push branches and create PRs. Before enabling it in a production repo: 1) prefer a fine‑grained GitHub token limited to a single repository and only the needed permissions (Contents read/write, Pull Requests read/write); 2) test the skill in a sandbox repository first with BUILDWRIGHT_AUTO_APPROVE unset so it stops for manual approval; 3) inspect any .buildwright/agents/ prompt files in your repo to confirm they contain only review prompts (no secrets or unexpected instructions); 4) store the token securely and rotate/revoke it if you stop using the skill; 5) monitor created branches and PRs until you’re comfortable with its behavior.

Capability Analysis

Type: OpenClaw Skill Name: buildwright Version: 0.0.9 The 'buildwright' skill is a legitimate autonomous development workflow tool designed to automate feature implementation, TDD, security scanning, and PR creation. It requires standard development binaries (git, gh) and a GITHUB_TOKEN for its stated purpose, with clear documentation on security considerations and an optional 'auto-approve' mode for autonomy. No evidence of data exfiltration, malicious execution, or prompt injection was found in SKILL.md or _meta.json.

Capability Assessment

✓ Purpose & Capability

The skill is an autonomous development pipeline that commits, pushes, and opens PRs; requiring git, gh, and a GITHUB_TOKEN with repo scope matches that stated purpose. Optional scanners (semgrep, gitleaks, trufflehog) are reasonable for the declared security-scan steps.

ℹ Instruction Scope

SKILL.md explicitly instructs the agent to read the repository, .buildwright prompt files, generate specs, run tests/lints/security scans, and commit/push via gh. Reading the codebase and those workspace files is expected for feature implementation and review, but the agent will have broad read/write access to the repository it runs in — review which repository is used and what files live there before running.

✓ Install Mechanism

Instruction-only skill with no install steps or external downloads; lowest install risk because it doesn't write or execute bundled code. It relies on existing system binaries (git, gh).

✓ Credentials

Only one credential (GITHUB_TOKEN) is required and is the declared primary credential; this matches the need to create branches, push commits, and open PRs. The SKILL.md recommends using a fine-grained token scoped to a single repository, which is appropriate. BUILDWRIGHT_AUTO_APPROVE is non-secret and not required.

✓ Persistence & Privilege

The skill is not marked always:true and does not request modifications to other skills or system-wide settings. It can run autonomously (normal for skills) and will create commits/PRs if allowed — this is expected behavior for a CI/agent that automates development actions.

Version History

v0.0.9

**Expanded research and planning tools; improved platform compatibility.** - Added `/bw-plan` command for generating written analysis, plans, or reports without making code changes or commits. - Clarified usage of `/bw-plan`: supports both inline questions and structured task files; produces only written deliverables. - Updated compatibility to include support for Codex CLI. - No other file or behavioral changes detected in this release.

v0.0.8

Version 0.0.8 - Updated version metadata to 0.0.8. - /bw-analyse now creates docs from scratch if missing, auto-refreshes existing docs in autonomous mode, and creates a `tech.md` template if it doesn't exist. - Added clarification that /bw-analyse only prompts for confirmation in interactive mode. - No detected code or binary changes; documentation improved with more details on the /bw-analyse command.

v0.0.7

- Version bump to 0.0.7. - /bw-quick flow enhanced: explicitly runs security scan and code review on the changed diff before committing. - No other user-facing changes documented.

v0.0.3

**Adds codebase analysis and discovery documentation.** - Introduced /bw-analyse command to analyze existing codebases and write structured discovery docs under `.buildwright/codebase/`. - Analysis produces `STACK.md`, `ARCHITECTURE.md`, `CONVENTIONS.md`, and `CONCERNS.md`, plus updates `tech.md` with stack and commands. - Recommended to run /bw-analyse first on brownfield projects for improved future context. - No other user-facing changes noted.

v0.0.2

- Added compatibility and support for the Cursor development environment. - Updated documentation to reflect Cursor support alongside existing tools.

v0.0.1

Initial release introducing Buildwright, an autonomous development workflow with Claw Architecture. - Supports both single-agent (simple features) and multi-agent (cross-domain) workflows. - Integrates TDD, security scanning (optional tools), code review, and quality gates for reliable automation. - Provides commands for new features, multi-agent builds, quick fixes, quality checks, and detailed help. - Requires git, GitHub CLI (`gh`), and a GitHub token with repo scope; supports optional security scanners. - Interactive and autonomous approval modes available for spec validation. - Emphasizes safety: human review before merges, no destructive git actions, comprehensive audit trail.

v1.0.2

Declared required bins (git, gh) and env var (BUILDWRIGHT_AUTO_APPROVE) in openclaw metadata to resolve registry mismatch flagged by security scan.

v1.0.1

Fixed security scan findings: declared required binaries (git, gh) and optional security tools; clarified BUILDWRIGHT_AUTO_APPROVE default is interactive — autonomous requires explicit opt-in; explained personas are prompt-based; removed install instructions from skill body.

v1.0.0

- Initial release of Buildwright v1.0.0. - Introduces an autonomous, spec-driven development workflow requiring only one human approval at the spec stage. - Supports feature development, quick bug fixes, quality gate enforcement (TDD, security scan, code review), and seamless shipping through dedicated commands. - Includes agent personas for Staff Engineer and Security Engineer code reviews. - Compatible with Claude Code, OpenCode, and OpenClaw; requires git. - Provides detailed documentation and easy installation instructions.

Metadata

Slug buildwright

Version 0.0.9

License MIT-0

All-time Installs 0

Active Installs 0

Total Versions 9

Frequently Asked Questions

What is buildwright?

Autonomous development workflow with multi-agent Claw Architecture. Single-agent mode for simple features; multi-agent mode decomposes cross-domain work into... It is an AI Agent Skill for Claude Code / OpenClaw, with 498 downloads so far.

How do I install buildwright?

Run "/install buildwright" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is buildwright free?

Yes, buildwright is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does buildwright support?

buildwright is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created buildwright?

It is built and maintained by Raunak Kathuria (@raunakkathuria); the current version is v0.0.9.

More Skills

buildwright