← 返回 Skills 市场

Execution Harness

Name: Execution Harness
Author: lanyasheng

作者 _silhouette · GitHub ↗ · v1.1.0 · MIT-0

cross-platform ✓ 安全检测通过

134

总下载

当前安装

版本数

在 OpenClaw 中安装

/install execution-harness

功能描述

Agent 执行可靠性体系的导航入口。当用户询问 agent 为什么停了、怎么防 agent 提前退出、execution harness 是什么、40 个 pattern 有哪些、6 轴架构、hook 怎么配时匹配。不执行具体操作——各子 skill 分别处理。

使用说明 (SKILL.md)

Execution Harness — 导航入口

40 patterns x 6 轴，让 Claude Code agent 把活干完。不是框架，不做模型调用——只管 hook 和脚本。

这个 skill 不执行操作。它帮你找到该用哪个子 skill。

你遇到什么问题？

问题	用哪个 skill	关键 pattern
Agent 做了一半就停了	execution-loop	1.1 Ralph Stop Hook — 阻止提前退出，5 个安全阀
"应该可以"但没跑测试	execution-loop	1.2 Doubt Gate — 检测投机语言，要求验证
`cargo build` 重试 12 次	tool-governance	2.1 Tool Error Escalation — 3 次提示、5 次 block
`rm -rf` 毁了未提交代码	tool-governance	2.3 Checkpoint + Rollback — 自动 git stash
压缩后忘了设计决策	context-memory	3.1 Handoff Documents — 决策写磁盘
Context 快满了还在读大文件	context-memory	3.4 Token Budget — 80%+ 禁止直读
限速后 tmux 挂死	error-recovery	5.1 Rate Limit Recovery — cron 扫描恢复
5 个 agent 编辑同一文件	multi-agent	4.3 File Claim and Lock — 10min TTL 排他锁
提交了编译不过的代码	quality-verification	6.4 Test-Before-Commit — commit 前跑测试

6 轴速查

轴	Skill	Pattern 数	核心能力
1	execution-loop	7	Ralph Stop Hook、Doubt Gate、Drift Re-anchoring
2	tool-governance	6	错误升级、权限否决、破坏性命令拦截
3	context-memory	8	Handoff 文档、Compaction 抢救、Token Budget
4	multi-agent	6	Coordinator/Fork/Swarm、文件锁、盲审分离
5	error-recovery	7	限速恢复、Crash 恢复、模型降级建议
6	quality-verification	6	编辑后 lint、commit 前测试、session 指标

最小配置

3 个 hook 解决最常见的 3 个问题：

// ~/.claude/settings.json
{
  "hooks": {
    "Stop": [{"hooks": [
      {"type": "command", "command": "bash execution-loop/scripts/ralph-stop-hook.sh"},
      {"type": "command", "command": "bash execution-loop/scripts/doubt-gate.sh"}
    ]}],
    "PreToolUse": [{"hooks": [
      {"type": "command", "command": "bash tool-governance/scripts/tool-error-advisor.sh"}
    ]}],
    "PostToolUse": [{"matcher": {"tool_name": "Write|Edit|MultiEdit"}, "hooks": [
      {"type": "command", "command": "bash quality-verification/scripts/post-edit-check.sh"}
    ]}]
  }
}

仓库

GitHub: lanyasheng/execution-harness | 90 tests | 依赖：bash、jq、python3、pytest

安全使用建议

This bundle is a coherent 'execution harness' and primarily operates on local files and hooks. Before installing or enabling the hooks, do the following: 1) Inspect the included bash scripts (scripts/*) to ensure they do only what you expect — pay attention to any 'git', 'rm', or tmux commands and to where files are written (HOME paths). 2) Backup your ~/.claude/settings.json and any repositories the harness might touch. 3) Run the scripts in a sandbox or container (or under a test HOME) to observe behavior. 4) Start by enabling only non-destructive hooks (e.g., read-only token-budget or context-usage) and review outputs before enabling hooks that run git stash, checkpoint/rollback, or send input to tmux. 5) If you run the test suite (pytest) do so in an isolated environment. 6) Only enable hooks if you trust the source or have audited the code; although there are no external endpoints or requested credentials in the metadata, scripts with filesystem and tmux access can still disrupt local workflows if misconfigured.

功能分析

Type: OpenClaw Skill Name: execution-harness Version: 1.1.0 The execution-harness skill bundle is a comprehensive framework designed to improve the reliability, safety, and persistence of AI agents. It provides a suite of bash scripts and hooks that implement defensive patterns, such as blocking dangerous shell commands (tool-input-guard.sh), preventing premature agent termination (ralph-stop-hook.sh), and ensuring code quality through automated linting (post-edit-check.sh). The bundle contains no evidence of malicious intent, data exfiltration, or unauthorized persistence; instead, it focuses on providing deterministic execution controls and safety valves that align with its stated purpose of helping agents complete complex tasks safely.

能力标签

requires-oauth-token

能力评估

ℹ Purpose & Capability

The declared purpose is a 'navigation entry' for execution reliability patterns and explicitly says it 'does not execute operations' itself. The package contains many bash hook scripts and tests and the SKILL.md gives example settings that call those scripts (e.g., bash execution-loop/scripts/ralph-stop-hook.sh). This is explainable (the hub points users to sub-skills and shipped scripts), but there's a mild mismatch between the 'not execute' wording and the fact that the repo includes runnable scripts and example configs that will execute them if the user wires them into ~/.claude/settings.json. Review the scripts before enabling.

ℹ Instruction Scope

Runtime instructions and examples direct agents (or operator's config) to run local bash scripts and to read/write session/transcript files under the user's home (e.g., ~/.openclaw/shared-context/sessions, .working-state, ~/.claude/settings.json). The scripts operate on local filesystem artifacts, tmux panes, run linters/tests, and manipulate state (git stash, write handoff files). They do not reference external endpoints or ask for secrets in the metadata, but they do have broad local-file access — expected for this purpose, but something to audit before enabling.

✓ Install Mechanism

No install spec is provided (instruction-only metadata), which minimizes automatic remote code installation. The bundle includes scripts and tests but no automated download-from-untrusted-URL step. The user would need to place/clone these files and add hooks manually or via their agent config; that manual step is the security control point.

✓ Credentials

The skill declares no required environment variables, credentials, or config paths. The included scripts expect runtime-provided variables (e.g., TRANSCRIPT_PATH, NC_SESSION, COMPACTION_EXTRACT_INTERVAL) typical for hooks; they do not request API keys or tokens. No unrelated service credentials are demanded.

ℹ Persistence & Privilege

The skill is not always-enabled and does not request elevated platform privileges, but the SKILL.md explicitly recommends editing ~/.claude/settings.json to add hooks that will run the shipped scripts. Enabling those hooks gives the scripts persistent operational presence (they'll run on hook events) and lets them write into user home directories (sessions, handoffs, working-state). This is expected for a hooks-based harness, but it increases the blast radius — inspect scripts and back up configs before enabling.

如何使用

确保已安装 OpenClaw（本地或 Docker 部署）
在对话框中输入安装命令：/install execution-harness
安装完成后，直接呼叫该 Skill 的名称或使用 /execution-harness 触发
根据 Skill 的参数说明提供必要输入，即可获得结构化输出

版本历史

v1.1.0

修复7个脚本bug(含1个CRITICAL死代码)，新增5个测试，总测试数90→95

v2.0.2

v2.0.2: lightweight root SKILL.md (route-only, ~478 tokens), removed obsolete docs

v2.0.1

v2.0.1: cleanup — remove obsolete reports, add 90 tests, rewrite README

v2.0.0

v2: 6-axis restructure (38 patterns, 17 scripts, 10 meta-principles). Distilled from harness-books, claude-reviews-claude, ccunpacked.dev, claude-howto.

元数据

Slug execution-harness

版本 1.1.0

许可证 MIT-0

累计安装 0

当前安装数 0

历史版本数 4

常见问题