ClawBrain Benchmark
/install clawbrain-pro-benchmark
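Test how your AI really performs in OpenClaw. See whether it can handle simple tasks and whether it drops the ball on complex ones.
Usage
Just say "跑一下 benchmark" ("run the benchmark") or "测试一下模型效果" ("test the model's performance").
What it tests
205 real-world scenarios across 10 major categories: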
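| Category | What it tests | Why it matters |
|---|---|---|
| File operations | Reading, writing, and editing files | The fundamentals |
| Search | Looking up information, fetching web pages | Everyday needs |
| Messaging | Sending messages via WeChat and DingTalk | Communication and collaboration |
| Terminal | Running commands, managing services | Development and operations |
| Multi-step tasks | Search → organize → save → notify | The ability to actually get work done |
| Error recovery | What it does when something goes wrong | Reliability |
| Ambiguous instructions | "Help me prepare" | Intelligence |
| Visual understanding | Reading images, recognizing screenshots | Multimodal capability |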
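The source does not say how scenario outcomes are turned into scores. As a minimal sketch, assuming each scenario is recorded as a simple pass or fail and that the overall score weights every scenario equally, per-category pass rates could be aggregated as shown here. The function name `pass_rates` and the sample data are hypothetical and are not part of the skill.

```python
from collections import defaultdict


def pass_rates(results):
    """Aggregate (category, passed) pairs into pass rates.

    Assumption: one pair per scenario, with passed being True or False.
    Returns (per_category, overall), both as fractions between 0 and 1.
    """
    totals = defaultdict(int)
    passes = defaultdict(int)
    for category, passed in results:
        totals[category] += 1
        if passed:
            passes[category] += 1
    per_category = {c: passes[c] / totals[c] for c in totals}
    # Overall rate is scenario-weighted, not an average of category rates.
    overall = sum(passes.values()) / sum(totals.values()) if totals else 0.0
    return per_category, overall


# Hypothetical sample data, not real benchmark output.
sample = [
    ("file operations", True),
    ("file operations", True),
    ("multi-step tasks", True),
    ("multi-step tasks", False),
]
per_category, overall = pass_rates(sample)
```

A scenario-weighted overall score means categories with more scenarios count for more, which may or may not match how the benchmark itself reports its overall figure.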
Evaluation results (v1.0)
| Model | Overall | Files | Search | Terminal | Error recovery | Ambiguous instructions | Multi-step |
|---|---|---|---|---|---|---|---|
| ClawBrain Auto | 90% | 100% | 100% | 100% | 100% | 100% | 80% |
| ClawBrain Pro | 86% | 100% | 100% | 100% | 100% | 100% | 80% |
| Single model A | 83% | 95% | 100% | 90% | 80% | 65% | 73% |
| Single model B | 81% | 85% | 100% | 90% | 76% | 55% | 73% |
| Single model C | 73% | 100% | 100% | 90% | 56% | 65% | 80% |
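ClawBrain's orchestration engine chains proactive thinking → multi-model collaboration → output verification → error recovery, and its overall score beats every single model tested.
Full report: https://clawbrain.dev/blog/openclaw-model-comparison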
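The four orchestration stages named above (proactive thinking, multi-model collaboration, output verification, error recovery) can be illustrated with a small control loop. This is only a sketch under stated assumptions, not ClawBrain's actual engine: `orchestrate`, `plan`, `verify`, and the stub models are hypothetical names, and "multi-model collaboration" is interpreted here as simple fallback from one model to the next.

```python
from typing import Callable, List, Optional, Sequence

# Hypothetical types: a model takes the task and planned steps, returns text.
Model = Callable[[str, List[str]], str]


def orchestrate(
    task: str,
    plan: Callable[[str], List[str]],
    models: Sequence[Model],
    verify: Callable[[str, str], bool],
    max_rounds: int = 2,
) -> Optional[str]:
    """Plan first, then try each model until an output passes verification."""
    steps = plan(task)  # proactive thinking: decide the steps up front
    for _ in range(max_rounds):
        for model in models:  # multi-model collaboration (fallback order)
            try:
                output = model(task, steps)
            except Exception:
                continue  # error recovery: move on to the next model
            if verify(task, output):  # output verification
                return output
    return None  # nothing passed verification within max_rounds


# Stub models for illustration only.
def flaky(task: str, steps: List[str]) -> str:
    raise RuntimeError("simulated failure")


def steady(task: str, steps: List[str]) -> str:
    return f"done: {task} in {len(steps)} steps"


result = orchestrate(
    "save notes",
    plan=lambda t: ["search", "save"],
    models=[flaky, steady],
    verify=lambda t, o: o.startswith("done"),
)
```

In this sketch the failing model is skipped and the second model's output is returned once it passes the check; if no model produces a verified output, the caller receives `None` and can decide how to report the failure.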
- Make sure OpenClaw is installed (local or Docker)
- Run the install command in chat: /install clawbrain-pro-benchmark
- After installation, invoke the skill by name or use /clawbrain-pro-benchmark
- Provide required inputs per the skill's parameter spec and get structured output
What is ClawBrain Benchmark?
It tests how your OpenClaw setup performs across 205 real-world scenarios and shows the improvement delivered by the ClawBrain v1.0 orchestration engine. It is an AI Agent Skill for Claude Code / OpenClaw, with 159 downloads so far.
How do I install ClawBrain Benchmark?
Run "/install clawbrain-pro-benchmark" in the OpenClaw or Claude Code chat to install it in one step. No extra setup is required.
Is ClawBrain Benchmark free?
Yes, ClawBrain Benchmark is completely free, licensed under MIT-0. You can download, install and use it at no cost.
Which platforms does ClawBrain Benchmark support?
ClawBrain Benchmark is cross-platform and runs anywhere OpenClaw / Claude Code is available.
Who created ClawBrain Benchmark?
It is built and maintained by michaelfeng (@michaelfeng); the current version is v1.0.2.