Anthropic Cost Optimizer
/install anthropic-cost-optimizer
OpenClaw Cost Optimizer
Analyzes the user's OpenClaw config and rewrites it to minimize Anthropic API
spend. Works on YAML and JSON configs. Reads pricing tables from
references/pricing.md before starting.
Step 0 — Load pricing reference
Read references/pricing.md now. It contains current model rates and the
savings formulas used for estimates. Do this before any analysis.
Step 1 — Locate the config
Check these locations in order, stop at the first match:
- Path the user explicitly provides
./openclaw.yamlor./openclaw.jsonin the current working directory~/.openclaw/config.yaml~/.openclaw/openclaw.yaml~/openclaw.yaml
If none found, ask the user to paste their config directly.
Step 2 — Audit: the five cost levers
Work through every lever. Flag each issue with severity HIGH / MEDIUM / LOW.
Lever 1 — Prompt caching (HIGH impact, ~60–90% input cost reduction)
Flag HIGH if any of these are missing for Anthropic models:
cacheRetentionnot set, or set to"none"
Fix: Add cacheRetention: "long" for the primary model. Use "short" (5 min)
for agents that handle fast-changing context, "long" (1 hr) for everything else.
agents:
defaults:
models:
anthropic/claude-opus-4-6:
params:
cacheRetention: "long"
Note: caching is API-key only. Subscription setup-tokens do not honor it.
Lever 2 — Model routing (HIGH impact)
Flag HIGH if: a single model (especially Opus) is used for ALL agents with no per-agent overrides.
Fix: Route by task complexity:
| Agent type | Recommended model |
|---|---|
| Main reasoning / planning | anthropic/claude-opus-4-6 |
| Coding, editing, review | anthropic/claude-sonnet-4-6 |
| Linting, search, triage, classify | anthropic/claude-haiku-4-5 |
agents:
list:
- id: planner
params:
model: anthropic/claude-opus-4-6
- id: coder
params:
model: anthropic/claude-sonnet-4-6
- id: reviewer
params:
model: anthropic/claude-sonnet-4-6
- id: triage
params:
model: anthropic/claude-haiku-4-5
Lever 3 — Thinking level (MEDIUM impact)
Flag MEDIUM if: thinking: "adaptive" or thinking: "high" is set as the
default for ALL agents, including simple ones.
Fix: Scope thinking: "adaptive" only to the primary reasoning agent.
Set thinking: "low" or omit for utility agents.
agents:
defaults:
models:
anthropic/claude-opus-4-6:
params:
thinking: "low" # default: low
list:
- id: planner
params:
thinking: "adaptive" # override only where needed
Lever 4 — 1M context window (MEDIUM impact)
Flag MEDIUM if: context1m: true is set globally or for agents that
don't need it.
The 1M context beta header triggers surcharge pricing for Anthropic API calls. Most agents don't need it.
Fix: Remove context1m: true from the global defaults. Re-add only to
agents that demonstrably require >200K tokens of context.
Lever 5 — Fast mode (LOW impact, but easy win)
Flag LOW if: fastMode is not enabled for Sonnet agents doing quick tasks.
Fix: Add fastMode: true to Sonnet model params for agents doing rapid
back-and-forth tasks (search, triage, review). This reduces latency and can
lower costs on high-frequency agents.
models:
anthropic/claude-sonnet-4-6:
params:
fastMode: true
cacheRetention: "long"
Step 3 — Generate the optimized config
Rewrite the full config applying all fixes. Preserve every key the user had that is not being changed. Add comments above each changed block explaining what changed and why.
Format:
# COST OPTIMIZER — changed blocks annotated below
# Original: \x3Cwhat it was> → Optimized: \x3Cwhat it is now>
Step 4 — Show the diff and cost estimate
Before writing anything, show the user:
- Issues found — list each one with severity and one-line description
- Estimated monthly savings — use the formula from
references/pricing.mdwith assumed 500K tokens/day baseline - Diff — show only the changed blocks (not the full file unless it's short)
Example output format:
Issues found (3):
[HIGH] No prompt caching — estimated +$340/mo at current usage
[HIGH] All agents using Opus — Haiku suitable for triage/review agents
[MEDIUM] thinking: adaptive set globally — limits to reasoning agent only
Estimated savings: ~$290/mo (68% reduction)
Before: ~$430/mo → After: ~$140/mo
Changed blocks:
+ agents.defaults.models.anthropic/claude-opus-4-6.params.cacheRetention: "long"
+ agents.list[triage].params.model: anthropic/claude-haiku-4-5
~ agents.list[planner].params.thinking: adaptive → low (default); kept adaptive on planner only
Step 5 — Confirm and write
Ask the user: "Apply these changes to your config?"
- If yes: write the optimized config back to the same file path. Then tell the
user to run
openclaw gateway restart(or it will auto-restart if config watch is enabled). - If no / partial: apply only confirmed changes.
- If they want to review first: show the full optimized YAML before writing.
Edge cases
- Subscription setup-token auth: Note that
cacheRetentionhas no effect with subscription tokens — it's API-key only. Tell the user if their auth config usessk-ant-oat-*tokens. - Multiple agents with no IDs: If the config has unnamed agents, suggest adding IDs to enable per-agent model routing.
- Non-Anthropic providers: Only audit Anthropic model blocks. Leave OpenAI, Gemini, and other provider blocks untouched.
- Config not found: If no config file is found and no paste provided, offer to generate a fresh cost-optimized starter config instead.
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install anthropic-cost-optimizer - 安装完成后,直接呼叫该 Skill 的名称或使用
/anthropic-cost-optimizer触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Anthropic Cost Optimizer 是什么?
Audits and rewrites your OpenClaw config to minimize API token costs. Use this skill whenever the user mentions high bills, API costs, billing changes, "too... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 107 次。
如何安装 Anthropic Cost Optimizer?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install anthropic-cost-optimizer」即可一键安装,无需额外配置。
Anthropic Cost Optimizer 是免费的吗?
是的,Anthropic Cost Optimizer 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Anthropic Cost Optimizer 支持哪些平台?
Anthropic Cost Optimizer 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Anthropic Cost Optimizer?
由 Next Frontier AI(@nextfrontierbuilds)开发并维护,当前版本 v1.0.0。