Load Testing
/install load-testing
Load Testing (Deep Workflow)
Load tests answer whether the system meets its behavioral targets under the target load, not “how many RPS the tool prints.” Tie every run to SLOs, workload realism, and analysis that engineers can act on.
When to Offer This Workflow
Trigger conditions:
- Major launch, traffic spike season, infra resize
- Latency or timeout problems under peak load; evidence needed for capacity decisions
- Comparing architectures or debottlenecking
Initial offer:
Use seven stages: (1) goals & SLOs, (2) workload model, (3) scenarios & scripts, (4) environment & data, (5) run & observe, (6) analyze bottlenecks, (7) fixes & retest. Confirm tool (k6, Locust, Gatling, JMeter) and environment policy (prod-like staging vs synthetic).
Stage 1: Goals & SLOs
Goal: Define success in measurable terms.
Questions
- Peak RPS/users, growth assumption, duration of peak
- SLOs: p95/p99 latency, error rate, throughput per critical endpoint
- Scope: read-heavy vs write-heavy; background jobs interaction
Exit condition: Numeric targets plus an explicit out-of-scope list (e.g., “third-party API mocked”).
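As a minimal sketch of what "numeric targets" can look like once written down, the snippet below encodes per-endpoint SLOs and checks a set of measured latencies against them. The endpoint name, the thresholds, and the synthetic samples are all illustrative assumptions, and the nearest-rank percentile is one of several valid percentile definitions.

```python
from dataclasses import dataclass
import math


@dataclass(frozen=True)
class EndpointSlo:
    """Numeric targets for one critical endpoint (all values are illustrative)."""
    name: str
    p95_ms: float
    p99_ms: float
    max_error_rate: float  # fraction of failed requests, e.g. 0.01 == 1%


def percentile(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def check_slo(slo, latencies_ms, errors, total):
    """Return a pass/fail flag for each target in the SLO."""
    return {
        "p95": percentile(latencies_ms, 95) <= slo.p95_ms,
        "p99": percentile(latencies_ms, 99) <= slo.p99_ms,
        "error_rate": (errors / total) <= slo.max_error_rate,
    }


# Hypothetical endpoint and thresholds.
checkout = EndpointSlo("POST /checkout", p95_ms=300, p99_ms=800, max_error_rate=0.01)

# 100 synthetic samples: mostly fast, with a slow tail.
latencies = [120] * 90 + [250] * 8 + [900] * 2
result = check_slo(checkout, latencies, errors=1, total=100)
print(result)
```

In this synthetic data the p95 and error-rate targets pass while the p99 target fails, which is the kind of split result a single average latency would hide.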
Stage 2: Workload Model
Goal: Representative mix—not one URL forever.
Practices
- Transaction mix from analytics or access logs (proportions)
- Think time between steps for user journeys
- Payload size distribution; auth token behavior
- Spike vs soak vs step ramp—match real failure modes
Exit condition: Workload profile documented (table or script comments).
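One way to document the workload profile is as a weighted transaction mix that the load script samples from. The sketch below assumes hypothetical transaction names and proportions (real ones should come from analytics or access logs) and an exponential think-time distribution, which is a modelling assumption rather than a measured one.

```python
import random

# Hypothetical transaction mix; proportions should sum to 1.0.
TRANSACTION_MIX = {
    "browse_catalog": 0.60,
    "search": 0.25,
    "add_to_cart": 0.10,
    "checkout": 0.05,
}


def pick_transaction(rng):
    """Choose the next transaction according to the documented proportions."""
    names = list(TRANSACTION_MIX)
    weights = list(TRANSACTION_MIX.values())
    return rng.choices(names, weights=weights, k=1)[0]


def think_time_seconds(rng, mean=3.0):
    """Pause between user steps; exponential with the given mean (an assumption)."""
    return rng.expovariate(1.0 / mean)


rng = random.Random(42)  # seeded so runs are repeatable
sample = [pick_transaction(rng) for _ in range(10_000)]
observed = {name: sample.count(name) / len(sample) for name in TRANSACTION_MIX}
pause = think_time_seconds(rng)
```

Comparing `observed` against `TRANSACTION_MIX` after a run is a cheap check that the script actually generated the mix it claims to.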
Stage 3: Scenarios & Scripts
Goal: Deterministic, idempotent load scripts where possible.
Practices
- Correlate virtual user with trace/request id for debugging
- Parameterize data to avoid the cache fantasy, where every request hits the same key and results look better than prod ever will
- Order operations to match real causality (login → browse → checkout)
Pitfalls
- Client-side bottleneck (single generator machine)—distribute load generators
Exit condition: A smoke run at small scale (a handful of virtual users) validates script correctness.
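The correlation and parameterization practices above can be sketched without committing to a particular tool. The generator below gives each virtual user its own rotation through a data set and tags every request with an id that encodes the user and sequence number. The `X-Request-Id` header name, the `/products/` path, and the product ids are assumptions for illustration; use whatever your tracing setup and API actually expect.

```python
import itertools
import uuid


def request_stream(vu_id, product_ids):
    """Yield (headers, path) pairs for one virtual user.

    Each user starts at a different offset in product_ids so that
    concurrent users do not all request the same key at the same time.
    """
    offset = vu_id % len(product_ids)
    rotated = product_ids[offset:] + product_ids[:offset]
    for seq, product_id in enumerate(itertools.cycle(rotated)):
        headers = {
            # Encodes virtual user and sequence number for log/trace lookup.
            "X-Request-Id": f"vu{vu_id}-{seq}-{uuid.uuid4().hex[:8]}",
        }
        yield headers, f"/products/{product_id}"


products = [101, 102, 103, 104]  # hypothetical test data
first_three = list(itertools.islice(request_stream(vu_id=1, product_ids=products), 3))
for headers, path in first_three:
    print(headers["X-Request-Id"], path)
```

A smoke run can then assert that the ids seen in server logs match the ids the script emitted, which confirms the correlation works before scaling up.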
Stage 4: Environment & Data
Goal: Fidelity without destroying prod.
Rules
- Staging scaled proportionally to prod; feature flags aligned with prod
- Data volume within the same order of magnitude as prod so DB query plans match
- External deps: mock them, use a sandbox, or at least know their throttling limits
Exit condition: Safety checklist: no prod writes unless explicitly planned and isolated.
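Part of the safety checklist can be enforced in code rather than by convention. The following is a minimal sketch of a pre-flight guard that refuses to target a production host unless the caller opts in explicitly. The host names are hypothetical placeholders, and a real guard would likely read them from an inventory or config file.

```python
from urllib.parse import urlparse

# Hypothetical production host names; substitute your own inventory.
PROD_HOSTS = {"api.example.com", "www.example.com"}


def assert_safe_target(base_url, allow_prod=False):
    """Return the target host, or raise if it is prod and not explicitly allowed."""
    host = urlparse(base_url).hostname
    if host is None:
        raise ValueError(f"cannot parse a host from {base_url!r}")
    if host in PROD_HOSTS and not allow_prod:
        raise RuntimeError(
            f"refusing to load test production host {host}; "
            "pass allow_prod=True only for a planned, isolated run"
        )
    return host


print(assert_safe_target("https://staging.example.com/api"))
```

Calling the guard at the top of every load script makes the "no prod writes unless explicitly planned" rule fail loudly instead of silently.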
Stage 5: Run & Observe
Goal: System-wide visibility during test.
Instrumentation
- App: latency histograms, error codes, queue depth
- Infra: CPU, memory, connections, GC, disk IOPS
- DB: slow queries, locks, replication lag
- Tracing sample during test for hot spans
Exit condition: Dashboard or runbook link for the test window.
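If the application does not already export latency histograms, the idea is simple enough to sketch: count each observation into a fixed set of buckets so the distribution, not just the mean, is visible during the test window. The bucket bounds below are illustrative assumptions; most metrics libraries provide an equivalent histogram type that should be preferred in practice.

```python
import bisect

# Illustrative upper bounds in milliseconds; the final slot counts overflow.
BUCKET_UPPER_BOUNDS_MS = [50, 100, 250, 500, 1000]


def histogram(latencies_ms):
    """Count samples per bucket; a value equal to a bound falls in that bound's bucket."""
    counts = [0] * (len(BUCKET_UPPER_BOUNDS_MS) + 1)
    for value in latencies_ms:
        counts[bisect.bisect_left(BUCKET_UPPER_BOUNDS_MS, value)] += 1
    return counts


counts = histogram([10, 50, 51, 250, 999, 1000, 5000])
labels = [f"<={bound}ms" for bound in BUCKET_UPPER_BOUNDS_MS] + [">1000ms"]
for label, count in zip(labels, counts):
    print(f"{label:>9}: {count}")
```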
Stage 6: Analyze Bottlenecks
Goal: Identify dominant constraint: app, DB, network, dependency.
Process
- Utilization vs saturation (e.g., CPU is high but threads are waiting on locks, which calls for a different fix)
- Compare p95 vs max; the tail is often a separate issue
- Reproduce bottleneck with smaller experiment when unclear
Exit condition: Written hypothesis with evidence (graphs, trace ids).
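The p95-versus-max comparison can be automated as a first-pass filter. This sketch flags a run whose worst sample is far above its p95, which suggests the tail has a separate cause (for example a timeout or a GC pause) from the bulk of the distribution. The ratio threshold of 5 and the sample data are arbitrary assumptions, not established cutoffs.

```python
import math


def nearest_rank(samples, pct):
    """Nearest-rank percentile of a non-empty list of numbers."""
    ordered = sorted(samples)
    rank = max(1, math.ceil(pct * len(ordered) / 100))
    return ordered[rank - 1]


def tail_report(latencies_ms, ratio_threshold=5.0):
    """Summarize the distribution and flag a max that dwarfs the p95."""
    p50 = nearest_rank(latencies_ms, 50)
    p95 = nearest_rank(latencies_ms, 95)
    worst = max(latencies_ms)
    return {
        "p50": p50,
        "p95": p95,
        "max": worst,
        "tail_outlier": worst > ratio_threshold * p95,
    }


# Synthetic run: steady latencies plus one very slow request.
samples = [100] * 95 + [140] * 4 + [4000]
report = tail_report(samples)
print(report)
```

A flagged run is a prompt to pull the trace ids of the slowest requests, not a diagnosis in itself.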
Stage 7: Fixes & Retest
Goal: Controlled changes with retest protocol.
Practices
- One major change per retest when debugging
- Document baseline vs after-change results so they feed regression tracking and capacity planning
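A baseline-versus-after record can be as small as a per-metric percent change. The sketch below assumes each run is summarized as a flat dict of metric names to numbers; the metric names and values are made up for illustration.

```python
def compare_runs(baseline, after):
    """Percent change per shared metric; negative means the value went down."""
    return {
        metric: round((after[metric] - baseline[metric]) / baseline[metric] * 100, 1)
        for metric in baseline
        if metric in after
    }


# Hypothetical summaries of two runs with one change between them.
baseline = {"p95_ms": 400.0, "error_rate": 0.02, "rps": 800.0}
after = {"p95_ms": 300.0, "error_rate": 0.01, "rps": 1000.0}

delta = compare_runs(baseline, after)
for metric, change in delta.items():
    print(f"{metric}: {change:+.1f}%")
```

Keeping one change per retest, as recommended above, is what makes a delta like this attributable to a specific fix.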
Final Review Checklist
- SLO-aligned goals and workload mix
- Realistic scenarios; distributed load if needed
- Environment safe and representative enough
- Full-stack observability during runs
- Bottleneck analysis leads to actionable tickets
Tips for Effective Guidance
- Match cache state to prod: warm caches explicitly if prod is always warm, and do not pre-warm if it is not, since a warm test cache then yields misleadingly good numbers.
- Throughput without a latency SLO is meaningless.
- Call out coordination overhead (locks, hot keys) vs raw CPU.
Handling Deviations
- Cannot match prod data: state assumptions and treat results as directional only.
- Serverless: account for cold starts and account-level concurrency limits when interpreting results.
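- Make sure OpenClaw is installed (locally or via Docker).
- Enter the install command in the chat box: /install load-testing
- After installation, invoke the Skill by name or trigger it with /load-testing
- Provide the required inputs according to the Skill's parameter description to get structured output.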
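What is Load Testing?
Deep load testing workflow: goals and SLOs, workload modeling, scenario design, environment fidelity, execution, metrics interpretation, and bottlenecks to fi... It is an AI Agent Skill plugin for Claude Code / OpenClaw, with 191 cumulative downloads to date.
How do I install Load Testing?
Run the command "/install load-testing" in the OpenClaw or Claude Code chat box to install it in one step; no additional configuration is required.
Is Load Testing free?
Yes. Load Testing is completely free and released under the MIT-0 license, so it can be downloaded, installed, and used freely.
Which platforms does Load Testing support?
Load Testing is cross-platform and can be used in any environment where OpenClaw / Claude Code is deployed.
Who developed Load Testing?
It is developed and maintained by codenova58 (@codenova58); the current version is v1.0.0.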