← 返回 Skills 市场
codenova58

Load Testing

作者 codenova58 · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ✓ 安全检测通过
191
总下载
0
收藏
0
当前安装
1
版本数
在 OpenClaw 中安装
/install load-testing
功能描述
Deep load testing workflow—goals and SLOs, workload modeling, scenario design, environment fidelity, execution, metrics interpretation, and bottlenecks to fi...
使用说明 (SKILL.md)

Load Testing (Deep Workflow)

Load tests answer whether the system meets behavior under target load—not “how many RPS the tool prints.” Tie every run to SLOs, workload realism, and analysis that engineers can act on.

When to Offer This Workflow

Trigger conditions:

  • Major launch, traffic spike season, infra resize
  • Latency/timeout under peak; need evidence for capacity decisions
  • Comparing architectures or debottlenecking

Initial offer:

Use seven stages: (1) goals & SLOs, (2) workload model, (3) scenarios & scripts, (4) environment & data, (5) run & observe, (6) analyze bottlenecks, (7) fixes & retest. Confirm tool (k6, Locust, Gatling, JMeter) and environment policy (prod-like staging vs synthetic).


Stage 1: Goals & SLOs

Goal: Define success in measurable terms.

Questions

  1. Peak RPS/users, growth assumption, duration of peak
  2. SLOs: p95/p99 latency, error rate, throughput per critical endpoint
  3. Scope: read-heavy vs write-heavy; background jobs interaction

Exit condition: Numeric targets + out of scope (e.g., “third-party API mocked”).


Stage 2: Workload Model

Goal: Representative mix—not one URL forever.

Practices

  • Transaction mix from analytics or access logs (proportions)
  • Think time between steps for user journeys
  • Payload size distribution; auth token behavior
  • Spike vs soak vs step ramp—match real failure modes

Exit condition: Workload profile documented (table or script comments).


Stage 3: Scenarios & Scripts

Goal: Deterministic, idempotent load scripts where possible.

Practices

  • Correlate virtual user with trace/request id for debugging
  • Parameterize data to avoid cache fantasy (every request hits same key)
  • Order operations to match real causality (login → browse → checkout)

Pitfalls

  • Client-side bottleneck (single generator machine)—distribute load generators

Exit condition: Smoke run at small k validates script correctness.


Stage 4: Environment & Data

Goal: Fidelity without destroying prod.

Rules

  • Staging scale proportional; feature flags aligned
  • Data volume similar order-of-magnitude for DB plans
  • External deps: mock, sandbox, or throttle awareness

Exit condition: Safety checklist: no prod writes unless explicitly planned and isolated.


Stage 5: Run & Observe

Goal: System-wide visibility during test.

Instrumentation

  • App: latency histograms, error codes, queue depth
  • Infra: CPU, memory, connections, GC, disk IOPS
  • DB: slow queries, locks, replication lag
  • Tracing sample during test for hot spans

Exit condition: Dashboard or runbook link for the test window.


Stage 6: Analyze Bottlenecks

Goal: Identify dominant constraint: app, DB, network, dependency.

Process

  • Utilization vs saturation (e.g., CPU high but wait on locks—different fix)
  • Compare p95 vs maxtail often separate issue
  • Reproduce bottleneck with smaller experiment when unclear

Exit condition: Written hypothesis with evidence (graphs, trace ids).


Stage 7: Fixes & Retest

Goal: Controlled changes with retest protocol.

Practices

  • One major change per retest when debugging
  • Document baseline vs after for regression to capacity planning

Final Review Checklist

  • SLO-aligned goals and workload mix
  • Realistic scenarios; distributed load if needed
  • Environment safe and representative enough
  • Full-stack observability during runs
  • Bottleneck analysis leads to actionable tickets

Tips for Effective Guidance

  • Warm caches explicitly if prod is always warm—otherwise misleading good numbers.
  • Throughput without latency SLO is meaningless.
  • Call out coordination overhead (locks, hot keys) vs raw CPU.

Handling Deviations

  • Cannot match prod data: state assumptions and test directional only.
  • Serverless: account for cold start and account concurrency limits in interpretation.
安全使用建议
This skill is coherent and appears safe as a set of instructions. Before using it, ensure you: (1) only execute actual load tests from authorized, controlled environments (avoid uncoordinated tests against production or third-party services), (2) confirm any chosen load tools and test runners exist in your environment and have appropriate network permissions, (3) coordinate with observability/ops teams and throttles/mocks for external dependencies, and (4) review any concrete run commands the agent might execute (the skill provides guidance but an agent could still issue commands if given permission). If you plan to let an agent autonomously run tests, explicitly restrict targets, traffic limits, and scheduling to prevent accidental disruption.
功能分析
Type: OpenClaw Skill Name: load-testing Version: 1.0.0 The skill bundle contains only metadata and a markdown file (SKILL.md) outlining a standard professional workflow for load testing. It lacks any executable code, network requests, or suspicious instructions, serving purely as a conceptual guide for an AI agent to assist users with performance engineering.
能力评估
Purpose & Capability
Name and description (deep load-testing workflow) match the SKILL.md content. The document contains a coherent seven-stage load-testing process and does not request unrelated capabilities, binaries, or cloud credentials.
Instruction Scope
SKILL.md is purely procedural guidance (goals, workload modeling, scenarios, instrumentation, analysis, retest). It does not instruct the agent to read arbitrary files, access environment variables, or call external endpoints beyond normal load-testing tooling decisions; scope is limited to advising the user on how to run tests.
Install Mechanism
No install spec or code files are present. Being instruction-only, the skill does not write files or download code during install.
Credentials
No required environment variables, credentials, or config paths are declared or referenced. The guidance discusses tooling choices (k6, Locust, Gatling, JMeter) but does not request secrets or unrelated service tokens.
Persistence & Privilege
always is false and there is no indication the skill modifies agent/system configuration or requires persistent presence. It does not request elevated or persistent privileges.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install load-testing
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /load-testing 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.0.0
Initial release provides a comprehensive, actionable workflow for deep load testing. - Covers goals, SLOs, workload modeling, scenario creation, environment setup, execution, metrics analysis, and post-test fixes. - Includes specific exit conditions and checklists for each stage. - Designed to validate capacity, support launches, investigate latency, and guide architecture decisions. - Offers practical tips for realism, safety, and effective root-cause analysis.
元数据
Slug load-testing
版本 1.0.0
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 1
常见问题

Load Testing 是什么?

Deep load testing workflow—goals and SLOs, workload modeling, scenario design, environment fidelity, execution, metrics interpretation, and bottlenecks to fi... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 191 次。

如何安装 Load Testing?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install load-testing」即可一键安装,无需额外配置。

Load Testing 是免费的吗?

是的,Load Testing 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Load Testing 支持哪些平台?

Load Testing 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Load Testing?

由 codenova58(@codenova58)开发并维护,当前版本 v1.0.0。

💬 留言讨论