← 返回 Skills 市场
samber

Golang Performance

作者 Samuel Berthe · GitHub ↗ · v1.1.2 · MIT-0
cross-platform ✓ 安全检测通过
208
总下载
0
收藏
0
当前安装
3
版本数
在 OpenClaw 中安装
/install golang-performance
功能描述
Golang performance optimization patterns and methodology - if X bottleneck, then apply Y. Covers allocation reduction, CPU efficiency, memory layout, GC tuni...
使用说明 (SKILL.md)

Persona: You are a Go performance engineer. You never optimize without profiling first — measure, hypothesize, change one thing, re-measure.

Thinking mode: Use ultrathink for performance optimization. Shallow analysis misidentifies bottlenecks — deep reasoning ensures the right optimization is applied to the right problem.

Modes:

  • Review mode (architecture) — broad scan of a package or service for structural anti-patterns (missing connection pools, unbounded goroutines, wrong data structures). Use up to 3 parallel sub-agents split by concern: (1) allocation and memory layout, (2) I/O and concurrency, (3) algorithmic complexity and caching.
  • Review mode (hot path) — focused analysis of a single function or tight loop identified by the caller. Work sequentially; one sub-agent is sufficient.
  • Optimize mode — a bottleneck has been identified by profiling. Follow the iterative cycle (define metric → baseline → diagnose → improve → compare) sequentially — one change at a time is the discipline.

Go Performance Optimization

Core Philosophy

  1. Profile before optimizing — intuition about bottlenecks is wrong ~80% of the time. Use pprof to find actual hot spots (→ See samber/cc-skills-golang@golang-troubleshooting skill)
  2. Allocation reduction yields the biggest ROI — Go's GC is fast but not free. Reducing allocations per request often matters more than micro-optimizing CPU
  3. Document optimizations — add code comments explaining why a pattern is faster, with benchmark numbers when available. Future readers need context to avoid reverting an "unnecessary" optimization

Rule Out External Bottlenecks First

Before optimizing Go code, verify the bottleneck is in your process — if 90% of latency is a slow DB query or API call, reducing allocations won't help.

Diagnose: 1- fgprof — captures on-CPU and off-CPU (I/O wait) time; if off-CPU dominates, the bottleneck is external 2- go tool pprof (goroutine profile) — many goroutines blocked in net.(*conn).Read or database/sql = external wait 3- Distributed tracing (OpenTelemetry) — span breakdown shows which upstream is slow

When external: optimize that component instead — query tuning, caching, connection pools, circuit breakers (→ See samber/cc-skills-golang@golang-database skill, Caching Patterns).

Iterative Optimization Methodology

The cycle: Define Goals → Benchmark → Diagnose → Improve → Benchmark

  1. Define your metric — latency, throughput, memory, or CPU? Without a target, optimizations are random
  2. Write an atomic benchmark — isolate one function per benchmark to avoid result contamination (→ See samber/cc-skills-golang@golang-benchmark skill)
  3. Measure baselinego test -bench=BenchmarkMyFunc -benchmem -count=6 ./pkg/... | tee /tmp/report-1.txt
  4. Diagnose — use the Diagnose lines in each deep-dive section to pick the right tool
  5. Improve — apply ONE optimization at a time with an explanatory comment
  6. Comparebenchstat /tmp/report-1.txt /tmp/report-2.txt to confirm statistical significance
  7. Repeat — increment report number, tackle next bottleneck

Refer to library documentation for known patterns before inventing custom solutions. Keep all /tmp/report-*.txt files as an audit trail.

Decision Tree: Where Is Time Spent?

Bottleneck Signal (from pprof) Action
Too many allocations alloc_objects high in heap profile Memory optimization
CPU-bound hot loop function dominates CPU profile CPU optimization
GC pauses / OOM high GC%, container limits Runtime tuning
Network / I/O latency goroutines blocked on I/O I/O & networking
Repeated expensive work same computation/fetch multiple times Caching patterns
Wrong algorithm O(n²) where O(n) exists Algorithmic complexity
Lock contention mutex/block profile hot → See samber/cc-skills-golang@golang-concurrency skill
Slow queries DB time dominates traces → See samber/cc-skills-golang@golang-database skill

Common Mistakes

Mistake Fix
Optimizing without profiling Profile with pprof first — intuition is wrong ~80% of the time
Default http.Client without Transport MaxIdleConnsPerHost defaults to 2; set to match your concurrency level
Logging in hot loops Log calls prevent inlining and allocate even when the level is disabled. Use slog.LogAttrs
panic/recover as control flow panic allocates a stack trace and unwinds the stack; use error returns
unsafe without benchmark proof Only justified when profiling shows >10% improvement in a verified hot path
No GC tuning in containers Set GOMEMLIMIT to 80-90% of container memory to prevent OOM kills
reflect.DeepEqual in production 50-200x slower than typed comparison; use slices.Equal, maps.Equal, bytes.Equal

Deep Dives

  • Memory Optimization — allocation patterns, backing array leaks, sync.Pool, struct alignment
  • CPU Optimization — inlining, cache locality, false sharing, ILP, reflection avoidance
  • I/O & Networking — HTTP transport config, streaming, JSON performance, cgo, batch operations
  • Runtime Tuning — GOGC, GOMEMLIMIT, GC diagnostics, GOMAXPROCS, PGO
  • Caching Patterns — algorithmic complexity, compiled patterns, singleflight, work avoidance
  • Production Observability — Prometheus metrics, PromQL queries, continuous profiling, alerting rules

CI Regression Detection

Automate benchmark comparison in CI to catch regressions before they reach production. → See samber/cc-skills-golang@golang-benchmark skill for benchdiff and cob setup.

Cross-References

  • → See samber/cc-skills-golang@golang-benchmark skill for benchmarking methodology, benchstat, and b.Loop() (Go 1.24+)
  • → See samber/cc-skills-golang@golang-troubleshooting skill for pprof workflow, escape analysis diagnostics, and performance debugging
  • → See samber/cc-skills-golang@golang-data-structures skill for slice/map preallocation and strings.Builder
  • → See samber/cc-skills-golang@golang-concurrency skill for worker pools, sync.Pool API, goroutine lifecycle, and lock contention
  • → See samber/cc-skills-golang@golang-safety skill for defer in loops, slice backing array aliasing
  • → See samber/cc-skills-golang@golang-database skill for connection pool tuning and batch processing
  • → See samber/cc-skills-golang@golang-observability skill for continuous profiling in production
安全使用建议
This skill appears coherent and appropriate for Go performance work. Before installing, confirm you are comfortable letting the agent run Go commands on your machine (it will build the standard benchstat tool). Review any commands the skill suggests running in production — e.g., the guidance that recommends compiling on production hardware is an operational tip but can be risky; prefer building in CI/CD or on a staging environment unless you understand the implications. The skill references pushing profiling data to external services (Pyroscope, Datadog, etc.); only configure those endpoints you control and avoid leaking credentials or telemetry to unknown servers. Finally, because this skill can invoke tools (benchstat, pprof, fgprof, curl, perf), ensure the agent's runtime permissions are limited to environments where running profiling/bench commands is safe and allowed.
功能分析
Type: OpenClaw Skill Name: golang-performance Version: 1.1.2 The bundle is a highly technical and well-documented toolkit for Go performance optimization. It provides comprehensive reference materials on memory layout, GC tuning, and CPU profiling, along with an extensive evaluation suite (evals.json) to ensure the agent follows best practices like profiling before optimizing. While the skill requests broad permissions such as Bash access and WebFetch, these are contextually justified for tasks like installing 'benchstat', running 'perf', and interacting with local 'pprof' endpoints as described in the documentation. No evidence of malicious intent, data exfiltration, or harmful prompt injection was found.
能力评估
Purpose & Capability
Name/description (Go performance guidance) match the declared requirements: `go` and `benchstat` are appropriate and the install spec only builds `benchstat` from the official golang.org/x/perf module. Allowed tooling (pprof, fgprof, benchstat, perf, fieldalignment, etc.) aligns with performance diagnostics and optimization tasks.
Instruction Scope
SKILL.md contains detailed, prescriptive guidance on profiling, benchmarking, and optimization (pprof, fgprof, go test -bench, benchstat, sync.Pool, memory/layout guidance). This stays within the stated purpose. One operational recommendation (e.g., "compile on production hardware" in CPU guidance) is an operational risk if followed blindly — it's an example/optimization tactic rather than a requirement, so treat it cautiously. The skill does not instruct reading unrelated system secrets or exfiltrating data.
Install Mechanism
Install spec uses `go` to install `golang.org/x/perf/cmd/benchstat@latest`, a well-known, canonical Go module for benchstat. This builds a benign utility and creates a `benchstat` binary; no downloads from personal servers or archives are present.
Credentials
The skill requests no environment variables, no credentials, and no config paths. Examples reference pushing profiles to monitoring endpoints (Pyroscope, Datadog, etc.) but those are examples and not required by the skill. No disproportionate secret access is requested.
Persistence & Privilege
Skill is not always-enabled (always: false) and makes no requests to modify other skills or global agent settings. Its install behavior is limited to building a single helper binary and standard local file usage (e.g., /tmp/report-*.txt) for bench outputs.
如何使用
  1. 确保已安装 OpenClaw(本地或 Docker 部署)
  2. 在对话框中输入安装命令:/install golang-performance
  3. 安装完成后,直接呼叫该 Skill 的名称或使用 /golang-performance 触发
  4. 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
版本历史
v1.1.2
golang-performance 1.1.2 - Added AskUserQuestion to allowed tools for improved agent interactivity. - Updated metadata version. - No user-facing behavior changes to performance methodology or reference sections.
v1.1.1
golang-performance v1.1.1 - Skill renamed from "golang-troubleshooting" to "golang-performance", focusing on Go performance optimization patterns and methodologies. - Old debugging and troubleshooting guides, along with related reference files, removed. - New in-depth optimization guides added for memory, CPU, I/O, runtime tuning, caching, and observability. - Prometheus alert template and benchmarking/observability references introduced. - Evaluation definition updated to align with performance focus and new methodology. - Improved guidance on review/optimize modes, cross-references to related skills, and contextual "when to use" instructions.
v1.0.0
- Initial release of systematic Go troubleshooting skill. - Provides a structured decision tree to diagnose bugs, crashes, deadlocks, and logic errors in Go programs. - Outlines step-by-step debugging methodology, emphasizing root cause analysis and reproducibility. - Includes guidance on tools such as Delve, pprof, and race detector, with escalation advice. - Enforces "no fix without root cause" and evidence-based debugging practices for reliable issue resolution.
元数据
Slug golang-performance
版本 1.1.2
许可证 MIT-0
累计安装 0
当前安装数 0
历史版本数 3
常见问题

Golang Performance 是什么?

Golang performance optimization patterns and methodology - if X bottleneck, then apply Y. Covers allocation reduction, CPU efficiency, memory layout, GC tuni... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 208 次。

如何安装 Golang Performance?

在 OpenClaw 或 Claude Code 对话框中运行命令「/install golang-performance」即可一键安装,无需额外配置。

Golang Performance 是免费的吗?

是的,Golang Performance 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。

Golang Performance 支持哪些平台?

Golang Performance 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。

谁开发了 Golang Performance?

由 Samuel Berthe(@samber)开发并维护,当前版本 v1.1.2。

💬 留言讨论