Incident Triage
/install incident-triage
Incident Triage
Structured incident triage for alerts from any monitoring source. Five steps, consistent every time.
Pass in the raw alert message, a link to the alert, or a description of what's happening.
Triage Process
When an alert appears:
- Classify — what type and severity?
- Scope — blast radius: who's affected, which environment, since when?
- Correlate — what changed recently? Check deploys, merges, config changes
- Investigate — guided checks based on alert type
- Act — summarize, create ticket, escalate or close
Read references/triage-framework.md for the full framework with checklists and bash snippets for each step.
Alert Parsing
Before starting the triage framework, identify the alert source and extract key fields.
Read references/alert-patterns.md for patterns covering PagerDuty, Datadog, CloudWatch, Sentry, uptime monitors, GitHub Actions, AWS SNS/EventBridge, and custom webhooks.
Escalation
When to page, when to watch, when to close. Severity-based response times and communication templates.
Read references/escalation-guide.md for defaults — customize for your team's on-call structure.
Runbook
During Step 4 (Investigate), load references/runbook-template.md to find service health endpoints, dashboards, log locations, and common fixes.
⚠️ This file is a template — it must be filled in before use. If it still contains
\x3C!--placeholder comments, tell the user to populate it with their actual infrastructure before relying on it during an incident.
Scripts
The scripts/ directory contains helper scripts for the correlation and action steps:
scripts/correlate-recent-deploys.sh— list recent CI/CD runs for a repo (Step 3)scripts/correlate-recent-merges.sh— list recently merged PRs for a repo (Step 3)scripts/create-incident-issue.sh— create a GitHub incident issue (Step 5)
Works Well With
- github (Step 3 — Correlate): check recent deploys, merged PRs, and CI run history for affected repos
- aws-ecs-monitor (Step 4 — Investigate): ECS service health, ALB targets, and CloudWatch logs for downtime and resource alerts
- gh-issues (Step 5 — Act): create incident tickets automatically
References
- references/triage-framework.md — full 5-step triage process with checklists
- references/alert-patterns.md — parsing alerts from common sources
- references/escalation-guide.md — severity levels, response times, escalation policy
- references/runbook-template.md — your infrastructure map (⚠️ fill in before use)
- 确保已安装 OpenClaw(本地或 Docker 部署)
- 在对话框中输入安装命令:
/install incident-triage - 安装完成后,直接呼叫该 Skill 的名称或使用
/incident-triage触发 - 根据 Skill 的参数说明提供必要输入,即可获得结构化输出
Incident Triage 是什么?
Structured incident triage for alerts from any monitoring source. Five-step framework: classify severity, scope blast radius, correlate with recent changes,... 它是一个面向 Claude Code / OpenClaw 的 AI Agent Skill 插件,目前累计下载 152 次。
如何安装 Incident Triage?
在 OpenClaw 或 Claude Code 对话框中运行命令「/install incident-triage」即可一键安装,无需额外配置。
Incident Triage 是免费的吗?
是的,Incident Triage 完全免费,采用 MIT-0 许可证,可自由下载、安装和使用。
Incident Triage 支持哪些平台?
Incident Triage 跨平台运行,可在任意部署了 OpenClaw / Claude Code 的环境中使用(cross-platform)。
谁开发了 Incident Triage?
由 Grace Gettert(@ggettert)开发并维护,当前版本 v0.3.0。