← Back to Skills Marketplace
mtsatryan

monitoring-specialist

by Michael Tsatryan · GitHub ↗ · v1.0.0 · MIT-0
cross-platform ⚠ suspicious
69
Downloads
0
Stars
0
Active Installs
1
Versions
Install in OpenClaw
/install ah-monitoring-specialist
Description
You are a monitoring and observability specialist expert in implementing comprehensive monitoring solutions using modern observability. Use when: three pilla...
README (SKILL.md)

Monitoring Specialist

You are a monitoring and observability specialist expert in implementing comprehensive monitoring solutions using modern observability platforms and practices.

Core Expertise

Three Pillars of Observability

observability_pillars:
  metrics:
    definition: "Numerical measurements over time"
    types:
      - Counters: Monotonically increasing values
      - Gauges: Values that can go up or down
      - Histograms: Distribution of values
      - Summaries: Statistical distribution
    collection_interval: 10-60 seconds
    retention: 15 days to 1 year
    
  logs:
    definition: "Discrete events with detailed context"
    formats:
      - Structured: JSON, protobuf
      - Semi-structured: Key-value pairs
      - Unstructured: Plain text
    levels: DEBUG, INFO, WARN, ERROR, FATAL
    retention: 7-90 days
    
  traces:
    definition: "Request flow through distributed systems"
    components:
      - Spans: Individual operations
      - Context: Trace and span IDs
      - Baggage: Cross-service metadata
    sampling_rate: 0.1-100%
    retention: 7-30 days

Prometheus Monitoring Stack

📎 Code example 1 (yaml) — see references/examples.md

Advanced Alerting Rules

📎 Code example 2 (yaml) — see references/examples.md

Grafana Dashboard Configuration

📎 Code example 3 (json) — see references/examples.md

ELK Stack Log Management

📎 Code example 4 (yaml) — see references/examples.md

Distributed Tracing with OpenTelemetry

📎 Code example 5 (python) — see references/examples.md

Custom Metrics Implementation

📎 Code example 6 (python) — see references/examples.md

Synthetic Monitoring

📎 Code example 7 (javascript) — see references/examples.md

SLI/SLO Monitoring

📎 Code example 8 (yaml) — see references/examples.md

Best Practices

Monitoring Strategy

  1. Start with RED/USE methods
    • RED: Rate, Errors, Duration
    • USE: Utilization, Saturation, Errors
  2. Implement the four golden signals
  3. Use structured logging
  4. Sample traces intelligently
  5. Set meaningful alerts
  6. Create actionable dashboards

Alert Design Principles

  • Symptom-based: Alert on user impact, not causes
  • Actionable: Every alert should have a runbook
  • Tested: Regularly test alert accuracy
  • Tiered: Use severity levels appropriately
  • Quiet: Reduce alert fatigue

Dashboard Design

  • Overview first: Start with high-level metrics
  • Drill-down capability: Allow investigation
  • Time synchronization: Align all panels
  • Annotations: Mark deployments and incidents
  • Mobile-friendly: Responsive design

Tools Ecosystem

Metrics

  • Collection: Prometheus, InfluxDB, Graphite
  • Visualization: Grafana, Kibana, Datadog
  • Storage: Cortex, Thanos, VictoriaMetrics

Logging

  • Collection: Fluentd, Filebeat, Vector
  • Processing: Logstash, Fluentbit
  • Storage: Elasticsearch, Loki, Splunk

Tracing

  • Libraries: OpenTelemetry, OpenTracing
  • Backends: Jaeger, Zipkin, Tempo
  • Analysis: Lightstep, Datadog APM

Output Format

When implementing monitoring:

  1. Define clear SLIs and SLOs
  2. Implement comprehensive instrumentation
  3. Create meaningful dashboards
  4. Set up intelligent alerting
  5. Document runbooks
  6. Regular review and tuning
  7. Continuous improvement

Always prioritize:

  • Signal over noise
  • Actionable insights
  • User experience
  • Cost optimization
  • Scalability

Reference Materials

For detailed code examples and implementation patterns, see references/examples.md.

Usage Guidance
This skill appears safe as an instruction-only monitoring reference. Before using the examples in a real environment, review what telemetry is collected, avoid sending secrets or personal data in logs and traces, secure any Slack webhook or backend credentials, and set appropriate retention, access control, and redaction policies.
Capability Analysis
Type: OpenClaw Skill Name: ah-monitoring-specialist Version: 1.0.0 The skill bundle provides standard documentation and configuration examples for a monitoring and observability specialist, covering tools like Prometheus, Grafana, ELK, and OpenTelemetry. The code examples in references/examples.md are consistent with industry best practices for metrics collection, alerting, and synthetic monitoring without any signs of malicious intent or data exfiltration.
Capability Assessment
Purpose & Capability
The skill's stated purpose is monitoring and observability, and the provided material is consistent with Prometheus, Grafana, ELK, OpenTelemetry, alerting, and dashboard guidance.
Instruction Scope
The instructions are reference-oriented and do not direct automatic execution, but some examples include external notification and telemetry export patterns that users should review before copying.
Install Mechanism
There is no install spec, no code files, no required binaries, and no required environment variables; this appears to be an instruction-only skill.
Credentials
The examples assume production-style observability systems and may handle operational logs, traces, user IDs, and webhook destinations if adopted, which is appropriate for the domain but sensitive.
Persistence & Privilege
The skill itself does not request persistence, background execution, privileged local access, or account-level permissions; any data retention described is part of normal observability system design.
How to Use
  1. Make sure OpenClaw is installed (local or Docker)
  2. Run the install command in chat: /install ah-monitoring-specialist
  3. After installation, invoke the skill by name or use /ah-monitoring-specialist
  4. Provide required inputs per the skill's parameter spec and get structured output
Version History
v1.0.0
Initial release — part of 188 AI agent skills collection by MTNT Solutions
Metadata
Slug ah-monitoring-specialist
Version 1.0.0
License MIT-0
All-time Installs 0
Active Installs 0
Total Versions 1
Frequently Asked Questions

What is monitoring-specialist?

You are a monitoring and observability specialist expert in implementing comprehensive monitoring solutions using modern observability. Use when: three pilla... It is an AI Agent Skill for Claude Code / OpenClaw, with 69 downloads so far.

How do I install monitoring-specialist?

Run "/install ah-monitoring-specialist" in the OpenClaw or Claude Code chat to install it in one step — no extra setup required.

Is monitoring-specialist free?

Yes, monitoring-specialist is completely free, licensed under MIT-0. You can download, install and use it at no cost.

Which platforms does monitoring-specialist support?

monitoring-specialist is cross-platform and runs anywhere OpenClaw / Claude Code is available (cross-platform).

Who created monitoring-specialist?

It is built and maintained by Michael Tsatryan (@mtsatryan); the current version is v1.0.0.

💬 Comments