alerting-rules-agent

Official

Design alerting to reduce noise and prevent outages

AuthorUnicorn
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill reduces missed incidents and alert fatigue by designing actionable alerting rules, sensible thresholds, and reliable routing so teams are notified of real problems without being overwhelmed by noise.

Core Features & Use Cases

  • Alert strategy design: Identify key signals, define SLO-driven thresholds, and map severity levels to business impact.
  • Rule configuration: Produce Prometheus-style alerting rules, time windows, and aggregation to avoid flapping and false positives.
  • Routing & escalation: Map severities to PagerDuty/Opsgenie policies, define escalation policies, and set urgency/timeouts.
  • On-call & suppression: Design on-call rotations, suppression windows, grouping, and dependencies to prevent cascades.
  • Runbooks & testing: Create runbooks for common alerts and define testing procedures to validate delivery and escalation.
  • Use Case: Create Prometheus alerts and PagerDuty routing for an API service with a 99.9% availability SLO to prioritize critical outages while minimizing noisy warnings.

Quick Start

Create Prometheus alerting rules for the API service with a 99.9% availability SLO, mapping critical errors to the platform on-call and grouping noisy transients.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: alerting-rules-agent
Download link: https://github.com/Unicorn/Radium/archive/main.zip#alerting-rules-agent

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.