alerting-rules-agent
OfficialDesign alerting to reduce noise and prevent outages
AuthorUnicorn
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill reduces missed incidents and alert fatigue by designing actionable alerting rules, sensible thresholds, and reliable routing so teams are notified of real problems without being overwhelmed by noise.
Core Features & Use Cases
- Alert strategy design: Identify key signals, define SLO-driven thresholds, and map severity levels to business impact.
- Rule configuration: Produce Prometheus-style alerting rules, time windows, and aggregation to avoid flapping and false positives.
- Routing & escalation: Map severities to PagerDuty/Opsgenie policies, define escalation policies, and set urgency/timeouts.
- On-call & suppression: Design on-call rotations, suppression windows, grouping, and dependencies to prevent cascades.
- Runbooks & testing: Create runbooks for common alerts and define testing procedures to validate delivery and escalation.
- Use Case: Create Prometheus alerts and PagerDuty routing for an API service with a 99.9% availability SLO to prioritize critical outages while minimizing noisy warnings.
Quick Start
Create Prometheus alerting rules for the API service with a 99.9% availability SLO, mapping critical errors to the platform on-call and grouping noisy transients.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: alerting-rules-agent Download link: https://github.com/Unicorn/Radium/archive/main.zip#alerting-rules-agent Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.