uptime-monitoring-specialist
Community
Detect outages and maintain service availability
System Documentation
What problem does it solve?
Without reliable monitoring and alerting, production services can suffer undetected outages, slowdowns, and unclear incident ownership. This Skill provides a structured approach to implementing health checks, continuous uptime monitoring, multi-region checks, escalation policies, and public status reporting, so teams can detect, respond to, and learn from availability incidents quickly.
Core Features & Use Cases
- Health Check Design: Guidance for implementing /health, /ready, and /alive endpoints that verify critical dependencies and return appropriate status codes and diagnostics.
- Monitoring Configuration: Instructions for configuring external monitors (UptimeRobot, Pingdom, Better Uptime) with intervals, expected responses, and multi-region probes to detect regional outages.
- Alerting and Incident Management: Patterns for alert thresholds, escalation policies, PagerDuty/Slack integrations, on-call rotations, and status page publishing to communicate incidents and SLA impact.
- SLA & Metrics Tracking: Methods to compute uptime percentages, track MTTR, and query incident metrics for monthly reporting and compliance checks.
- Troubleshooting Playbook: Stepwise checks for DNS, SSL, cloud platform service status, and log review to accelerate root cause identification and recovery.
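As an illustration of the SLA & Metrics Tracking item above, here is a minimal sketch of computing an uptime percentage and MTTR from incident records. The `Incident` type and both function names are hypothetical (they are not part of this Skill), and the sketch assumes incidents do not overlap in time.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import List


@dataclass
class Incident:
    """Hypothetical incident record: when the outage began and when it was resolved."""
    started: datetime
    resolved: datetime


def uptime_percent(incidents: List[Incident],
                   period_start: datetime,
                   period_end: datetime) -> float:
    """Percentage of the reporting period during which the service was up.

    Assumes incidents do not overlap; overlapping incidents would be double counted.
    """
    total = (period_end - period_start).total_seconds()
    if total <= 0:
        raise ValueError("period_end must be after period_start")
    downtime = 0.0
    for inc in incidents:
        # Clip each incident to the reporting period before counting it.
        start = max(inc.started, period_start)
        end = min(inc.resolved, period_end)
        if end > start:
            downtime += (end - start).total_seconds()
    return 100.0 * (total - downtime) / total


def mttr(incidents: List[Incident]) -> timedelta:
    """Mean time to recovery: average duration from incident start to resolution."""
    if not incidents:
        return timedelta(0)
    total = sum((inc.resolved - inc.started for inc in incidents), timedelta(0))
    return total / len(incidents)
```

For example, two incidents of 30 and 90 minutes in a 30-day month amount to 120 minutes of downtime out of 43,200, which is roughly 99.72% uptime with an MTTR of 60 minutes.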
Quick Start
Configure a /health endpoint that validates core dependencies and add an external uptime monitor to check that endpoint every 1-5 minutes with alerts to your on-call channel.
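A minimal sketch of such a /health endpoint, using only the Python standard library, might look like the following. The `check_database` function, the `CHECKS` registry, the response body shape, and the default port are illustrative assumptions rather than anything this Skill prescribes; a real check would probe the actual dependency.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer
from typing import Callable, Dict, Tuple


def check_database() -> bool:
    """Hypothetical dependency check; replace with a real probe (e.g. a trivial query)."""
    return True


# Assumed registry of named dependency checks; each returns True when healthy.
CHECKS: Dict[str, Callable[[], bool]] = {"database": check_database}


def build_health_report(checks: Dict[str, Callable[[], bool]]) -> Tuple[int, dict]:
    """Run every check and return (HTTP status code, JSON-serializable body)."""
    results = {}
    for name, check in checks.items():
        try:
            results[name] = "ok" if check() else "fail"
        except Exception as exc:
            # A check that raises is reported as unhealthy, not allowed to crash the endpoint.
            results[name] = f"error: {exc}"
    healthy = all(value == "ok" for value in results.values())
    body = {"status": "ok" if healthy else "degraded", "checks": results}
    # 503 lets external monitors detect failure from the status code alone.
    return (200 if healthy else 503), body


class HealthHandler(BaseHTTPRequestHandler):
    def do_GET(self) -> None:
        if self.path != "/health":
            self.send_error(404)
            return
        code, body = build_health_report(CHECKS)
        payload = json.dumps(body).encode("utf-8")
        self.send_response(code)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)


def make_server(port: int = 8080) -> HTTPServer:
    """Build (but do not start) a server exposing /health on the given port."""
    return HTTPServer(("0.0.0.0", port), HealthHandler)
```

Calling `make_server().serve_forever()` would start serving; an external monitor can then be pointed at `/health` and configured to expect a 200 response, treating 503 or a timeout as an outage.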
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill: Name: uptime-monitoring-specialist Download link: https://github.com/Whaleylaw/llm-lawyer/archive/main.zip#uptime-monitoring-specialist Please download this .zip file, extract it, and install it in the .claude/skills/ directory.