sre-engineering
CommunityCoordinate reliable incident responses at scale.
Software Engineering#monitoring#communication#playbook#on-call#sre#incident-management#incident-response
Authorchicongst
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Use when handling production incidents, reliability concerns, on-call triage, infrastructure issues, monitoring, alerting, and post-mortem analysis.
Core Features & Use Cases
- Incident triage and coordination across on-call teams to rapidly assess impact and blast radius.
- Structured response framework with parallel investigation tracks, time-boxed actions, and regular stakeholder communications.
- Post-incident analysis, reporting, and continuous improvement through documented RCA and preventive measures.
Quick Start
Describe and execute an incident response plan for a production outage, including roles, timelines, and mitigation steps.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: sre-engineering Download link: https://github.com/chicongst/agent-skills-installer/archive/main.zip#sre-engineering Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.