sre-engineering

Community

Coordinate reliable incident responses at scale.

Author: chicongst
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Use this skill when handling production incidents, reliability concerns, on-call triage, infrastructure issues, monitoring and alerting, or post-mortem analysis.

Core Features & Use Cases

  • Incident triage and coordination across on-call teams to rapidly assess impact and blast radius.
  • Structured response framework with parallel investigation tracks, time-boxed actions, and regular stakeholder communications.
  • Post-incident analysis, reporting, and continuous improvement through documented RCA and preventive measures.
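The structured response framework described above (parallel tracks, time-boxed actions, regular stakeholder updates) can be pictured with a small data model. This is only an illustrative sketch: the class names, fields, and sample values (`Track`, `Incident`, "Checkout API outage", the owners and durations) are assumptions made for this example and are not taken from the skill itself.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta


@dataclass
class Track:
    """One parallel investigation track (hypothetical model)."""
    name: str
    owner: str
    time_box: timedelta  # how long the track may run before it is re-evaluated


@dataclass
class Incident:
    """Minimal incident record with time-boxed tracks and an update cadence."""
    title: str
    started_at: datetime
    update_interval: timedelta  # cadence for stakeholder communications
    tracks: list = field(default_factory=list)

    def overdue_tracks(self, now):
        """Names of tracks whose time box has elapsed since the incident began."""
        elapsed = now - self.started_at
        return [t.name for t in self.tracks if elapsed > t.time_box]

    def next_update_due(self, last_update):
        """When the next stakeholder update should go out."""
        return last_update + self.update_interval


# Sample data, invented purely for illustration.
start = datetime(2024, 1, 1, 12, 0)
incident = Incident(
    title="Checkout API outage",
    started_at=start,
    update_interval=timedelta(minutes=30),
    tracks=[
        Track("database", "alice", timedelta(minutes=15)),
        Track("network", "bob", timedelta(minutes=30)),
    ],
)

# Twenty minutes in, only the 15-minute track has exceeded its time box.
print(incident.overdue_tracks(start + timedelta(minutes=20)))  # ['database']
```

Keeping the time box on each track, rather than on the incident as a whole, lets an incident commander review and reassign one line of investigation without interrupting the others.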

Quick Start

Describe and execute an incident response plan for a production outage, including roles, timelines, and mitigation steps.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: sre-engineering
Download link: https://github.com/chicongst/agent-skills-installer/archive/main.zip#sre-engineering

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
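For readers who prefer to do the extraction step by hand, the following is a rough sketch of copying one skill folder out of an already-downloaded archive into `.claude/skills/`. The helper name `extract_skill` is hypothetical, and the archive layout used in the demo (`agent-skills-installer-main/skills/sre-engineering/...`) is an assumption for illustration only; the real layout of the repository may differ, so inspect the archive before relying on it.

```python
import io
import tempfile
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(archive, skill_name, dest_root):
    """Copy every file found under a folder named `skill_name` into
    dest_root/skill_name and return the list of written paths."""
    written = []
    for info in archive.infolist():
        if info.is_dir():
            continue
        parts = PurePosixPath(info.filename).parts
        if skill_name not in parts:
            continue
        # Keep only the path components below the skill folder.
        rel = parts[parts.index(skill_name) + 1:]
        if not rel or ".." in rel:
            continue  # skip odd entries and path-traversal attempts
        target = Path(dest_root) / skill_name / Path(*rel)
        target.parent.mkdir(parents=True, exist_ok=True)
        target.write_bytes(archive.read(info))
        written.append(target)
    return written


if __name__ == "__main__":
    # Build a small in-memory archive with an ASSUMED layout for the demo;
    # no network access is performed here.
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr(
            "agent-skills-installer-main/skills/sre-engineering/SKILL.md",
            "placeholder skill file",
        )
        zf.writestr("agent-skills-installer-main/README.md", "unrelated file")

    with tempfile.TemporaryDirectory() as tmp:
        skills_dir = Path(tmp) / ".claude" / "skills"
        with zipfile.ZipFile(buf) as zf:
            for path in extract_skill(zf, "sre-engineering", skills_dir):
                print(path.relative_to(tmp))
```

In real use, `dest_root` would point at the project's `.claude/skills/` directory and the archive would be the downloaded `main.zip`.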