database-reliability-and-operations

Community

Operate production databases safely and reliably

AuthorTiepbm
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Operates production databases safely with replication, failover, backup, restore, migrations, capacity planning, connection management, observability, and risk controls.

Core Features & Use Cases

  • Define RPO, RTO, maintenance windows, backup frequency, retention, and ownership to guide safe operations.
  • Plan safe schema changes with expand-contract sequencing, compatibility validation, and rollback/roll-forward paths.
  • Monitor latency, replication lag, connections, backup health, and storage growth; coordinate failover, DR tests, and cross-system data integrity.
  • Establish runbooks, dashboards, and on-call coordination to ensure auditable, repeatable database operations.

Quick Start

Create a baseline maintenance runbook and perform a short failover drill in a staging environment to validate readiness.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: database-reliability-and-operations
Download link: https://github.com/Tiepbm/software-engineering-agent/archive/main.zip#database-reliability-and-operations

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.