infra-monitoring
Community
Configure and audit AICP observability.
System Documentation
What problem does it solve?
Configures or audits AICP's monitoring stack. AICP ships a complete observability layer: Prometheus and Grafana behind a Docker Compose profile, 7 baseline alerts, and per-component metrics endpoints. This skill is for working with the existing stack (adding or tuning alerts, reviewing dashboards, verifying metrics flow, debugging missing signals), not for replacing it; replacing the stack falls under architecture-propose scope.
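As an illustration of "verifying metrics flow", the sketch below checks whether a component's metrics endpoint exposes the metric names you expect. It assumes the endpoint returns the standard Prometheus text exposition format. The metric name `aicp_queue_depth` and the sample payload are hypothetical and not taken from AICP itself.

```python
import re

# Hypothetical scrape output in Prometheus text exposition format.
SAMPLE_EXPOSITION = """\
# HELP http_requests_total Total requests.
# TYPE http_requests_total counter
http_requests_total{method="get",code="200"} 1027
process_cpu_seconds_total 12.5
"""

# A metric name starts with a letter, underscore, or colon.
_NAME_RE = re.compile(r"[a-zA-Z_:][a-zA-Z0-9_:]*")


def metric_names(exposition_text):
    """Collect the metric names found on sample lines (comments skipped)."""
    names = set()
    for line in exposition_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # blank line, or a HELP/TYPE comment
        match = _NAME_RE.match(line)
        if match:
            names.add(match.group(0))
    return names


def missing_metrics(exposition_text, expected):
    """Return the expected metric names that the endpoint did not expose."""
    return sorted(set(expected) - metric_names(exposition_text))
```

Calling `missing_metrics(SAMPLE_EXPOSITION, ["http_requests_total", "aicp_queue_depth"])` reports `aicp_queue_depth` as absent, which is the kind of gap a missing-signal investigation would start from.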
Core Features & Use Cases
- Verify the monitoring stack is running, endpoints are reachable, and Prometheus targets are healthy.
- Review dashboards and baseline alerts; tune signals and provenance for reliable on-call routing.
- Perform post-incident audits and quarterly observability reviews to ensure signals align with reliability goals.
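The target-health check in the first item can be sketched against Prometheus's HTTP API, whose `/api/v1/targets` endpoint lists active scrape targets with a `health` field. This is a minimal sketch, not the skill's own procedure: the base URL `http://localhost:9090` is an assumed default, and the job names in the sample payload are hypothetical.

```python
import json
from urllib.request import urlopen

# Hypothetical, trimmed response from GET /api/v1/targets.
SAMPLE_TARGETS = {
    "status": "success",
    "data": {
        "activeTargets": [
            {"labels": {"job": "prometheus"},
             "scrapeUrl": "http://localhost:9090/metrics",
             "health": "up", "lastError": ""},
            {"labels": {"job": "router"},
             "scrapeUrl": "http://router:8000/metrics",
             "health": "down", "lastError": "connection refused"},
        ]
    },
}


def unhealthy_targets(targets_payload):
    """Return (job, scrape_url, last_error) for each active target not 'up'."""
    active = targets_payload.get("data", {}).get("activeTargets", [])
    return [
        (t.get("labels", {}).get("job", "unknown"),
         t.get("scrapeUrl", ""),
         t.get("lastError", ""))
        for t in active
        if t.get("health") != "up"
    ]


def fetch_targets(base_url="http://localhost:9090"):
    """Fetch the targets payload; base_url is an assumption, adjust as needed."""
    with urlopen(f"{base_url}/api/v1/targets", timeout=5) as resp:
        return json.load(resp)
```

Against a running stack, `unhealthy_targets(fetch_targets())` returning an empty list would suggest every active target is being scraped successfully; any tuple it returns names the job and the last scrape error to investigate.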
Quick Start
Load the skill and follow the four operations to verify stack health, audit dashboards, and document changes.
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: infra-monitoring
Download link: https://github.com/cyberpunk042/devops-expert-local-ai/archive/main.zip#infra-monitoring
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.