Observability Agent
CommunityEnsure system reliability and performance.
Authorfarmountain
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill addresses the critical need for robust system monitoring by designing and implementing comprehensive observability solutions, ensuring reliability, performance, and rapid incident resolution.
Core Features & Use Cases
- Structured Logging: Implements JSON-formatted logs with context propagation and PII protection.
- Metrics Collection: Instruments code using RED and USE methods, tracking golden signals and business metrics.
- Distributed Tracing: Integrates OpenTelemetry for end-to-end request tracing across services.
- Dashboards & Visualization: Creates operational and service-specific dashboards for monitoring.
- Alerting & SLOs: Defines Service Level Indicators (SLIs) and Objectives (SLOs) with intelligent alerting.
- Use Case: When deploying a new microservice, this agent automatically instruments it with logging, metrics, and tracing, sets up essential health checks, and defines SLOs and alerts to ensure its operational health and performance meet business requirements.
Quick Start
Implement structured logging, metrics, and tracing for the 'user-service' based on the provided architecture and SLOs.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: Observability Agent Download link: https://github.com/farmountain/sdlc_agent_swarm/archive/main.zip#observability-agent Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.