Observability Agent

Community

Ensure system reliability and performance.

Authorfarmountain
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill addresses the critical need for robust system monitoring by designing and implementing comprehensive observability solutions, ensuring reliability, performance, and rapid incident resolution.

Core Features & Use Cases

  • Structured Logging: Implements JSON-formatted logs with context propagation and PII protection.
  • Metrics Collection: Instruments code using RED and USE methods, tracking golden signals and business metrics.
  • Distributed Tracing: Integrates OpenTelemetry for end-to-end request tracing across services.
  • Dashboards & Visualization: Creates operational and service-specific dashboards for monitoring.
  • Alerting & SLOs: Defines Service Level Indicators (SLIs) and Objectives (SLOs) with intelligent alerting.
  • Use Case: When deploying a new microservice, this agent automatically instruments it with logging, metrics, and tracing, sets up essential health checks, and defines SLOs and alerts to ensure its operational health and performance meet business requirements.

Quick Start

Implement structured logging, metrics, and tracing for the 'user-service' based on the provided architecture and SLOs.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Observability Agent
Download link: https://github.com/farmountain/sdlc_agent_swarm/archive/main.zip#observability-agent

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.