troubleshoot-envoy

Official

Triage Envoy failures with Netdata signals

Authornetdata
Version1.0.0
Installs0

System Documentation

What problem does it solve?

It helps diagnose why an Envoy proxy is misbehaving by mapping real-time Netdata observations to specific Envoy failure archetypes and guiding remediation-oriented troubleshooting.

Core Features & Use Cases

  • Envoy failure archetype triage: Targets connection pool exhaustion cascades, memory pressure spirals, xDS control plane disconnects, retry amplification, and stats cardinality explosions.
  • Netdata-backed evidence via MCP: Queries Envoy-related Netdata chart contexts through an MCP surface to validate the current incident state.
  • Operator playbook diagnostic tree: Applies the same domain decomposition (Availability, Throughput, Latency, Errors, Saturation, Internal State, Configuration State) to structure analysis and next actions.
  • Use case: When on-call engineers receive alerts or users report elevated errors/latency/saturation, this skill helps select the right signal domain, pull the relevant time windows, and recommend remediation steps aligned to the playbook.

Quick Start

Ask an AI agent: "Troubleshoot my Envoy using Netdata via MCP and identify the most likely failure archetype, then recommend remediation based on the operator playbook."

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: troubleshoot-envoy
Download link: https://github.com/netdata/skills/archive/main.zip#troubleshoot-envoy

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.