troubleshoot-kafka

Official

Triage Kafka failures using Netdata signals.

Authornetdata
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Diagnose Kafka incidents like ISR shrinkage, leaderless partitions, consumer lag, controller overload, and broker disk exhaustion with a structured, signal-driven approach instead of guessing from generic health checks.

Core Features & Use Cases

  • Netdata MCP signal triage: Queries Netdata via MCP to confirm which Kafka failure archetype is actually occurring.
  • Operator playbook diagnostic tree: Applies the Netdata operator playbook’s failure-pattern logic to guide step-by-step investigation and narrowing.
  • Remediation verification loop: Re-queries the same MCP signals after remediation to ensure the anomalous signals return to expected ranges.

Quick Start

Ask the AI: "Investigate my Kafka cluster and tell me whether the incident is caused by ISR shrinkage, leaderless partitions, consumer lag, controller overload, or broker disk exhaustion, then recommend the most likely remediation using Netdata MCP signals."

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: troubleshoot-kafka
Download link: https://github.com/netdata/skills/archive/main.zip#troubleshoot-kafka

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.