troubleshoot-kafka
OfficialTriage Kafka failures using Netdata signals.
Authornetdata
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Diagnose Kafka incidents like ISR shrinkage, leaderless partitions, consumer lag, controller overload, and broker disk exhaustion with a structured, signal-driven approach instead of guessing from generic health checks.
Core Features & Use Cases
- Netdata MCP signal triage: Queries Netdata via MCP to confirm which Kafka failure archetype is actually occurring.
- Operator playbook diagnostic tree: Applies the Netdata operator playbook’s failure-pattern logic to guide step-by-step investigation and narrowing.
- Remediation verification loop: Re-queries the same MCP signals after remediation to ensure the anomalous signals return to expected ranges.
Quick Start
Ask the AI: "Investigate my Kafka cluster and tell me whether the incident is caused by ISR shrinkage, leaderless partitions, consumer lag, controller overload, or broker disk exhaustion, then recommend the most likely remediation using Netdata MCP signals."
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: troubleshoot-kafka Download link: https://github.com/netdata/skills/archive/main.zip#troubleshoot-kafka Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.