fleet-watchdog

Community

Proactive fleet health and auto-repair.

Authorsupportersimulator
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Fleet health issues across distributed nodes lead to downtime and manual troubleshooting overhead. The Fleet Watchdog automates health checks, flags failures early, and triggers auto-repair while managing idle sessions and task suggestions to maintain fleet resilience.

Core Features & Use Cases

  • Proactive health checks for peer connectivity (NATS) and HTTP channels with configured thresholds.
  • Automatic repair escalation through the fleet repair system, with safety rails like autoRepairLevel and idle handling.
  • Idle detection and proactive task generation to keep fleet productive during downtime.
  • Big-picture awareness and audit through evidence ledger and logs.

Quick Start

Enable watchdog with default settings and verify the status endpoint to confirm it is running.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: fleet-watchdog
Download link: https://github.com/supportersimulator/multi-fleet/archive/main.zip#fleet-watchdog

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.