mx-exporter

Community

Streamline GPU monitoring and improve cluster performance.

Authordongg622
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill enables seamless collection and visualization of GPU indicators, facilitating efficient monitoring and maintenance of GPU clusters.

Core Features & Use Cases

  • GPU Metrics Collection: Export GPU temperature, power, memory, and utilization data to Prometheus in real-time.
  • Cluster Monitoring: Deploy in Kubernetes or via host installation to track GPU performance across servers.
  • Use Case: Automate GPU health checks within a Kubernetes environment, alerting if temperature exceeds thresholds or utilization drops unexpectedly.

Quick Start

Install mx-exporter via Wheel or Docker, then start the service to begin collecting GPU metrics for analysis.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: mx-exporter
Download link: https://github.com/dongg622/china-ai-chip-skill/archive/main.zip#mx-exporter

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.