mx-exporter
CommunityStreamline GPU monitoring and improve cluster performance.
Authordongg622
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill enables seamless collection and visualization of GPU indicators, facilitating efficient monitoring and maintenance of GPU clusters.
Core Features & Use Cases
- GPU Metrics Collection: Export GPU temperature, power, memory, and utilization data to Prometheus in real-time.
- Cluster Monitoring: Deploy in Kubernetes or via host installation to track GPU performance across servers.
- Use Case: Automate GPU health checks within a Kubernetes environment, alerting if temperature exceeds thresholds or utilization drops unexpectedly.
Quick Start
Install mx-exporter via Wheel or Docker, then start the service to begin collecting GPU metrics for analysis.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: mx-exporter Download link: https://github.com/dongg622/china-ai-chip-skill/archive/main.zip#mx-exporter Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.