mceid

Community

GPU hardware error identification and reporting system for proactive maintenance.

Author: dongg622
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill queries and analyzes GPU hardware errors to support fault diagnosis and health monitoring.

Core Features & Use Cases

  • Error Identification: Allows users to quickly view and interpret GPU hardware error messages and statuses.
  • Fault Diagnosis: Supports diagnoses related to GPU faults, RAS errors, PCIe issues, and device health.
  • Use Case: Field application engineers (FAEs) can use this Skill to identify hardware failures promptly and plan preventive maintenance by analyzing GPU error logs and system messages.

Quick Start

Ask the AI to retrieve all current GPU hardware errors and interpret the error codes for immediate troubleshooting.

Dependency Matrix

Required Modules

None required

Components

scripts, references

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: mceid
Download link: https://github.com/dongg622/china-ai-chip-skill/archive/main.zip#mceid

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
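
For a manual install, the steps in the prompt above (download, extract, copy into .claude/skills/) can be sketched in Python. This is a minimal sketch, not the author's installer. It assumes the archive follows the usual GitHub layout of a single top-level `<repo>-<branch>/` folder, and that the `#mceid` fragment in the download link names a `mceid` folder directly under it. The function names `fetch_archive` and `install_skill` are hypothetical.

```python
import io
import urllib.request
import zipfile
from pathlib import Path, PurePosixPath


def fetch_archive(url):
    """Download a .zip archive and return it as an in-memory file object."""
    with urllib.request.urlopen(url) as response:
        return io.BytesIO(response.read())


def install_skill(archive, skill_name, skills_dir):
    """Copy the `skill_name` folder from the archive into `skills_dir`.

    Assumed archive layout: <repo>-<branch>/<skill_name>/...
    Returns the list of files written.
    """
    target_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            # Skip directory entries and anything outside <top>/<skill_name>/.
            if info.is_dir() or len(parts) < 3 or parts[1] != skill_name:
                continue
            relative = Path(*parts[2:])
            # Refuse entries that try to climb out of the target folder.
            if ".." in relative.parts:
                continue
            destination = target_root / relative
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(zf.read(info))
            written.append(destination)
    return written
```

To use it, call `install_skill(fetch_archive("https://github.com/dongg622/china-ai-chip-skill/archive/main.zip"), "mceid", ".claude/skills")` from the project root. If the repository keeps the skill in a different folder, the function writes nothing and returns an empty list, which is a signal to check the archive layout by hand.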
