hy-smi
Community
Efficient management and monitoring of Hygon DCU devices.
Product & Management
#performance tuning #device management #hardware monitoring #hygon #dcu #hy-smi #fault diagnosis
Author: dongg622
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill streamlines the management and health monitoring of Hygon DCU hardware, so system administrators can quickly check device status, diagnose faults, and troubleshoot device issues.
Core Features & Use Cases
- Device health inspection: Perform comprehensive device health checks and error diagnostics to ensure optimal operation.
- Performance tuning: Adjust power and clock settings to optimize GPU/DCU performance for intensive workloads.
- Real-time monitoring: Continuously observe power usage, temperature, and utilization metrics during operation.
- Fault troubleshooting: Detect and reset errors, analyze RAS logs, and diagnose hardware faults such as ECC errors or XGMI issues.
- Device topology insight: Visualize and examine device interconnections to facilitate maintenance and configuration.
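As a rough illustration of the real-time monitoring use case, the sketch below polls a command-line tool at a fixed interval and collects its output. It assumes that `hy-smi` is on the `PATH` and that running it with no arguments prints a device summary; neither is confirmed by this page, and the helper names `run_tool` and `poll` are hypothetical. No specific `hy-smi` flags are used, since none are documented here.

```python
import shutil
import subprocess
import time
from typing import List, Optional, Sequence


def run_tool(command: Sequence[str], timeout: float = 30.0) -> Optional[str]:
    """Run a command and return its stdout, or None if the executable is missing."""
    if shutil.which(command[0]) is None:
        return None
    result = subprocess.run(
        list(command), capture_output=True, text=True, timeout=timeout, check=False
    )
    return result.stdout


def poll(command: Sequence[str], samples: int, interval_s: float) -> List[str]:
    """Collect up to `samples` snapshots of the command's output, `interval_s` apart."""
    snapshots: List[str] = []
    for i in range(samples):
        out = run_tool(command)
        if out is None:
            break  # tool not installed; nothing to collect
        snapshots.append(out)
        if i < samples - 1:
            time.sleep(interval_s)
    return snapshots


if __name__ == "__main__":
    # Assumption: `hy-smi` with no arguments prints a per-device summary.
    for snapshot in poll(["hy-smi"], samples=3, interval_s=2.0):
        print(snapshot)
```

Passing the command as a parameter keeps the loop independent of any particular flag set, so the same helper can wrap whichever `hy-smi` options your installation actually supports.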
Quick Start
Use the hy-smi skill to perform a health check on your Hygon GPU/DCU system.
Dependency Matrix
Required Modules
None required
Components
scripts, references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: hy-smi
Download link: https://github.com/dongg622/china-ai-chip-skill/archive/main.zip#hy-smi
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
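For a manual install, the extraction step might look like the sketch below. It assumes the archive has already been downloaded from the link above and that it contains a folder named `hy-smi` somewhere inside it; the actual layout of the archive is not stated on this page, and `extract_skill` is a hypothetical helper, not part of the Skill.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(archive: Path, skill_name: str, skills_dir: Path) -> int:
    """Copy every file under a folder named `skill_name` into skills_dir/skill_name.

    Returns the number of files written.
    """
    target_root = (skills_dir / skill_name).resolve()
    written = 0
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            if skill_name not in parts[:-1]:
                continue  # file is not inside the skill folder
            # Keep only the path below the skill folder.
            relative = Path(*parts[parts.index(skill_name) + 1:])
            destination = (target_root / relative).resolve()
            # Refuse entries that would escape the target directory.
            if target_root not in destination.parents:
                continue
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(zf.read(info))
            written += 1
    return written


if __name__ == "__main__":
    # Assumption: main.zip was saved to the current directory beforehand.
    count = extract_skill(Path("main.zip"), "hy-smi", Path(".claude/skills"))
    print(f"installed {count} files")
```

Only the `hy-smi` folder is copied, so other content in the same archive is left out of `.claude/skills/`.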