perf-moe-hardware-configs
Official
Optimize MoE training with hardware-specific configs.
Software Engineering
Tags: performance, benchmark, training configuration, parallelism, hardware optimization, moe
Author: NVIDIA-NeMo
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides hardware-specific training playbooks and performance benchmarks for Mixture-of-Experts (MoE) models, so users can choose training configurations suited to their hardware.
Core Features & Use Cases
- Performance Benchmarking: Offers approximate throughput ranges and Model FLOPs Utilization (MFU) figures for various hardware platforms.
- Configuration Guidance: Provides recommended parallelism, routing, and recompute strategies based on hardware.
- Use Case: A researcher tuning MoE models can consult this Skill to select the best configuration for H100 or B200 systems tailored to their model size.
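To make the MFU figure mentioned above concrete, here is a minimal sketch of how it is commonly estimated from measured throughput. It is not taken from the Skill itself: the function name `estimate_mfu` and all input values are hypothetical, and the estimate assumes the widely used approximation of about 6 FLOPs per active parameter per token (forward plus backward), which ignores attention FLOPs. For an MoE model, only the parameters activated per token count. The peak FLOPs value should come from the vendor datasheet for the GPU and precision in use.

```python
def estimate_mfu(tokens_per_sec_per_gpu: float,
                 active_params: float,
                 peak_flops_per_gpu: float) -> float:
    """Approximate Model FLOPs Utilization (MFU) for one GPU.

    Assumption: training cost is ~6 FLOPs per *active* parameter per token
    (forward + backward), with attention FLOPs ignored.
    """
    if peak_flops_per_gpu <= 0:
        raise ValueError("peak_flops_per_gpu must be positive")
    achieved_flops_per_sec = 6.0 * active_params * tokens_per_sec_per_gpu
    return achieved_flops_per_sec / peak_flops_per_gpu


if __name__ == "__main__":
    # Placeholder inputs for illustration only, not measured benchmarks.
    mfu = estimate_mfu(
        tokens_per_sec_per_gpu=1000.0,  # hypothetical measured throughput
        active_params=1e9,              # hypothetical active parameter count
        peak_flops_per_gpu=60e12,       # hypothetical peak FLOP/s
    )
    print(f"Estimated MFU: {mfu:.1%}")
```

Comparing an estimate like this against the approximate ranges the Skill reports is one way to judge whether a run is in the expected band for its hardware.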
Quick Start
Request the optimal hardware configuration for training a 685B MoE model on an H100 system.
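Whatever parallelism layout a request like this returns, it can help to check the arithmetic before launching a job. The sketch below is an illustration under stated assumptions, not part of the Skill: the `Layout` class and `data_parallel_size` function are hypothetical names, and the only rules checked are that the GPU count is divisible by the product of the tensor, pipeline, and context parallel sizes, and that the expert count is divisible by the expert parallel size. Real frameworks may impose further constraints.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Layout:
    """Hypothetical container for a parallelism layout."""
    world_size: int   # total number of GPUs
    tp: int           # tensor parallel size
    pp: int           # pipeline parallel size
    cp: int           # context parallel size
    ep: int           # expert parallel size
    num_experts: int  # experts per MoE layer


def data_parallel_size(layout: Layout) -> int:
    """Return the implied data parallel size, or raise if the layout is inconsistent."""
    model_parallel = layout.tp * layout.pp * layout.cp
    if layout.world_size % model_parallel != 0:
        raise ValueError(
            f"world_size={layout.world_size} is not divisible by "
            f"tp*pp*cp={model_parallel}"
        )
    if layout.num_experts % layout.ep != 0:
        raise ValueError(
            f"num_experts={layout.num_experts} is not divisible by ep={layout.ep}"
        )
    return layout.world_size // model_parallel


if __name__ == "__main__":
    # Illustrative values only, not a recommended configuration.
    layout = Layout(world_size=64, tp=2, pp=4, cp=1, ep=8, num_experts=64)
    print("data parallel size:", data_parallel_size(layout))
```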
Dependency Matrix
Required Modules
None required
Components
references, assets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: perf-moe-hardware-configs
Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-moe-hardware-configs
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.