perf-moe-hardware-configs

Official

Optimize MoE training with hardware-specific configs.

Author: NVIDIA-NeMo
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill provides hardware-specific training playbooks and performance benchmarks for Mixture-of-Experts (MoE) models, helping users choose training configurations that fit their hardware.

Core Features & Use Cases

  • Performance Benchmarking: Offers approximate throughput ranges and Model FLOPs Utilization (MFU) figures for various hardware platforms.
  • Configuration Guidance: Provides recommended parallelism, routing, and recompute strategies based on hardware.
  • Use Case: A researcher tuning MoE models can consult this Skill to select a configuration for H100 or B200 systems that is tailored to their model size.
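To make the MFU figures mentioned above concrete, here is a minimal sketch of how MFU is commonly estimated from measured throughput. It assumes the widely used approximation of about 6 FLOPs per active parameter per token for a combined forward and backward pass, which ignores attention FLOPs; for an MoE model the relevant count is the parameters activated per token, not the total. The function name and all numbers below are hypothetical illustrations and are not values taken from this Skill's benchmarks. The per-GPU peak FLOP/s should come from the vendor datasheet for the precision in use.

```python
def model_flops_utilization(tokens_per_sec_per_gpu: float,
                            active_params: float,
                            peak_flops_per_gpu: float) -> float:
    """Estimate MFU as a fraction of peak, using the ~6 * N FLOPs-per-token rule.

    active_params is the number of parameters used per token (for MoE, the
    activated experts plus shared layers), not the total parameter count.
    """
    if peak_flops_per_gpu <= 0:
        raise ValueError("peak_flops_per_gpu must be positive")
    # Approximate training FLOPs actually performed per second on one GPU.
    achieved_flops = 6.0 * active_params * tokens_per_sec_per_gpu
    return achieved_flops / peak_flops_per_gpu


# Illustrative placeholder inputs only (not measured results):
# 40e9 active parameters, 1000 tokens/s/GPU, 1e15 peak FLOP/s per GPU.
example_mfu = model_flops_utilization(1000.0, 40e9, 1e15)
print(f"Estimated MFU: {example_mfu:.1%}")
```

Because the estimate scales with active parameters, two MoE models with the same total size can show very different MFU at the same token throughput.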

Quick Start

Request the optimal hardware configuration for training a 685B MoE model on an H100 system.

Dependency Matrix

Required Modules

None required

Components

references, assets

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: perf-moe-hardware-configs
Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-moe-hardware-configs

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
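For readers who prefer to perform the same download, extract, and install steps by hand, the following is a rough sketch using only the Python standard library. It rests on some assumptions: the skill lives in a directory named after the skill somewhere inside the archive (its exact path is not stated here, so the code searches for it), and the `#perf-moe-hardware-configs` fragment of the link is dropped because URL fragments are not sent to the server. The helper names are hypothetical.

```python
import shutil
import tempfile
import urllib.request
import zipfile
from pathlib import Path

ARCHIVE_URL = "https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip"
SKILL_NAME = "perf-moe-hardware-configs"


def find_skill_dir(root: Path, name: str) -> Path:
    """Return the first directory called `name` anywhere under `root`."""
    for candidate in sorted(root.rglob(name)):
        if candidate.is_dir():
            return candidate
    raise FileNotFoundError(f"no directory named {name!r} under {root}")


def install_from_tree(root: Path, name: str, skills_dir: Path) -> Path:
    """Copy the skill directory found under `root` into `skills_dir / name`."""
    source = find_skill_dir(root, name)
    target = skills_dir / name
    skills_dir.mkdir(parents=True, exist_ok=True)
    # dirs_exist_ok (Python 3.8+) lets a re-install overwrite existing files.
    shutil.copytree(source, target, dirs_exist_ok=True)
    return target


def download_and_install(skills_dir: Path = Path(".claude/skills")) -> Path:
    """Download the repository archive, extract it, and install the skill."""
    with tempfile.TemporaryDirectory() as tmp:
        archive = Path(tmp) / "main.zip"
        urllib.request.urlretrieve(ARCHIVE_URL, archive)
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(tmp)
        return install_from_tree(Path(tmp), SKILL_NAME, skills_dir)
```

Calling `download_and_install()` from a project root would place the skill under `.claude/skills/perf-moe-hardware-configs`, assuming the directory search succeeds; nothing is downloaded until that function is called.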
