Name: perf-moe-dispatcher-selection
Availability: InStock
Author: NVIDIA-NeMo

System Documentation

What problem does it solve?

This Skill helps developers choose the optimal MoE token dispatcher (such as alltoall, DeepEP, or HybridEP) based on hardware platform, model size, and EP degree, ensuring maximum performance and efficiency.

Core Features & Use Cases

Dispatcher Recommendation: Guides users in selecting the best dispatcher configuration for H100, B200, GB200, or GB300 systems.
Performance Tuning Advice: Provides insights on tuning SM counts and routing modes for specific models and hardware.
Use Case: A researcher working on large-scale MoE models on GB200 systems can determine whether to use DeepEP or HybridEP for optimal throughput and memory utilization.

Quick Start

Ask the AI which MoE dispatcher setting is best for a 685B model running on a 256×GB200 system to improve performance.

Please help me install this Skill: Name: perf-moe-dispatcher-selection Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-moe-dispatcher-selection Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

perf-moe-dispatcher-selection

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper