nemo-mbridge-perf-moe-hardware-configs

Community

Guides the selection of MoE training configurations by hardware platform.

Author: sayalinvidia
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Helps engineers select and tune Mixture-of-Experts (MoE) training configurations for a specific hardware platform and model family. It summarizes throughput bands, parallelism patterns, and tuning stacks, so less time goes into trial-and-error configuration.

Core Features & Use Cases

  • Hardware platform playbooks for MoE workloads across the DeepSeek-V3 (DSV3), Qwen3, and Qwen3-Next model families.
  • Representative config families with guidance on the token dispatcher, virtual pipeline parallelism (VPP), and CUDA graphs to improve throughput and efficiency.
  • Cross-cutting patterns and environment recommendations (CUDA settings, CPU tuning) for production-ready training setups.

Quick Start

Refer to the Quick Platform Playbook to tailor MoE training configurations for your hardware and model.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: nemo-mbridge-perf-moe-hardware-configs
Download link: https://github.com/sayalinvidia/sayali-skills-test/archive/main.zip#nemo-mbridge-perf-moe-hardware-configs

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
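
For readers who prefer to install by hand, the extract-and-copy step above can be sketched in Python. This is a minimal sketch under stated assumptions, not the marketplace's or Claude Code's own installer: `extract_skill` is a hypothetical helper, and it assumes the skill lives in a folder named after the skill somewhere inside the archive (the `#nemo-mbridge-perf-moe-hardware-configs` fragment in the download link suggests this, but the listing does not confirm the layout).

```python
import io
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_bytes: bytes, skill_name: str, skills_dir: Path) -> Path:
    """Copy the files found under a `<skill_name>/` folder of the archive
    into `skills_dir/<skill_name>/` and return that destination path.

    Hypothetical helper: the archive layout is an assumption.
    """
    dest = skills_dir / skill_name
    found = False
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Keep only the path below the skill folder.
            relative = parts[parts.index(skill_name) + 1:]
            if not relative or ".." in relative:
                continue  # skip odd entries and path-traversal attempts
            target = dest.joinpath(*relative)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(archive.read(info))
            found = True
    if not found:
        raise FileNotFoundError(f"no '{skill_name}' folder in archive")
    return dest
```

The archive bytes could be fetched with `urllib.request.urlopen(url).read()`, using the download link without its `#...` fragment, and then passed in with `skills_dir=Path(".claude/skills")`.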