nemo-mbridge-perf-parallelism-strategies

Community

Optimize Megatron Bridge parallelism quickly.

Authorsayalinvidia
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill helps engineers select and size parallelism strategies for Megatron Bridge, enabling efficient training across diverse model sizes and hardware topologies.

Core Features & Use Cases

  • Decision guidance: choose appropriate data and tensor parallelism (DP, TP) and pipeline/sequence controls (PP, SP, CP) for target models.
  • MoE support: provide guidance for expert_parallel and EP/ETP configurations on mixture-of-experts models.
  • Use Case Example: when configuring training for a 236B MoE model across 16 GPUs, the skill helps derive a robust parallelism recipe that respects topology constraints.

Quick Start

Try asking for a recommended parallelism setup given a model size, hardware topology, and target training scale.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: nemo-mbridge-perf-parallelism-strategies
Download link: https://github.com/sayalinvidia/sayali-skills-test/archive/main.zip#nemo-mbridge-perf-parallelism-strategies

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.