perf-moe-vlm-training

Official

Optimize training of multimodal MoE vision-language models.

AuthorNVIDIA-NeMo
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill provides practical guidance for training MoE VLMs, helping users compare different parallelism approaches to optimize performance and resource usage.

Core Features & Use Cases

  • Training Strategy Guidance: Offers detailed comparisons between FSDP and 3D-parallel methods for MoE VLM training.
  • Performance Optimization: Explains how to tune model configurations for maximum efficiency on different hardware.
  • Use Case: A researcher wants to accelerate training of a new multimodal model; this Skill guides selecting the appropriate parallelism approach and tuning knobs.

Quick Start

Ask the AI how to improve the training throughput of a multimodal model using the methods described in the guidance.

Dependency Matrix

Required Modules

None required

Components

referencesassets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: perf-moe-vlm-training
Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-moe-vlm-training

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.