perf-moe-vlm-training
OfficialOptimize training of multimodal MoE vision-language models.
AuthorNVIDIA-NeMo
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill provides practical guidance for training MoE VLMs, helping users compare different parallelism approaches to optimize performance and resource usage.
Core Features & Use Cases
- Training Strategy Guidance: Offers detailed comparisons between FSDP and 3D-parallel methods for MoE VLM training.
- Performance Optimization: Explains how to tune model configurations for maximum efficiency on different hardware.
- Use Case: A researcher wants to accelerate training of a new multimodal model; this Skill guides selecting the appropriate parallelism approach and tuning knobs.
Quick Start
Ask the AI how to improve the training throughput of a multimodal model using the methods described in the guidance.
Dependency Matrix
Required Modules
None requiredComponents
referencesassets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: perf-moe-vlm-training Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-moe-vlm-training Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.