Name: perf-sequence-packing
Author: NVIDIA-NeMo

System Documentation

What problem does it solve?

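This Skill helps you validate and apply sequence packing when fine-tuning large language models. Packing concatenates several short samples into one fixed-length sequence, which reduces the compute spent on padding tokens and makes long-context training more practical.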

Core Features & Use Cases

Offline packed SFT: configures packed sequence specifications for LLM fine-tuning so that training stays stable at extended context lengths.
In-batch packing: packs samples within each batch for vision-language model fine-tuning to improve training throughput.
Use Case: Adjust sequence lengths and packing parameters to train with longer contexts or to reduce memory usage when fine-tuning models for production environments.
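
To make the throughput and memory argument concrete, here is a minimal sketch of offline packing using a first-fit-decreasing heuristic, written in plain Python. It is an illustration only: the function names and the `pack_size` parameter are hypothetical and are not taken from Megatron-Bridge, whose actual packing algorithm may differ.

```python
def pack_first_fit_decreasing(lengths, pack_size):
    """Group sequence lengths into bins whose totals do not exceed pack_size.

    Hypothetical helper for illustration; not a Megatron-Bridge API.
    """
    bins = []  # each bin is a list of sequence lengths sharing one packed row
    for length in sorted(lengths, reverse=True):
        if length > pack_size:
            raise ValueError(
                f"sequence of length {length} exceeds pack_size {pack_size}"
            )
        # Place the sequence in the first bin that still has room.
        for current in bins:
            if sum(current) + length <= pack_size:
                current.append(length)
                break
        else:
            # No existing bin fits, so open a new one.
            bins.append([length])
    return bins


def padding_fraction(bins, pack_size):
    """Fraction of token slots that hold padding once rows are padded to pack_size."""
    total_slots = len(bins) * pack_size
    real_tokens = sum(sum(current) for current in bins)
    return 1.0 - real_tokens / total_slots


lengths = [512, 1024, 256, 2048, 768, 128]
packed = pack_first_fit_decreasing(lengths, pack_size=2048)
unpacked = [[n] for n in lengths]  # baseline: one sequence per padded row

print(packed)
print(f"padding without packing: {padding_fraction(unpacked, 2048):.1%}")
print(f"padding with packing:    {padding_fraction(packed, 2048):.1%}")
```

In this toy example the six samples fit into three packed rows instead of six padded ones, so far fewer token slots are wasted on padding. Real datasets will show different savings depending on their length distribution.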

Quick Start

Configure your training setup to use PackedSequenceSpecs for the sequence length and packing specifications, then run training with those settings to improve efficiency and handle long-context sequences.
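
The source does not list the fields of PackedSequenceSpecs, so the sketch below does not use that class. Instead it shows, in plain Python, the kind of bookkeeping a packed sequence generally needs: cumulative sequence boundaries (often called `cu_seqlens` in variable-length attention kernels) and position IDs that restart at each sample. The helper names are hypothetical and chosen for illustration only.

```python
from itertools import accumulate


def cumulative_boundaries(seq_lengths):
    """Return start/end offsets of each sample inside one packed row.

    For lengths [3, 2, 4] the samples occupy [0, 3), [3, 5) and [5, 9).
    """
    return [0] + list(accumulate(seq_lengths))


def position_ids(seq_lengths):
    """Return position IDs that restart from 0 at every sample boundary."""
    return [pos for n in seq_lengths for pos in range(n)]


seq_lengths = [3, 2, 4]  # three samples packed into one row of 9 tokens
cu = cumulative_boundaries(seq_lengths)
pos = position_ids(seq_lengths)

print(cu)   # [0, 3, 5, 9]
print(pos)  # [0, 1, 2, 0, 1, 0, 1, 2, 3]
```

Boundaries like these let attention be restricted to tokens within the same sample, so that packed neighbours do not attend to each other. How Megatron-Bridge stores or passes this information is not covered by the text above.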

Please help me install this Skill:
Name: perf-sequence-packing
Download link: https://github.com/NVIDIA-NeMo/Megatron-Bridge/archive/main.zip#perf-sequence-packing
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
