veomni-profile

Official

Optimize model training with detailed performance analysis.

AuthorByteDance-Seed
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill facilitates comprehensive profiling and performance optimization of deep learning training processes, enabling users to identify bottlenecks and improve efficiency.

Core Features & Use Cases

  • Profile analysis: Parse Chrome traces and memory snapshots to evaluate kernel execution, memory usage, and communication overhead.
  • Performance optimization: Provide insights into bottlenecks such as slow kernels, inefficient communication, and memory issues to guide improvements during model training.
  • Use Case: During training, generate profiling data, analyze the results, and optimize script configurations to maximize throughput and resource utilization, applicable to large-scale model training workflows.

Quick Start

Configure the profiling parameters in your training setup, run the training process with profiling enabled, then analyze the output Chrome traces and memory snapshots using the provided scripts to identify and address performance issues.

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: veomni-profile
Download link: https://github.com/ByteDance-Seed/VeOmni/archive/main.zip#veomni-profile

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.