veomni-profile
OfficialOptimize model training with detailed performance analysis.
AuthorByteDance-Seed
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill facilitates comprehensive profiling and performance optimization of deep learning training processes, enabling users to identify bottlenecks and improve efficiency.
Core Features & Use Cases
- Profile analysis: Parse Chrome traces and memory snapshots to evaluate kernel execution, memory usage, and communication overhead.
- Performance optimization: Provide insights into bottlenecks such as slow kernels, inefficient communication, and memory issues to guide improvements during model training.
- Use Case: During training, generate profiling data, analyze the results, and optimize script configurations to maximize throughput and resource utilization, applicable to large-scale model training workflows.
Quick Start
Configure the profiling parameters in your training setup, run the training process with profiling enabled, then analyze the output Chrome traces and memory snapshots using the provided scripts to identify and address performance issues.
Dependency Matrix
Required Modules
None requiredComponents
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: veomni-profile Download link: https://github.com/ByteDance-Seed/VeOmni/archive/main.zip#veomni-profile Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.