perf-nsight-systems

Official

Profile system-level GPU workloads.

AuthorNVIDIA
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Nsight Systems profiles system-level performance to reveal how CPU and GPU work together over time, enabling you to pinpoint scheduling bottlenecks, overlap gaps, and data movement issues.

Core Features & Use Cases

  • System-level timeline profiling of CPU/GPU activity, API calls, and OS events.
  • NVTX ranges and PyTorch autograd integration to correlate code with GPU work.
  • Workflows for profiling DL training, distributed training, and inference, plus post-run analysis with stats, recipes, and exports.

Quick Start

Profile a representative training run with Nsight Systems, generate an .nsys-rep, and begin analysis with nsys stats/analyze/recipe.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: perf-nsight-systems
Download link: https://github.com/NVIDIA/skills/archive/main.zip#perf-nsight-systems

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.