nsys-optimizer
OfficialSpeed up CUDA apps with Nsight profiling.
AuthorVCERS
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Profiling and optimizing CUDA-based simulations suffer from slow runtimes and opaque bottlenecks; this skill provides a structured method to baseline, diagnose, optimize, and verify improvements using Nsight Systems.
Core Features & Use Cases
- Systematic profiling with Nsight Systems to identify kernels and memory bottlenecks.
- End-to-end optimization loop: profile, diagnose, optimize, re-profile, verify.
- Use Case: accelerate a GPU-accelerated physics or materials simulation by targeting hotspot kernels and memory patterns.
Quick Start
Run an initial Nsight Systems profiling session on your CUDA application to collect baseline performance data.
Dependency Matrix
Required Modules
scipy
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: nsys-optimizer Download link: https://github.com/VCERS/MatClaw/archive/main.zip#nsys-optimizer Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.