nsys-optimizer

Official

Speed up CUDA apps with Nsight profiling.

AuthorVCERS
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Profiling and optimizing CUDA-based simulations suffer from slow runtimes and opaque bottlenecks; this skill provides a structured method to baseline, diagnose, optimize, and verify improvements using Nsight Systems.

Core Features & Use Cases

  • Systematic profiling with Nsight Systems to identify kernels and memory bottlenecks.
  • End-to-end optimization loop: profile, diagnose, optimize, re-profile, verify.
  • Use Case: accelerate a GPU-accelerated physics or materials simulation by targeting hotspot kernels and memory patterns.

Quick Start

Run an initial Nsight Systems profiling session on your CUDA application to collect baseline performance data.

Dependency Matrix

Required Modules

scipy

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: nsys-optimizer
Download link: https://github.com/VCERS/MatClaw/archive/main.zip#nsys-optimizer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.