suprof

Community

GPU性能分析与调优的核心工具

Authordongg622
Version1.0.0
Installs0

System Documentation

What problem does it solve?

suProfiler 提供GPU性能分析能力,帮助FAE工程师定位训练、推理中的性能瓶颈,提高效率和优化能力。

Core Features & Use Cases

  • 训练性能瓶颈定位:采集系统全网 Timeline,识别GPU利用率低、Kernel耗时长等瓶颈问题。
  • 单Kernel性能调优:分析Kernel的EU/GEMM利用率、缓存命中率,优化内核实现。
  • 推理延迟优化:范围采集关键路径耗时,改善模型推理速度。 使用场景包括训练加速、迁移调优及推理性能提升。

Quick Start

利用 suProfiler 采集CPU/GPU任务性能数据,通过分析raw文件快速找到性能瓶颈。

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: suprof
Download link: https://github.com/dongg622/china-ai-chip-skill/archive/main.zip#suprof

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.