sglang-diffusion-benchmark-profile

Community

Benchmark and profile diffusion latency.

AuthorNabilhassan12345
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Diffusion model benchmarks and profiling streamline the process of measuring denoise latency, end-to-end generation timing, and memory usage, enabling data-driven optimization decisions.

Core Features & Use Cases

  • End-to-end diffusion benchmarks: measure denoise latency, end-to-end latency, and peak memory across presets.
  • Profiling workflow: collect torch.profiler traces and perf dumps to rank hot kernels.
  • Guided optimization handoff: map hotspots to existing fast paths and hand off kernel work to specialized optimization skills when needed.

Quick Start

Run the diffusion benchmark profile for a chosen model to measure denoise latency and collect profiling data.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: sglang-diffusion-benchmark-profile
Download link: https://github.com/Nabilhassan12345/voice-ai-workspace/archive/main.zip#sglang-diffusion-benchmark-profile

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.