sglang-diffusion-benchmark-profile
Community
Benchmark and profile diffusion latency.
Author: Nabilhassan12345
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Optimizing a diffusion model requires knowing where time and memory actually go. This skill benchmarks denoise latency, end-to-end generation time, and memory usage, and profiles the run, so that optimization decisions are based on measurements.
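To make the distinction between denoise latency and end-to-end latency concrete, here is a minimal timing sketch. It is not the skill's own script: the `benchmark` function and its `prepare`, `denoise_step`, and `decode` callbacks are hypothetical names standing in for whatever pipeline stages a real model exposes. It uses only the Python standard library and discards warmup runs before taking medians.

```python
import statistics
import time
from typing import Callable, Dict, List


def benchmark(prepare: Callable[[], object],
              denoise_step: Callable[[object, int], object],
              decode: Callable[[object], object],
              num_steps: int = 4,
              warmup: int = 1,
              runs: int = 3) -> Dict[str, float]:
    """Time the denoise loop separately from the whole generation call.

    All three callbacks are hypothetical placeholders for pipeline stages.
    """
    denoise_s: List[float] = []
    e2e_s: List[float] = []
    for i in range(warmup + runs):
        t0 = time.perf_counter()
        state = prepare()                      # e.g. text encoding, latent init
        t1 = time.perf_counter()
        for step in range(num_steps):          # the denoise loop itself
            state = denoise_step(state, step)
        t2 = time.perf_counter()
        decode(state)                          # e.g. latent-to-image decode
        t3 = time.perf_counter()
        if i >= warmup:                        # discard warmup iterations
            denoise_s.append(t2 - t1)
            e2e_s.append(t3 - t0)
    return {
        "denoise_median_s": statistics.median(denoise_s),
        "e2e_median_s": statistics.median(e2e_s),
    }


if __name__ == "__main__":
    # Trivial stand-in stages so the sketch runs without a model.
    print(benchmark(lambda: 0, lambda s, i: s + 1, lambda s: s))
```

On a GPU, the timer should only be read after pending work has finished (with PyTorch, by calling `torch.cuda.synchronize()` before each `time.perf_counter()`), and peak memory can be read with `torch.cuda.max_memory_allocated()`. Both are omitted here to keep the sketch dependency-free.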
Core Features & Use Cases
- End-to-end diffusion benchmarks: measure denoise latency, end-to-end latency, and peak memory across presets.
- Profiling workflow: collect torch.profiler traces and perf dumps to rank hot kernels.
- Guided optimization handoff: map hotspots to existing fast paths and hand off kernel work to specialized optimization skills when needed.
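Ranking hot kernels, as in the profiling workflow above, comes down to aggregating per-kernel durations and sorting by total time. The sketch below assumes the trace has already been reduced to `(kernel_name, duration_us)` pairs; that record shape, the `rank_hot_kernels` function, and the kernel names in the example are illustrative assumptions, not the skill's actual output format.

```python
from collections import defaultdict
from typing import Dict, Iterable, List, Tuple


def rank_hot_kernels(events: Iterable[Tuple[str, float]],
                     top_k: int = 5) -> List[Tuple[str, float, float]]:
    """Return (name, total_us, share_of_total) for the top_k kernels.

    `events` is an assumed, simplified record format: one
    (kernel_name, duration_us) pair per kernel launch.
    """
    totals: Dict[str, float] = defaultdict(float)
    for name, duration_us in events:
        totals[name] += duration_us            # sum time across launches
    grand_total = sum(totals.values())
    ranked = sorted(totals.items(), key=lambda kv: kv[1], reverse=True)
    return [
        (name, total, total / grand_total if grand_total else 0.0)
        for name, total in ranked[:top_k]
    ]


if __name__ == "__main__":
    # Made-up durations, for illustration only.
    sample = [("attention", 300.0), ("gemm", 100.0),
              ("attention", 200.0), ("layernorm", 50.0)]
    for name, total_us, share in rank_hot_kernels(sample, top_k=3):
        print(f"{name:10s} {total_us:8.1f} us  {share:6.1%}")
```

With a real `torch.profiler` trace, a comparable ranking is available from the profiler's `key_averages()` table; the standalone version here only shows the aggregation step.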
Quick Start
Run the diffusion benchmark profile for a chosen model to measure denoise latency and collect profiling data.
Dependency Matrix
Required Modules
None required
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: sglang-diffusion-benchmark-profile
Download link: https://github.com/Nabilhassan12345/voice-ai-workspace/archive/main.zip#sglang-diffusion-benchmark-profile
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.