aiperf
OfficialVendor-neutral AI benchmarking for latency.
Authorair-gapped
Version1.0.0
Installs0
System Documentation
What problem does it solve?
AIPerf provides a vendor-neutral framework to benchmark latency, throughput, and telemetry for any OpenAI-compatible inference server.
Core Features & Use Cases
- Replay production traces (Mooncake/Bailian/BurstGPT) to reproduce traffic patterns and tail latency.
- Measure goodput as the fraction of requests meeting all SLOs, while collecting GPU and server telemetry.
- Extend benchmarks via a plugin system to add new endpoints, datasets, exporters, or graders.
Quick Start
Run a baseline benchmark against your endpoint to validate latency, throughput, and goodput using a representative workload.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: aiperf Download link: https://github.com/air-gapped/skills/archive/main.zip#aiperf Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.