rag-perf
CommunityConfig-driven RAG performance benchmarking
Authorsayalinvidia
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Benchmark a deployed NVIDIA RAG Blueprint server by driving profiling data and optional aiperf load tests via a config-driven CLI.
Core Features & Use Cases
- Config-driven profiling and load-testing against a live RAG server to quantify latency, throughput, and bottlenecks.
- Supports grid sweeps over key knobs like concurrency, vdb_top_k, and reranker_top_k, producing reproducible results and detailed reports.
- Useful for iteration on retrieval/reranker tuning, capacity planning, and SLA verification with repeatable YAML presets.
Quick Start
Run rag-perf with a YAML config to measure performance against your RAG server and generate a summarized report of TTFT and throughput.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: rag-perf Download link: https://github.com/sayalinvidia/sayali-skills-test/archive/main.zip#rag-perf Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.