rag-perf

Community

Config-driven RAG performance benchmarking

Author: sayalinvidia
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Benchmarks a deployed NVIDIA RAG Blueprint server through a config-driven CLI that collects profiling data and can optionally run aiperf load tests.

Core Features & Use Cases

  • Config-driven profiling and load-testing against a live RAG server to quantify latency, throughput, and bottlenecks.
  • Supports grid sweeps over key knobs like concurrency, vdb_top_k, and reranker_top_k, producing reproducible results and detailed reports.
  • Useful for iterating on retrieval and reranker tuning, capacity planning, and SLA verification with repeatable YAML presets.
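
To make the grid-sweep idea concrete, here is a minimal sketch of how a sweep over the three knobs named above could expand into individual runs. It is not rag-perf's implementation: the `expand_grid` helper and the sample values are hypothetical, and only the knob names (`concurrency`, `vdb_top_k`, `reranker_top_k`) come from the feature list.

```python
from itertools import product


def expand_grid(grid):
    """Return one dict per combination of swept values, in a stable order."""
    keys = sorted(grid)  # sort keys so the run order is reproducible
    return [dict(zip(keys, combo)) for combo in product(*(grid[k] for k in keys))]


# Hypothetical sweep values, chosen only for illustration.
grid = {
    "concurrency": [1, 4, 16],
    "vdb_top_k": [20, 100],
    "reranker_top_k": [4, 10],
}

runs = expand_grid(grid)
print(len(runs))  # 3 * 2 * 2 = 12 combinations
print(runs[0])    # {'concurrency': 1, 'reranker_top_k': 4, 'vdb_top_k': 20}
```

The run count is the product of the list lengths, so each added knob value multiplies total benchmark time. That is worth keeping in mind when sizing a sweep against a live server.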

Quick Start

Run rag-perf with a YAML config to measure performance against your RAG server and generate a summary report of time to first token (TTFT) and throughput.
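
The listing does not show the report format, so the following is only a sketch of how TTFT and throughput figures of this kind are commonly derived from per-request measurements. The `summarize` function, the record layout, and the sample numbers are all assumptions made for illustration, not rag-perf output.

```python
from statistics import mean, median


def summarize(records, wall_seconds):
    """Summarize (ttft_seconds, output_tokens) pairs for completed requests."""
    ttfts = sorted(r[0] for r in records)
    total_tokens = sum(r[1] for r in records)
    # Nearest-rank p95: the smallest sample with at least 95% of samples
    # at or below it, i.e. rank = ceil(0.95 * n) in integer arithmetic.
    rank = (95 * len(ttfts) + 99) // 100
    return {
        "requests": len(ttfts),
        "ttft_mean_s": mean(ttfts),
        "ttft_p50_s": median(ttfts),
        "ttft_p95_s": ttfts[rank - 1],
        "tokens_per_s": total_tokens / wall_seconds,
        "requests_per_s": len(ttfts) / wall_seconds,
    }


# Hypothetical measurements: four requests completed within a 4-second window.
records = [(0.20, 100), (0.30, 120), (0.25, 80), (0.90, 100)]
summary = summarize(records, wall_seconds=4.0)
print(summary["tokens_per_s"])  # 400 tokens / 4 s = 100.0
print(summary["ttft_p95_s"])    # 0.9 (the slowest of four samples)
```

Reporting a tail percentile next to the mean matters for SLA checks, because a single slow request (0.9 s here) moves the p95 far more than it moves the median.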

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: rag-perf
Download link: https://github.com/sayalinvidia/sayali-skills-test/archive/main.zip#rag-perf

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
