evaluate-rag

Name: evaluate-rag
Availability: InStock
Author: marchatton

Community

Evaluate RAG pipeline quality

Software Engineering #llm #rag #metrics #evaluation #retrieval #generation

Authormarchatton

Version1.0.0

Installs0

System Documentation

What problem does it solve?

This Skill addresses the critical need to systematically evaluate and improve the performance of Retrieval-Augmented Generation (RAG) systems by dissecting the quality of both retrieval and generation components.

Core Features & Use Cases

Component-wise Evaluation: Separates the assessment of retrieval accuracy from generation faithfulness and relevance.
Dataset Curation: Provides methods for generating manual and synthetic QA pairs for robust retrieval testing.
Metric Implementation: Details the application of key metrics like Recall@k, Precision@k, MRR, and NDCG@k for different query types.
Chunking Optimization: Guides the process of tuning chunking strategies for improved retrieval performance.
Use Case: When a RAG system is underperforming, use this Skill to pinpoint whether the issue lies in retrieving the correct documents or in the LLM's ability to synthesize a faithful and relevant answer from the provided context.

Quick Start

Use the evaluate-rag skill to analyze the retrieval quality of the RAG pipeline by generating synthetic QA pairs for a given document.

evaluate-rag

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper