Name: relevance-evals
Author: RelevanceAI

System Documentation

What problem does it solve?

Manages the end-to-end evaluation process for AI agents, enabling structured test design, execution, and results analysis within Relevance AI.

Core Features & Use Cases

Test-set and test-case management for end-to-end agent evals
Two evaluation modes: generate_and_score and score_only
Batch-based result tracking with per-run rule judgments and summaries
Guidelines for designing reliable, observable rules and scenarios

Quick Start

Create a new test set, add test cases, run an evaluation batch, and review the results.
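The workflow above can be sketched in a few lines of Python. This is only an illustrative in-memory model, not the Relevance AI API: `EvalCase`, `Rule`, `run_batch`, and the `agent` callable are hypothetical names introduced here, and the meaning given to the two modes (generate_and_score calls the agent and then scores its output, score_only scores an output that already exists) is an assumption inferred from the mode names.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class EvalCase:
    """Hypothetical test case: a prompt plus an optional pre-recorded output."""
    name: str
    prompt: str
    existing_output: Optional[str] = None  # only needed for score_only


@dataclass
class Rule:
    """Hypothetical rule: a named, observable check on the agent output."""
    name: str
    check: Callable[[str], bool]


def run_batch(cases, rules, mode, agent=None):
    """Run every case once and return per-run rule judgments plus a summary."""
    if mode not in ("generate_and_score", "score_only"):
        raise ValueError(f"unknown mode: {mode}")
    runs = []
    for case in cases:
        if mode == "generate_and_score":
            if agent is None:
                raise ValueError("generate_and_score needs an agent callable")
            output = agent(case.prompt)  # assumed: produce a fresh output
        else:
            if case.existing_output is None:
                raise ValueError(f"case {case.name!r} has no output to score")
            output = case.existing_output  # assumed: score what already exists
        judgments = {rule.name: rule.check(output) for rule in rules}
        runs.append({"case": case.name, "output": output, "judgments": judgments})
    # A run counts as passed only if every rule judged it True.
    passed = sum(all(run["judgments"].values()) for run in runs)
    return {"mode": mode, "runs": runs,
            "summary": {"total": len(runs), "passed": passed}}


if __name__ == "__main__":
    cases = [EvalCase("greeting", "Say hello", existing_output="hello there")]
    rules = [Rule("mentions_hello", lambda out: "hello" in out.lower())]
    batch = run_batch(cases, rules, "generate_and_score", agent=lambda p: "Hello!")
    print(batch["summary"])
```

Keeping each rule as a small check on the output text mirrors the guideline that rules should be observable: a judgment can always be traced back to something visible in the run.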

Please help me install this Skill:
Name: relevance-evals
Download link: https://github.com/RelevanceAI/agent-skills/archive/main.zip#relevance-evals
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
