Name: eval-dataset
Availability: InStock
Author: opendatahub-io

System Documentation

What problem does it solve?

Automates the generation of end-to-end evaluation test cases for skills, ensuring coverage against a defined eval.yaml/spec and judge criteria.

Core Features & Use Cases

Generate diverse, judge-driven test cases aligned to dataset schemas and evaluation config.
Create context artifacts like inputs.yaml, annotations.yaml, answers.yaml, and companion files as needed.
Validate case structure and provide guidance for expanding or refining evaluation coverage.

Quick Start

Run /eval-dataset to generate a complete set of evaluation test cases for a target skill.

Please help me install this Skill: Name: eval-dataset Download link: https://github.com/opendatahub-io/agent-eval-harness/archive/main.zip#eval-dataset Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

eval-dataset

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper