eval-generator

Official

Automate eval test case generation for agents.

Authormicrosoft
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Generates concrete eval test cases for an agent by using an eval suite plan or a plain-English agent description, enabling structured evaluation across lifecycle stages.

Core Features & Use Cases

  • Use plan-based generation: leverages the eval suite plan's scenario table, evaluation methods, quality signals, and tags to produce test inputs and configurations.
  • Fallback generation: if no plan exists, creates 6-8 test cases from a plain-English agent description, including happy-path, edge, and adversarial scenarios.
  • Output formats: produces a Copilot Studio test set table, a CSV for single-response import, and a docx-style report for human review.
  • Lifecycle integration: supports subsequent steps in the cycle with /eval-result-interpreter and /eval-triage-and-improvement.

Quick Start

Run /eval-generator with your agent description to generate test cases.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: eval-generator
Download link: https://github.com/microsoft/eval-guide/archive/main.zip#eval-generator

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.