eval-generator
OfficialAutomate eval test case generation for agents.
Software Engineering#testing#test-generation#eval#agent-evaluation#copilot-studio#conversational-eval
Authormicrosoft
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Generates concrete eval test cases for an agent by using an eval suite plan or a plain-English agent description, enabling structured evaluation across lifecycle stages.
Core Features & Use Cases
- Use plan-based generation: leverages the eval suite plan's scenario table, evaluation methods, quality signals, and tags to produce test inputs and configurations.
- Fallback generation: if no plan exists, creates 6-8 test cases from a plain-English agent description, including happy-path, edge, and adversarial scenarios.
- Output formats: produces a Copilot Studio test set table, a CSV for single-response import, and a docx-style report for human review.
- Lifecycle integration: supports subsequent steps in the cycle with /eval-result-interpreter and /eval-triage-and-improvement.
Quick Start
Run /eval-generator with your agent description to generate test cases.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: eval-generator Download link: https://github.com/microsoft/eval-guide/archive/main.zip#eval-generator Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.