eval-suite-planner
Official
Plan eval suites from agent descriptions.
Education & Research
#ai-evaluation #copilot-studio #scenario-library #eval-suite #evaluation-planning #stage-define
Author: microsoft
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Produces a complete eval-suite plan from a plain-English agent description, grounding the plan in Microsoft's Eval Scenario Library and MS Learn agent evaluation guidance.
Core Features & Use Cases
- Defines a Step 0 routing step that classifies the agent description by business problem and capability scenario type (Information Retrieval, Knowledge Grounding, etc.) and maps each type to the core plan outputs.
- Generates a scenario table with core business, capability, edge-case, and variation tests, plus primary and secondary evaluation methods per scenario.
- Produces an accompanying quality-signal mapping, pass/fail thresholds, and a rationale that explains how the plan supports Stage 1 (Define) and enables Stage 2 execution.
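To make the shape of such a plan concrete, here is a minimal sketch of how the scenario table and pass/fail thresholds described above could be represented. This is not the skill's actual output format: the class names, field names, evaluation-method labels, and threshold values are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field

# Scenario categories named in the feature list above.
CATEGORIES = ("core business", "capability", "edge-case", "variation")


@dataclass
class Scenario:
    """One row of a hypothetical scenario table."""
    name: str
    category: str          # must be one of CATEGORIES
    primary_method: str    # illustrative label, not an official method name
    secondary_method: str  # illustrative label, not an official method name
    pass_threshold: float  # minimum acceptable pass rate, in [0, 1]

    def __post_init__(self):
        if self.category not in CATEGORIES:
            raise ValueError(f"unknown category: {self.category!r}")
        if not 0.0 <= self.pass_threshold <= 1.0:
            raise ValueError("pass_threshold must be between 0 and 1")


@dataclass
class EvalSuitePlan:
    """A plan: the agent description plus its scenario rows."""
    agent_description: str
    scenarios: list = field(default_factory=list)

    def missing_categories(self):
        """Return the categories that have no scenario yet, in CATEGORIES order."""
        present = {s.category for s in self.scenarios}
        return [c for c in CATEGORIES if c not in present]

    def passes(self, scenario_name, observed_pass_rate):
        """Compare an observed pass rate with the scenario's threshold."""
        for s in self.scenarios:
            if s.name == scenario_name:
                return observed_pass_rate >= s.pass_threshold
        raise KeyError(scenario_name)


# Example data is invented for illustration only.
plan = EvalSuitePlan(
    agent_description="HR policy assistant that answers from internal documents",
    scenarios=[
        Scenario("Answer a leave-policy question", "core business",
                 "reference comparison", "human review", 0.9),
        Scenario("Cite the source document", "capability",
                 "keyword match", "human review", 0.8),
    ],
)
print(plan.missing_categories())  # ['edge-case', 'variation']
```

A check like `missing_categories()` is one simple way to confirm that a plan covers all four test types before moving on to execution.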
Quick Start
Provide an agent description to generate a complete eval-suite plan.
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: eval-suite-planner
Download link: https://github.com/microsoft/eval-guide/archive/main.zip#eval-suite-planner
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
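For a manual install, the extract-and-copy step can be sketched as follows. This is a rough illustration, not an official installer: the helper name `extract_skill` is hypothetical, and it assumes the downloaded archive contains a folder named `eval-suite-planner` somewhere in its tree, which has not been verified against the repository layout.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(zip_path, skill_name, skills_dir):
    """Copy every file found under a folder named `skill_name` inside the
    archive into `skills_dir/skill_name`, and return the written paths.

    Assumption: the archive holds the skill in a folder named `skill_name`.
    """
    dest_root = Path(skills_dir) / skill_name
    written = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep files that sit below a directory named skill_name.
            if skill_name not in parts[:-1]:
                continue
            rel_parts = parts[parts.index(skill_name) + 1:]
            # Skip entries that try to escape the destination folder.
            if ".." in rel_parts:
                continue
            target = dest_root.joinpath(*rel_parts)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(archive.read(info))
            written.append(target)
    return written
```

After saving the archive from the download link above as, say, `main.zip`, a call such as `extract_skill("main.zip", "eval-suite-planner", ".claude/skills")` would place the files under `.claude/skills/eval-suite-planner`. If the function returns an empty list, the folder-name assumption does not hold for that archive.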