Name: eval-faq
Availability: InStock
Author: microsoft

System Documentation

What problem does it solve?

Answers AI agent evaluation questions with practical, opinionated guidance grounded primarily in Microsoft's agent evaluation ecosystem (MS Learn, Eval Scenario Library, Triage & Improvement Playbook, Eval Guidance Kit) supplemented by select industry sources.

Core Features & Use Cases

Provides authoritative, cited guidance for eval-method selection, dataset design, non-determinism handling, tool-call evaluation, and red-teaming.
Synthesizes framework references from MS Learn and the Triage Playbook to support Stage 1 Define, Set Baseline & Iterate, Systematic Expansion, and Operationalize planning.
Use cases include planning evals, interpreting results, and triaging failures with root-cause analysis.

Quick Start

Ask a question using /eval-faq <your question> to receive actionable guidance.

Please help me install this Skill: Name: eval-faq Download link: https://github.com/microsoft/eval-guide/archive/main.zip#eval-faq Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

eval-faq

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper