Name: auto-eval
Availability: InStock
Author: easingthemes

System Documentation

What problem does it solve?

Automates offline evaluation of AI automation agents by running tests against pre-captured fixtures, enabling quality verification without invoking ADO or LLM APIs.

Core Features & Use Cases

Offline evaluation against pre-captured fixtures to ensure prompts and rule changes do not regress agent behavior.
Supports multiple agents (dor, pr-review, pr-answer) and an optional tier-2 mode for real LLM checks.
Provides end-to-end QA workflow guidance from argument parsing to result interpretation.

Quick Start

Run the evaluation framework against pre-captured fixtures to verify agent quality offline.

Please help me install this Skill: Name: auto-eval Download link: https://github.com/easingthemes/dx-aem-flow/archive/main.zip#auto-eval Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

auto-eval

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper