eval-extract

Community

Blind evaluation of handoff extraction prompts.

Authorbrianruggieri
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Blind evaluation of handoff extraction prompts using blinded subagents to eliminate author bias and ensure consistent scoring across fixtures.

Core Features & Use Cases

  • Loads the grading rubric and test fixtures, runs extractor and grader agents per fixture, and outputs per-fixture extractions and scorecards.
  • Saves results to handoff/tests/output/eval and provides an automated summary across fixtures.
  • Supports iterative refinement by exposing evaluation workflow that can be re-run with fresh agents.

Quick Start

Run the blind evaluation workflow to execute extraction prompts against fixtures and generate scorecards.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: eval-extract
Download link: https://github.com/brianruggieri/skills/archive/main.zip#eval-extract

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.