eval-extract
CommunityBlind evaluation of handoff extraction prompts.
Authorbrianruggieri
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Blind evaluation of handoff extraction prompts using blinded subagents to eliminate author bias and ensure consistent scoring across fixtures.
Core Features & Use Cases
- Loads the grading rubric and test fixtures, runs extractor and grader agents per fixture, and outputs per-fixture extractions and scorecards.
- Saves results to handoff/tests/output/eval and provides an automated summary across fixtures.
- Supports iterative refinement by exposing evaluation workflow that can be re-run with fresh agents.
Quick Start
Run the blind evaluation workflow to execute extraction prompts against fixtures and generate scorecards.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: eval-extract Download link: https://github.com/brianruggieri/skills/archive/main.zip#eval-extract Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.