eval-analyze

Official

Auto-generates eval.yaml from skills.

Authoropendatahub-io
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This skill automates the creation of eval.yaml for the agent-eval harness by analyzing a target skill's SKILL.md, sub-skills, prompts, scripts, and test cases to derive a complete evaluation configuration.

Core Features & Use Cases

  • Recursive skill analysis: reads the target SKILL.md and any sub-skills, along with prompts, scripts, and test cases, to derive a complete evaluation configuration.
  • Grounded configuration: ensures dataset schema, outputs, judges, models, and thresholds are anchored in observed files rather than placeholders.
  • Workflow automation: produces a ready-to-run eval.yaml and caches eval.md for future reference.
  • Trigger integration: can be invoked to prepare evaluation infrastructure when eval.yaml is missing.

Quick Start

Invoke /eval-analyze against a target skill to auto-generate a complete eval.yaml for evaluation.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: eval-analyze
Download link: https://github.com/opendatahub-io/agent-eval-harness/archive/main.zip#eval-analyze

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.