skill-eval-writer

Community

Auto-generate evals and grade scripts for any Skill.

Author: cr330326
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Writing standardized evaluation tooling for a Skill by hand is repetitive and easy to get inconsistent. This tool turns a Skill's SKILL.md and its references into a complete evaluation package: an evals.json file, a grade.py script, and run guidelines. Together these allow instruction quality to be validated automatically across Skills.

Core Features & Use Cases

  • Generates a complete eval suite (evals.json) with five well-scoped eval cases and corresponding assertions.
  • Produces a grade.py script that validates Skill outputs from either textual responses or generated files.
  • Validates eval definitions and provides a consistent JSON schema for scoring.
  • Supports optional directories (scripts/, references/, assets/) as resources to enrich evaluations.
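
The listing does not publish the evals.json schema, so the following is only a minimal sketch of what validating an eval definition could look like. The top-level `evals` key and the per-case fields `id`, `prompt`, and `assertions` are hypothetical names chosen for illustration; the real files generated by the Skill may use a different layout.

```python
# Hypothetical per-case fields; the actual evals.json schema is not documented here.
REQUIRED_CASE_KEYS = {"id", "prompt", "assertions"}


def validate_evals(doc: dict, expected_cases: int = 5) -> list[str]:
    """Return a list of problems found; an empty list means the document passed."""
    cases = doc.get("evals")  # assumed top-level key
    if not isinstance(cases, list):
        return ["top-level 'evals' must be a list"]

    problems = []
    if len(cases) != expected_cases:
        problems.append(f"expected {expected_cases} eval cases, found {len(cases)}")

    seen_ids = set()
    for index, case in enumerate(cases):
        if not isinstance(case, dict):
            problems.append(f"case {index}: must be an object")
            continue
        missing = REQUIRED_CASE_KEYS - case.keys()
        if missing:
            problems.append(f"case {index}: missing keys {sorted(missing)}")
            continue
        case_id = str(case["id"])
        if case_id in seen_ids:
            problems.append(f"case {index}: duplicate id {case_id!r}")
        seen_ids.add(case_id)
        # Every case needs at least one assertion to be gradable.
        if not isinstance(case["assertions"], list) or not case["assertions"]:
            problems.append(f"case {index}: 'assertions' must be a non-empty list")
    return problems
```

Returning a list of problems rather than raising on the first one lets a caller report every issue in a single pass, which suits a file that is generated and then reviewed.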

Quick Start

Run the Skill Eval Writer on a Skill directory to generate evals.json, grade.py, and a grading guide.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: skill-eval-writer
Download link: https://github.com/cr330326/AgentSkill/archive/main.zip#skill-eval-writer

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
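
For a manual install, the extract-and-copy step could be sketched as below. This assumes the skill lives in a folder named `skill-eval-writer` directly under the archive's single top-level directory (for example `AgentSkill-main/skill-eval-writer/`); that layout is a guess based on the link's `#skill-eval-writer` fragment and has not been verified against the repository. The function name `extract_skill` is hypothetical, and the caller is expected to download the archive and open it first.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(archive: zipfile.ZipFile, skill_name: str, skills_dir: Path) -> Path:
    """Copy files under <top-level>/<skill_name>/ into skills_dir/<skill_name>/."""
    target_root = skills_dir / skill_name
    copied = 0
    for info in archive.infolist():
        if info.is_dir():
            continue
        parts = PurePosixPath(info.filename).parts
        # Assumed layout: "<top-level>/<skill_name>/<relative path>".
        if len(parts) < 3 or parts[1] != skill_name:
            continue
        relative = parts[2:]
        if ".." in relative:
            continue  # skip entries that would escape the target directory
        destination = target_root.joinpath(*relative)
        destination.parent.mkdir(parents=True, exist_ok=True)
        destination.write_bytes(archive.read(info))
        copied += 1
    if copied == 0:
        raise FileNotFoundError(f"no files found for skill {skill_name!r} in archive")
    return target_root
```

With a downloaded archive, a call might look like `extract_skill(zipfile.ZipFile("main.zip"), "skill-eval-writer", Path(".claude/skills"))`, where `main.zip` is whatever local filename the download was saved under.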
