Name: eval-designer
Availability: InStock
Author: xcrrr

System Documentation

What problem does it solve?

Eval Designer enables teams to design and implement robust evaluation frameworks for LLM systems to measure quality, safety, accuracy, and alignment across prompts, models, and deployments.

Core Features & Use Cases

Define evaluation goals and scope for end-to-end LLM evaluation.
Build test suites, rubrics, and automated evaluation pipelines; support human calibration and versioned runs.
Apply to CI/CD pipelines for model or prompt changes, safety audits, and regression testing.

Quick Start

Define an evaluation brief for a new LLM feature and generate an accompanying rubric and automated test plan.

Please help me install this Skill: Name: eval-designer Download link: https://github.com/xcrrr/claude-skills/archive/main.zip#eval-designer Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

eval-designer

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper