eval-suite-planning

Official

Plan LLM evaluation suites for features.

AuthorAccelerated-Innovation
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Plan and orchestrate end-to-end LLM evaluation suites for feature development, coordinating DeepEval, Promptfoo, and RAGAS to ensure governance and quality.

Core Features & Use Cases

  • Determine required evaluation tools from architecture preflight (DeepEval, Promptfoo, RAGAS)
  • Generate eval criteria, dataset references, and test structure for feature evaluation
  • Produce a ready-to-run evaluation plan including artifact layout and thresholds to enforce governance

Quick Start

Provide a complete evaluation plan by reading the feature specs and architecture guidance.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: eval-suite-planning
Download link: https://github.com/Accelerated-Innovation/governed-ai-delivery/archive/main.zip#eval-suite-planning

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.