output-dev-eval-testing

Official

Offline evaluation tests for Output SDK workflows.

Author: growthxai
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Offline evaluation testing validates AI workflows without touching live production, which makes quality checks repeatable and lets regressions in Output SDK pipelines be caught before release.

Core Features & Use Cases

  • Build, run, and manage offline eval tests for Output SDK workflows using @outputai/evals.
  • Create dataset YAMLs, evaluators with verify(), and eval workflows to automate quality gates and reproducibility.
  • Use for regression testing, subjective quality assessment with LLM judges, and CLI-driven validation across multiple scenarios.
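
To make the dataset, evaluator, and quality-gate ideas above concrete, here is a minimal self-contained sketch. It does not import or reproduce the @outputai/evals API: every name in it (EvalCase, Evaluator, fakeWorkflow, runEval, the 0.9 threshold) is a hypothetical stand-in, the dataset is an in-memory array rather than a YAML file, and only the idea of an evaluator exposing verify() comes from the description above.

```typescript
// Hypothetical sketch: none of these types or functions come from @outputai/evals.

// One row of a dataset (stand-in for an entry in a dataset YAML file).
interface EvalCase {
  name: string;
  input: string;
  expected: string;
}

// Result of checking a single workflow output.
interface Verdict {
  pass: boolean;
  reason: string;
}

// An evaluator exposes verify(), mirroring the concept named in the feature list.
interface Evaluator {
  name: string;
  verify(output: string, testCase: EvalCase): Verdict;
}

// Stand-in for the workflow under test; a real one would run an AI pipeline.
function fakeWorkflow(input: string): string {
  return input.trim().toUpperCase();
}

// Deterministic evaluator: passes only when the output equals the expected value.
const exactMatch: Evaluator = {
  name: "exact-match",
  verify(output, testCase) {
    const pass = output === testCase.expected;
    const reason = pass
      ? "matched"
      : `expected "${testCase.expected}", got "${output}"`;
    return { pass, reason };
  },
};

// In-memory dataset; the last case is written to fail so the gate has something to catch.
const dataset: EvalCase[] = [
  { name: "trims-whitespace", input: "  hello ", expected: "HELLO" },
  { name: "already-upper", input: "OK", expected: "OK" },
  { name: "known-regression", input: "abc", expected: "abc" },
];

// Runs every case offline and computes the fraction that passed.
function runEval(
  cases: EvalCase[],
  evaluator: Evaluator,
  workflow: (input: string) => string,
) {
  const results = cases.map((c) => ({
    name: c.name,
    ...evaluator.verify(workflow(c.input), c),
  }));
  const passRate = results.filter((r) => r.pass).length / results.length;
  return { results, passRate };
}

const report = runEval(dataset, exactMatch, fakeWorkflow);

// Quality gate: an assumed threshold of 90% passing cases.
const gatePassed = report.passRate >= 0.9;

for (const r of report.results) {
  console.log(`${r.pass ? "PASS" : "FAIL"} ${r.name}: ${r.reason}`);
}
console.log(`gate passed: ${gatePassed}`);
```

Because the workflow, dataset, and evaluator are all passed in as plain values, the same runner can be pointed at a different evaluator (for example one that asks an LLM judge for a verdict) without changing the gate logic.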

Quick Start

Initialize an eval workflow scaffold, then add datasets and evaluators to begin offline testing.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install it automatically by copying and pasting the text below into Claude Code.

Please help me install this Skill:
Name: output-dev-eval-testing
Download link: https://github.com/growthxai/output/archive/main.zip#output-dev-eval-testing

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
