dw-skill-eval-build

Community

Co-create test sets from skill analysis.

Authorxurik
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill orchestrates the end-to-end creation of a structured test set for a skill based on prior analysis results, enabling repeatable evaluation workflows.

Core Features & Use Cases

  • Loads analysis results (skill-intent, api_references, and coverage data) from eval/evaluations/{skill-name}/skill-analysis.yaml.
  • Generates Happy Path, Edge Case, Adversarial, and Pressure test cases to form a complete testset.
  • Writes the final test set to eval/evaluations/{skill-name}/testset.yaml and facilitates validation and refinement.

Quick Start

在 Claude Code 中执行 /dw-skill-eval-build 以为已分析的技能生成测试集。

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: dw-skill-eval-build
Download link: https://github.com/xurik/dataworks-skill-evaluator/archive/main.zip#dw-skill-eval-build

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.