validation-artifacts
Community
Make validation claims provable and reproducible.
Software Engineering
#validation, #api testing, #visual regression, #reproducibility, #test evidence, #ml metrics, #holdout evaluation
Author: thistleknot
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It prevents teams from claiming “validation passed” without evidence by forcing every success claim to include reproducible, inspectable artifacts across training, inference, tests, UI, and APIs.
Core Features & Use Cases
- Mandatory proof over assertions: Collects the training logs, metrics, deterministic test evidence, visual diffs, benchmarks, and script outputs required to substantiate “passed” claims.
- Domain-specific artifact requirements: Defines tailored artifact sets for ML training (loss curves, holdout metrics), inference (predictions, confusion matrices, latency), deterministic validation (logs, reproducibility, coverage), visual validation (before/after/diff), and API/script validation (request/response samples, status codes, error cases, performance).
- Validation gating and red-flag detection: Provides clear “do not claim success” gates and flags common failure patterns such as missing holdout evaluation, suspected data leakage, flaky tests, or unverifiable UI/API behavior.
- UI autonomous validation pattern: Supports a hack → headless screenshot → artifact commit → revert workflow to produce auditable unit-style proofs for UI modules.
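The gating idea above can be sketched as a small check that refuses a "passed" claim while any required artifact is missing or empty. This is a minimal sketch under stated assumptions: the domain keys, the file names in `REQUIRED_ARTIFACTS`, and the helper names `missing_artifacts` and `may_claim_success` are hypothetical illustrations, not the skill's actual layout or API.

```python
from pathlib import Path

# Hypothetical mapping from validation domain to required artifact files.
# The file names are illustrative assumptions, not defined by the skill.
REQUIRED_ARTIFACTS = {
    "training": ["training.log", "loss_curve.png", "holdout_metrics.json"],
    "inference": ["predictions.csv", "confusion_matrix.png", "latency.json"],
    "deterministic": ["test.log", "coverage.txt"],
    "visual": ["before.png", "after.png", "diff.png"],
    "api": ["request_response_samples.json", "error_cases.json", "performance.json"],
}


def missing_artifacts(domain, root="validation_artifacts"):
    """Return the required files for `domain` that are absent or empty."""
    base = Path(root) / domain
    missing = []
    for name in REQUIRED_ARTIFACTS[domain]:
        path = base / name
        # An empty file is treated as missing: it proves nothing.
        if not path.is_file() or path.stat().st_size == 0:
            missing.append(name)
    return missing


def may_claim_success(domain, root="validation_artifacts"):
    """Gate: a success claim is allowed only when nothing is missing."""
    return not missing_artifacts(domain, root)
```

A caller might run `missing_artifacts("visual")` before reporting a UI change as validated and list the returned file names instead of claiming success.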
Quick Start
Use the validation-artifacts skill to enforce that every “it works” claim is accompanied by the required holdout metrics, test logs, and reproducible artifacts under validation_artifacts/.
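One way to make the contents of validation_artifacts/ inspectable later is to record a content hash for each file. The sketch below is an assumption-laden illustration, not part of the skill: the `manifest.json` file name and the `build_manifest` and `write_manifest` helpers are hypothetical.

```python
import hashlib
import json
from pathlib import Path


def build_manifest(root="validation_artifacts"):
    """Map each artifact's relative path to its SHA-256 digest."""
    root_path = Path(root)
    manifest = {}
    for path in sorted(root_path.rglob("*")):
        # Skip directories and the manifest itself (hypothetical file name).
        if path.is_file() and path.name != "manifest.json":
            rel = path.relative_to(root_path).as_posix()
            manifest[rel] = hashlib.sha256(path.read_bytes()).hexdigest()
    return manifest


def write_manifest(root="validation_artifacts"):
    """Write the manifest next to the artifacts and return it."""
    manifest = build_manifest(root)
    out = Path(root) / "manifest.json"
    out.write_text(json.dumps(manifest, indent=2, sort_keys=True))
    return manifest
```

Re-running `build_manifest` and comparing it with the stored file would show whether any artifact changed after the claim was made.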
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: validation-artifacts
Download link: https://github.com/thistleknot/skills/archive/main.zip#validation-artifacts
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.