checkpoint-review
Community
Thorough A/B diff evaluation for robust software.
Software Engineering
#evaluation #workflows #code-review #model-selection #diff-analysis #ab-test #execution-evidence
Author: MinhOmega
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill analyzes Model A/B diffs and generates a comprehensive, multi-section evaluation to determine which model better satisfies the given acceptance criteria. It collates evidence from diffs, execution traces, and prior turn evaluations to produce a final turn evaluation.
Core Features & Use Cases
- Automated, multi-axis evaluation of PR-like diffs across A and B implementations.
- Context-aware: consumes prior turn evaluations and execution evidence so that changes are not signed off in isolation.
- Produces a complete workspace/turn_{N}/turn_{N}_evaluation.md and runs a validation gate before writing it.
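The "validation gate before write" idea can be sketched as follows. This is a minimal illustration, not the Skill's actual implementation: the section headings in `REQUIRED_SECTIONS` and the function names are hypothetical, and only the output path pattern comes from the description above.

```python
from pathlib import Path

# Hypothetical section headings; the Skill's real section list is not documented here.
REQUIRED_SECTIONS = ("## Summary", "## Evidence", "## Verdict")


def missing_sections(body: str, required=REQUIRED_SECTIONS) -> list[str]:
    """Return the required headings that do not appear as a line in body."""
    lines = {line.strip() for line in body.splitlines()}
    return [heading for heading in required if heading not in lines]


def write_evaluation(workspace: Path, turn: int, body: str) -> Path:
    """Write workspace/turn_{N}/turn_{N}_evaluation.md only if validation passes."""
    missing = missing_sections(body)
    if missing:
        # Gate: refuse to write an incomplete evaluation.
        raise ValueError(f"evaluation is missing sections: {missing}")
    out_path = workspace / f"turn_{turn}" / f"turn_{turn}_evaluation.md"
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text(body, encoding="utf-8")
    return out_path
```

Because the check runs before any directory or file is created, a failed validation leaves the workspace untouched.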
Quick Start
Place the diffs and evidence for turn N under turn_<N> in your workspace, then invoke the evaluation workflow to generate turn_<N>/turn_<N>_evaluation.md.
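Before invoking the workflow, it can help to confirm that the turn directory exists and contains the expected inputs. The sketch below is a hypothetical pre-flight check, not part of the Skill: the helper names are made up for illustration, and it assumes only that inputs live somewhere under turn_<N> and that the output file is named turn_<N>_evaluation.md.

```python
from pathlib import Path


def turn_dir(workspace: Path, turn: int) -> Path:
    """Return the directory that holds the inputs for one turn."""
    return workspace / f"turn_{turn}"


def collect_inputs(workspace: Path, turn: int) -> list[Path]:
    """List every file under turn_<N>, excluding the evaluation output itself."""
    root = turn_dir(workspace, turn)
    if not root.is_dir():
        raise FileNotFoundError(f"no directory for turn {turn}: {root}")
    output_name = f"turn_{turn}_evaluation.md"
    return sorted(
        p for p in root.rglob("*") if p.is_file() and p.name != output_name
    )
```

Excluding the output file means a re-run does not treat a previous evaluation of the same turn as fresh evidence.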
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: checkpoint-review
Download link: https://github.com/MinhOmega/marlin-skill/archive/main.zip#checkpoint-review
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
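For a manual install, the extraction step can be sketched in Python once the .zip file has been downloaded. This is an illustration under stated assumptions: `extract_skill` is a hypothetical helper, and the archive layout `<top-level>/<skill_name>/...` is typical of GitHub branch archives but has not been confirmed for this repository.

```python
import zipfile
from pathlib import Path, PurePosixPath


def extract_skill(archive: Path, skill_name: str, skills_dir: Path) -> Path:
    """Copy the skill_name folder out of a downloaded archive into skills_dir.

    Assumes entries are laid out as <top-level>/<skill_name>/..., which is
    an unverified assumption about this repository's archive.
    """
    target = skills_dir / skill_name
    extracted = 0
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            # Keep only file entries located at <top-level>/<skill_name>/...
            if info.is_dir() or len(parts) < 3 or parts[1] != skill_name:
                continue
            rel = parts[2:]
            if ".." in rel:
                continue  # skip entries that try to escape the target folder
            dest = target.joinpath(*rel)
            dest.parent.mkdir(parents=True, exist_ok=True)
            dest.write_bytes(zf.read(info))
            extracted += 1
    if extracted == 0:
        raise FileNotFoundError(f"{skill_name!r} not found in {archive}")
    return target
```

A call such as `extract_skill(Path("main.zip"), "checkpoint-review", Path(".claude/skills"))` would then place the files under .claude/skills/checkpoint-review, where the local file name main.zip is also an assumption.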