test-evaluation
CommunityDiagnose flaky and messy test suites.
System Documentation
What problem does it solve?
This Skill helps you evaluate, debug, and improve existing test suites that have become unreliable, slow, or insufficient at catching regressions.
Core Features & Use Cases
- Test pyramid assessment: Identifies when tests are stuck in integration/E2E layers and provides guidance for moving them down toward faster, more deterministic unit tests.
- Flaky test diagnosis: Helps pinpoint root causes such as isolation failures, external service timing issues, locale/timezone sensitivity, async synchronization mistakes, and nondeterministic ordering.
- Actionable refactoring guidance: Recommends safer remediation tactics like layer-correcting tests, quarantining when needed, and avoiding destructive changes without replacement coverage.
Use Case: Your CI intermittently fails while local runs pass; use the Skill to determine which tests are likely flaky for systemic reasons (e.g., network timeouts, shared state, or nondeterministic data), then plan the most appropriate fix at the correct test layer.
Quick Start
Ask the AI to evaluate and debug your existing test suite by identifying weak coverage areas and diagnosing the most likely causes of flakiness in the failing tests.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: test-evaluation Download link: https://github.com/jcdendrite/claude-config/archive/main.zip#test-evaluation Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.