test-evaluation

Community

Diagnose flaky and messy test suites.

Author: jcdendrite
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill helps you evaluate, debug, and improve existing test suites that have become unreliable, slow, or insufficient at catching regressions.

Core Features & Use Cases

  • Test pyramid assessment: Identifies when tests are stuck in integration/E2E layers and provides guidance for moving them down toward faster, more deterministic unit tests.
  • Flaky test diagnosis: Helps pinpoint root causes such as isolation failures, external service timing issues, locale/timezone sensitivity, async synchronization mistakes, and nondeterministic ordering.
  • Actionable refactoring guidance: Recommends safer remediation tactics like layer-correcting tests, quarantining when needed, and avoiding destructive changes without replacement coverage.
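As an illustration of the locale/timezone root cause and of moving a check down to a deterministic unit test, here is a minimal sketch. The function `is_weekend`, the helper `utc_now`, and the injected `clock` parameter are hypothetical names chosen for this example; they are not part of the Skill. The idea is that a test which reads the real wall clock can pass or fail depending on when and where it runs, while a test that supplies a fixed time cannot.

```python
from datetime import datetime, timedelta, timezone
from typing import Callable


def utc_now() -> datetime:
    """Default clock: the real current time, expressed in UTC."""
    return datetime.now(timezone.utc)


def is_weekend(clock: Callable[[], datetime] = utc_now) -> bool:
    """Hypothetical code under test: weekend check evaluated in UTC.

    The clock is injected so a unit test can pin the time instead of
    depending on the CI machine's wall clock or local timezone.
    """
    return clock().astimezone(timezone.utc).weekday() >= 5


def test_is_weekend_is_timezone_independent() -> None:
    # Friday 23:30 at UTC-5 is already Saturday 04:30 in UTC.
    friday_night_utc_minus_5 = datetime(
        2024, 1, 5, 23, 30, tzinfo=timezone(timedelta(hours=-5))
    )
    assert is_weekend(lambda: friday_night_utc_minus_5) is True

    # Friday noon UTC is still a weekday.
    friday_noon_utc = datetime(2024, 1, 5, 12, 0, tzinfo=timezone.utc)
    assert is_weekend(lambda: friday_noon_utc) is False


test_is_weekend_is_timezone_independent()
```

Injecting the clock is one common option; freezing time with a mocking library is another. Either way the test no longer depends on the environment it runs in.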

Use Case: Your CI intermittently fails while local runs pass; use the Skill to determine which tests are likely flaky for systemic reasons (e.g., network timeouts, shared state, or nondeterministic data), then plan the most appropriate fix at the correct test layer.
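One way to gather evidence before asking for a diagnosis is to compare outcomes for the same test across repeated runs. The sketch below is an assumption about how you might do this yourself, not a description of how the Skill works internally. The function `classify_tests` and the sample data are hypothetical. A test that both passes and fails on unchanged code is a flakiness candidate.

```python
from collections import defaultdict
from typing import Dict, Iterable, Set, Tuple


def classify_tests(results: Iterable[Tuple[str, bool]]) -> Dict[str, str]:
    """Label each test from (test_name, passed) records across several runs.

    A test with mixed outcomes is labelled "flaky"; one that always fails is
    more likely a real regression than a flakiness problem.
    """
    outcomes: Dict[str, Set[bool]] = defaultdict(set)
    for name, passed in results:
        outcomes[name].add(passed)

    labels: Dict[str, str] = {}
    for name, seen in outcomes.items():
        if seen == {True}:
            labels[name] = "stable-pass"
        elif seen == {False}:
            labels[name] = "consistent-fail"
        else:
            labels[name] = "flaky"
    return labels


# Hypothetical records collected from three CI runs of the same commit.
sample_results = [
    ("test_login", True), ("test_login", True), ("test_login", True),
    ("test_checkout", True), ("test_checkout", False), ("test_checkout", True),
    ("test_export", False), ("test_export", False), ("test_export", False),
]
labels = classify_tests(sample_results)
print(labels)
```

The tests labelled "flaky" are the ones worth handing over for root-cause analysis, together with their failure logs.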

Quick Start

Ask the AI to evaluate and debug your existing test suite by identifying weak coverage areas and diagnosing the most likely causes of flakiness in the failing tests.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: test-evaluation
Download link: https://github.com/jcdendrite/claude-config/archive/main.zip#test-evaluation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
