failure-taxonomy

Official

Structure LLM failure modes.

Authormaragudk
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill transforms unstructured, freeform annotations from LLM trace reviews into a structured, actionable taxonomy of failure modes, enabling systematic error analysis and improvement.

Core Features & Use Cases

  • Axial Coding: Groups open-coded annotations into coherent, non-overlapping binary failure categories.
  • Taxonomy Building: Defines clear titles, definitions, and examples for each failure mode.
  • Re-labeling & Quantification: Applies the taxonomy to traces and calculates error rates for prioritization.
  • Use Case: After reviewing 50 user interactions with a chatbot, you have raw notes like "bot misunderstood intent" or "gave irrelevant info". This Skill helps you cluster these into categories like "Intent Misinterpretation" or "Off-Topic Response" and quantify how often each occurs.

Quick Start

Use the failure-taxonomy skill to build a taxonomy from the provided annotations.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: failure-taxonomy
Download link: https://github.com/maragudk/evals-skills/archive/main.zip#failure-taxonomy

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.