Name: summarize-eval-results
Author: seiggy

System Documentation

What problem does it solve?

This Skill automates the tedious process of analyzing and summarizing the output from AI agent evaluation tests, making it easier to understand performance and identify issues.

Core Features & Use Cases

Automated Reporting: Parses complex JSON evaluation results into a human-readable markdown report.
Key Metrics Visualization: Presents a clear table of scenario performance across various metrics.
Failure Analysis: Details specific failures, including user prompts, tool call chains, and agent responses.
Use Case: After running a suite of agent tests, use this Skill to quickly generate a summary report that highlights which scenarios passed, failed, and why, allowing for rapid iteration on agent prompts.
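As a rough illustration of the reporting flow described above, the following minimal Python sketch turns a list of scenario results into a markdown summary with a metrics table and a failure section. The input schema (the `scenario`, `metrics`, `passed`, `prompt`, `tool_calls`, and `response` fields) and the `build_report` function are assumptions made for this example; the Skill's actual JSON format and implementation are not documented here and may differ.

```python
import json

# Hypothetical evaluation output; the real schema used by the Skill may differ.
SAMPLE = """
[
  {"scenario": "turn_on_light",
   "metrics": {"tool_accuracy": 1.0, "relevance": 0.9},
   "passed": true},
  {"scenario": "set_thermostat",
   "metrics": {"tool_accuracy": 0.0, "relevance": 0.4},
   "passed": false,
   "prompt": "Set the thermostat to 70",
   "tool_calls": ["find_device", "set_brightness"],
   "response": "I dimmed the lights."}
]
"""


def build_report(results):
    """Render scenario results as a markdown report (illustrative only)."""
    # Collect every metric name so the table has one column per metric.
    metric_names = sorted({m for r in results for m in r["metrics"]})
    header = ["Scenario", "Result"] + metric_names

    lines = ["# Evaluation Summary", ""]
    lines.append("| " + " | ".join(header) + " |")
    lines.append("| " + " | ".join("---" for _ in header) + " |")

    for r in results:
        row = [r["scenario"], "PASS" if r["passed"] else "FAIL"]
        for m in metric_names:
            value = r["metrics"].get(m)
            row.append("n/a" if value is None else f"{value:.2f}")
        lines.append("| " + " | ".join(row) + " |")

    # Detail each failure: user prompt, tool call chain, agent response.
    failures = [r for r in results if not r["passed"]]
    if failures:
        lines += ["", "## Failures"]
        for r in failures:
            chain = " -> ".join(r.get("tool_calls", [])) or "(none recorded)"
            lines += [
                "",
                f"### {r['scenario']}",
                f"- Prompt: {r.get('prompt', '(not recorded)')}",
                f"- Tool calls: {chain}",
                f"- Response: {r.get('response', '(not recorded)')}",
            ]
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_report(json.loads(SAMPLE)))
```

Keeping the failure details in a separate section below the table leaves the table compact enough to scan, while the prompt, tool call chain, and response stay together for each failing scenario.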

Quick Start

Run the summarize-eval-results Skill on the latest evaluation reports.

Please help me install this Skill:
Name: summarize-eval-results
Download link: https://github.com/seiggy/lucia-dotnet/archive/main.zip#summarize-eval-results
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
