experiment-analyst

Community

Diagnose why agent experiments succeed or fail.

Authordanicat
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill helps you understand why AI agents succeed or fail in specific experiment runs by turning raw run logs into evidence-backed success determinants and behavioral patterns.

Core Features & Use Cases

  • Evidence-based experiment deconstruction: Extracts performance overview, tool usage breakdown, and top failure signals from a Tenkai experiments SQLite database.
  • Success determinants via tool correlation: Identifies tools that correlate with higher success or failure rates by comparing tool usage across successful vs failed runs.
  • Targeted behavioral deep dives: Reconstructs run workflows (message/tool steps) to compare a winning pattern against a failure loop for specific alternatives.

Quick Start

Run the analysis for experiment ID 12 by executing: python3 agents/tenkai/.gemini/skills/experiment-analyst/scripts/analyze_experiment.py 12

Dependency Matrix

Required Modules

None required

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: experiment-analyst
Download link: https://github.com/danicat/skills/archive/main.zip#experiment-analyst

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.