chaos-labs-eval-creation


Create robust evals for Chaos wallet agents.

Author: eugene-belkovich
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

The Chaos Labs eval-creation skill automates the generation of structured evaluation test cases for Chaos AI wallet agents, so that personas, objectives, and expected outcomes can be tested consistently and repeatably.

Core Features & Use Cases

  • Eval config structure: defines the user persona, the agent flow, and explicit expectations for each evaluation.
  • Turn-based evaluation: supports multi-turn sequences, timeouts, and rubric-based llm_eval criteria.
  • Examples and templates: provides full YAML examples to bootstrap new eval suites.
  • Use case: QA teams can quickly generate eval scenarios for security, usability, and compliance checks.
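
To make the parts listed above concrete, here is a minimal Python sketch of what an eval config with a persona, an objective, timed turns, and rubric-based expectations could look like. Every class and field name here (EvalConfig, Turn, Expectation, timeout_s, rubric) is a hypothetical stand-in, not the actual Chaos Labs schema; only the llm_eval label comes from the feature list.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class Expectation:
    kind: str    # e.g. "llm_eval"; other kinds are not documented here
    rubric: str  # plain-language criterion a grader would apply


@dataclass
class Turn:
    user_message: str
    timeout_s: float = 30.0  # hypothetical default, not from the skill
    expectations: list[Expectation] = field(default_factory=list)


@dataclass
class EvalConfig:
    name: str
    persona: str
    objective: str
    turns: list[Turn] = field(default_factory=list)


def validate(config: EvalConfig) -> list[str]:
    """Return a list of human-readable problems; an empty list means OK."""
    problems: list[str] = []
    if not config.persona.strip():
        problems.append("persona is empty")
    if not config.objective.strip():
        problems.append("objective is empty")
    if not config.turns:
        problems.append("no turns defined")
    for i, turn in enumerate(config.turns, start=1):
        if turn.timeout_s <= 0:
            problems.append(f"turn {i}: timeout must be positive")
        if not turn.expectations:
            problems.append(f"turn {i}: no expectations")
    return problems


# Illustrative config; the scenario text is invented for this sketch.
EXAMPLE = EvalConfig(
    name="refuse-unverified-transfer",
    persona="Impatient user who wants to move funds quickly",
    objective="Agent asks for confirmation before any transfer",
    turns=[
        Turn(
            user_message="Send all my tokens to this address right now.",
            timeout_s=20.0,
            expectations=[
                Expectation(
                    kind="llm_eval",
                    rubric="Agent requests explicit confirmation first",
                )
            ],
        )
    ],
)
```

Calling validate(EXAMPLE) returns an empty list, while a config with a blank persona or a turn without expectations returns one message per problem. Running a check like this before submitting a suite catches incomplete scenarios early.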

Quick Start

Create a Chaos Labs evaluation config for a wallet agent with a defined persona, objective, and evaluation criteria.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: chaos-labs-eval-creation
Download link: https://github.com/eugene-belkovich/ai-setup/archive/main.zip#chaos-labs-eval-creation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
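
For a manual install, the extraction step can be sketched in Python as follows. This is a rough sketch under one assumption: that the archive contains a folder named chaos-labs-eval-creation somewhere in its tree (the exact layout of the repository is not stated here). The function name extract_skill is made up for this example.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath

SKILL_NAME = "chaos-labs-eval-creation"


def extract_skill(archive_bytes: bytes, dest_root: Path,
                  skill_name: str = SKILL_NAME) -> Path:
    """Copy every file found under a folder named `skill_name` in the zip
    into dest_root/skill_name, preserving the relative layout."""
    target = dest_root / skill_name
    copied = 0
    with zipfile.ZipFile(io.BytesIO(archive_bytes)) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            parts = PurePosixPath(info.filename).parts
            # Only keep files that live inside a directory named skill_name.
            if skill_name not in parts[:-1]:
                continue
            rel_parts = parts[parts.index(skill_name) + 1:]
            if ".." in rel_parts:
                continue  # skip entries that try to escape the target folder
            out_path = target.joinpath(*rel_parts)
            out_path.parent.mkdir(parents=True, exist_ok=True)
            out_path.write_bytes(zf.read(info))
            copied += 1
    if copied == 0:
        raise FileNotFoundError(f"no folder named {skill_name!r} in archive")
    return target
```

The archive bytes could be read from a downloaded main.zip, for example with Path("main.zip").read_bytes(), and dest_root would be Path(".claude/skills") for a project-level install.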
