agent-sop-eval
Community
Evaluate agent SOPs with structured feedback.
Author: japurcell
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill evaluates and improves AI agent SOPs (standard operating procedures). It runs structured, end-to-end evaluations that expose gaps in planning, reasoning, and execution, then proposes evidence-based improvements.
Core Features & Use Cases
- End-to-end evaluation planning and execution for AI agents
- Test data generation, execution, and results analysis using Strands Evals SDK
- Actionable feedback and improvement recommendations based on results
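The generate, execute, and analyze cycle listed above can be sketched in plain Python. This is a minimal illustration only: it does not use the Strands Evals SDK, and every name in it (`EvalCase`, `CaseResult`, `run_evaluation`, `summarize`, `toy_agent`) is hypothetical. It assumes an agent can be modeled as a function from a prompt string to an output string, and that a test case passes when the output mentions every expected phrase.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class EvalCase:
    """One evaluation scenario: a prompt plus phrases the output should mention."""
    name: str
    prompt: str
    expected_phrases: list[str]


@dataclass
class CaseResult:
    """Outcome of one case: pass/fail and the phrases that were not found."""
    name: str
    passed: bool
    missing: list[str]


def run_evaluation(agent: Callable[[str], str], cases: list[EvalCase]) -> list[CaseResult]:
    """Run every case through the agent and record which expected phrases are missing."""
    results = []
    for case in cases:
        output = agent(case.prompt).lower()
        missing = [p for p in case.expected_phrases if p.lower() not in output]
        results.append(CaseResult(case.name, not missing, missing))
    return results


def summarize(results: list[CaseResult]) -> dict:
    """Aggregate results into a pass rate and the gaps found in failing cases."""
    total = len(results)
    passed = sum(r.passed for r in results)
    return {
        "pass_rate": passed / total if total else 0.0,
        "gaps": {r.name: r.missing for r in results if not r.passed},
    }


def toy_agent(prompt: str) -> str:
    """Hypothetical stand-in for a real agent; it plans and executes but never verifies."""
    return f"Plan: read the request. Execute: answer '{prompt}'."


cases = [
    EvalCase("has-plan", "Summarize the report", ["plan", "execute"]),
    EvalCase("has-verification", "Deploy the service", ["plan", "verify"]),
]
report = summarize(run_evaluation(toy_agent, cases))
print(report)
```

In this sketch the `gaps` entry plays the role of actionable feedback: it names the failing case and what the output lacked, which points at the part of the SOP to revise.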
Quick Start
Give Claude the path to an agent and ask it to evaluate that agent's SOPs for a specific task.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: agent-sop-eval
Download link: https://github.com/japurcell/skills/archive/main.zip#agent-sop-eval
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
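For a manual install, the same download, extract, and copy steps can be approximated with the Python standard library. This is a hedged sketch, not an official installer: the helper names `install_skill_from_zip` and `download_and_install` are hypothetical, and it assumes the archive contains a directory named `agent-sop-eval` (suggested by the fragment in the download link, but not confirmed here).

```python
import io
import shutil
import tempfile
import urllib.request
import zipfile
from pathlib import Path

ARCHIVE_URL = "https://github.com/japurcell/skills/archive/main.zip"
SKILL_NAME = "agent-sop-eval"


def install_skill_from_zip(zip_bytes: bytes, skill_name: str, skills_dir: Path) -> Path:
    """Extract one skill directory from a zip archive into skills_dir."""
    with tempfile.TemporaryDirectory() as tmp:
        with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
            archive.extractall(tmp)
        # Assumption: the skill is a directory called skill_name inside the archive.
        matches = [p for p in Path(tmp).rglob(skill_name) if p.is_dir()]
        if not matches:
            raise FileNotFoundError(f"No '{skill_name}' directory found in archive")
        # Prefer the shallowest match if the name appears more than once.
        source = min(matches, key=lambda p: len(p.parts))
        target = skills_dir / skill_name
        skills_dir.mkdir(parents=True, exist_ok=True)
        # Copy before the temporary directory is cleaned up.
        shutil.copytree(source, target, dirs_exist_ok=True)
    return target


def download_and_install(skills_dir: Path = Path(".claude/skills")) -> Path:
    """Download the archive named in the listing and install the skill from it."""
    with urllib.request.urlopen(ARCHIVE_URL) as response:
        return install_skill_from_zip(response.read(), SKILL_NAME, skills_dir)
```

Calling `download_and_install()` from the project root would place the skill under `.claude/skills/agent-sop-eval`. Only that one directory is copied, so other skills in the same archive are left out.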