klayoutclaw:e2e-judge
CommunityEnd-to-end agentic tests with automated judging.
Authorcaidish
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Orchestrates automated end-to-end testing of KlayoutClaw by running agentic tasks, performing independent MCP verifications, and judging outcomes with an LLM.
Core Features & Use Cases
- Orchestrates a full E2E test pipeline: task generation, autonomous tool usage, layout verification, and judge verdicts.
- Supports multiple test phases (preflight, layout, geometry, evaluate, hallbar, pipeline, discovery) and Phase 5 autonomous pipelines.
- Provides an auditable transcript, verification results, and structured verdicts for performance benchmarking.
Quick Start
Launch the agentic E2E judge against a live KlayoutClaw MCP server and review the structured verdicts produced by the judge.
Dependency Matrix
Required Modules
None requiredComponents
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: klayoutclaw:e2e-judge Download link: https://github.com/caidish/KlayoutClaw/archive/main.zip#klayoutclaw-e2e-judge Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.