klayoutclaw:e2e-judge

Community

End-to-end agentic tests with automated judging.

Author: caidish
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Orchestrates automated end-to-end testing of KlayoutClaw by running agentic tasks, performing independent MCP verifications, and judging outcomes with an LLM.

Core Features & Use Cases

  • Orchestrates a full E2E test pipeline: task generation, autonomous tool usage, layout verification, and judge verdicts.
  • Supports multiple test phases (preflight, layout, geometry, evaluate, hallbar, pipeline, discovery) and Phase 5 autonomous pipelines.
  • Provides an auditable transcript, verification results, and structured verdicts for performance benchmarking.
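The flow described above (agent run, independent verification, judge verdict, structured record) can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: only the phase names come from the feature list, while `PhaseRecord`, `run_phase`, `summarize`, and the stub callables are hypothetical names. A real run would replace the stubs with the agent, MCP checks against a live KlayoutClaw server, and an LLM judge.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Phase names are taken from the feature list; everything else is hypothetical.
PHASES = ["preflight", "layout", "geometry", "evaluate",
          "hallbar", "pipeline", "discovery"]


@dataclass
class PhaseRecord:
    """One auditable entry: what the agent did, what was verified, the verdict."""
    phase: str
    transcript: List[str]
    verification: Dict[str, bool]
    verdict: str  # "pass" or "fail"
    reason: str = ""


def run_phase(
    phase: str,
    agent: Callable[[str], List[str]],
    verify: Callable[[str], Dict[str, bool]],
    judge: Callable[[List[str], Dict[str, bool]], Tuple[str, str]],
) -> PhaseRecord:
    """Run the agent, verify independently of it, then ask the judge."""
    transcript = agent(phase)
    verification = verify(phase)  # does not rely on the agent's own claims
    verdict, reason = judge(transcript, verification)
    return PhaseRecord(phase, transcript, verification, verdict, reason)


def summarize(records: List[PhaseRecord]) -> Dict[str, object]:
    """Collapse per-phase records into a small benchmark summary."""
    passed = [r.phase for r in records if r.verdict == "pass"]
    failed = [r.phase for r in records if r.verdict != "pass"]
    return {"total": len(records), "passed": passed, "failed": failed}


# Stand-ins for the real agent, MCP verification, and LLM judge (assumptions).
def stub_agent(phase: str) -> List[str]:
    return [f"agent: started {phase}", f"agent: finished {phase}"]


def stub_verify(phase: str) -> Dict[str, bool]:
    # Pretend the geometry check fails so the summary shows both outcomes.
    return {"layout_exists": True, "shapes_match": phase != "geometry"}


def stub_judge(transcript: List[str],
               verification: Dict[str, bool]) -> Tuple[str, str]:
    failing = [name for name, ok in verification.items() if not ok]
    if failing:
        return "fail", "failed checks: " + ", ".join(failing)
    return "pass", "all checks passed"


if __name__ == "__main__":
    records = [run_phase(p, stub_agent, stub_verify, stub_judge) for p in PHASES]
    print(summarize(records))
```

Keeping verification separate from the agent's transcript means the judge weighs evidence the agent did not produce itself, which is what makes the resulting verdicts auditable.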

Quick Start

Launch the agentic E2E judge against a live KlayoutClaw MCP server and review the structured verdicts produced by the judge.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: klayoutclaw:e2e-judge
Download link: https://github.com/caidish/KlayoutClaw/archive/main.zip#klayoutclaw-e2e-judge

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
