operator-eval
Community
Run operator-only eval loops locally, safely.
Author: JetXu-LLM
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Running the hidden, operator-only evaluation loop over local runtime/eval artifacts is difficult to do safely and reproducibly within standard user workflows.
Core Features & Use Cases
- Automates local evaluation tasks by reading, validating, and executing a single operator request from the local surface.
- Supports regression review, replayable evaluation runs, candidate promotion, and baseline freezing while preserving provenance.
- Ensures isolation by keeping evaluation artifacts under runtime/eval and separating runtime activity logs from normal user data.
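The isolation point above can be illustrated with a path-containment check. This is a minimal sketch, not the skill's actual implementation: the function name `resolve_artifact_path` and the example file name are hypothetical, and only the `runtime/eval` directory comes from the feature list.

```python
from pathlib import Path

# Directory named in the feature list; the layout beneath it is an assumption.
EVAL_ROOT = Path("runtime/eval")


def resolve_artifact_path(relative: str, root: Path = EVAL_ROOT) -> Path:
    """Resolve `relative` under `root`; raise ValueError if it would escape `root`.

    Hypothetical helper: it shows one way to keep evaluation artifacts
    confined to a single directory tree.
    """
    root_resolved = root.resolve()
    candidate = (root_resolved / relative).resolve()
    # Path.is_relative_to requires Python 3.9 or newer.
    if not candidate.is_relative_to(root_resolved):
        raise ValueError(f"{relative!r} escapes {root}")
    return candidate
```

A call such as `resolve_artifact_path("runs/latest.json")` returns a path inside `runtime/eval`, while `resolve_artifact_path("../../etc/passwd")` raises `ValueError`.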
Quick Start
Ask the agent to run the operator-eval workflow for the current evaluation request and return results to the main agent.
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: operator-eval
Download link: https://github.com/JetXu-LLM/DocMason/archive/main.zip#operator-eval
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
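For a manual install, the extract-and-copy step could look roughly like the sketch below. It assumes the archive has already been downloaded and that the skill's files sit in a directory named `operator-eval` somewhere inside it. The archive's internal layout is not documented here, so that is an assumption, and `install_skill_from_zip` is a hypothetical helper name.

```python
import zipfile
from pathlib import Path, PurePosixPath


def install_skill_from_zip(zip_path, skill_name, skills_dir=Path(".claude/skills")):
    """Copy files found under any `<skill_name>/` directory in the archive
    into `<skills_dir>/<skill_name>/` and return the list of written paths.

    Assumption: the skill is stored in a directory named after the skill.
    """
    dest_root = Path(skills_dir) / skill_name
    copied = []
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Keep only the path components below the skill directory.
            rel = parts[parts.index(skill_name) + 1:]
            if not rel or ".." in rel:
                continue  # skip empty or suspicious entries
            target = dest_root.joinpath(*rel)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            copied.append(target)
    return copied
```

Calling `install_skill_from_zip("main.zip", "operator-eval")` from the project root would then populate `.claude/skills/operator-eval/`, provided the layout assumption holds.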