Pilot result evaluation
CommunityDecide pass or fail and update wiki
Education & Research#success criteria#research automation#experiment results#wiki updates#pilot evaluation#status lifecycle#metric comparison
Authorduany049
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This skill prevents ambiguous pilot outcomes from lingering by turning noisy pilot runs into a clear verdict and consistently updating the corresponding idea page.
Core Features & Use Cases
- Verdict evaluation: Reads pilot results and log, then applies success-criterion logic to classify outcomes as pass, fail, or inconclusive.
- Wiki state update: Writes back to the idea page fields including pilot_result, failure_reason (when failed), and status transitions to failed when required.
- Persistent reporting: Generates and saves a PILOT_VERDICT_REPORT summarizing metrics, log signals, and next-step guidance.
Quick Start
Run the pilot evaluation for a specific idea slug so the system updates wiki/ideas/{slug}.md and produces a PILOT_VERDICT_REPORT from the pilot artifacts.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: Pilot result evaluation Download link: https://github.com/duany049/skill-runtime-evolution/archive/main.zip#pilot-result-evaluation Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.