Pilot result evaluation

Community

Decide pass or fail and update wiki

Author: duany049
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This skill prevents ambiguous pilot outcomes from lingering by turning noisy pilot runs into a clear verdict and consistently updating the corresponding idea page.

Core Features & Use Cases

  • Verdict evaluation: Reads the pilot results and log, then applies success-criterion logic to classify the outcome as pass, fail, or inconclusive.
  • Wiki state update: Writes the verdict back to the idea page, setting pilot_result and failure_reason (when the pilot failed) and moving status to failed when required.
  • Persistent reporting: Generates and saves a PILOT_VERDICT_REPORT summarizing metrics, log signals, and next-step guidance.
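
The verdict step can be pictured with a minimal sketch. The skill's actual success-criterion logic is not documented here, so everything below is an assumption made for illustration: the function name `classify_pilot`, the single numeric metric compared against a threshold, and the rule that a missing metric or an error signal in the log yields an inconclusive verdict.

```python
from typing import List, Optional


def classify_pilot(metric: Optional[float], threshold: float, log_lines: List[str]) -> str:
    """Return 'pass', 'fail', or 'inconclusive' (hypothetical rule, not the skill's own)."""
    # Assumption: a traceback or ERROR line means the run cannot be judged reliably.
    crashed = any("Traceback" in line or "ERROR" in line for line in log_lines)
    if metric is None or crashed:
        return "inconclusive"
    # Assumption: the success criterion is "metric meets or exceeds the threshold".
    return "pass" if metric >= threshold else "fail"


if __name__ == "__main__":
    print(classify_pilot(0.82, 0.75, ["epoch 1 done"]))
    print(classify_pilot(0.40, 0.75, ["epoch 1 done"]))
    print(classify_pilot(None, 0.75, ["ERROR: out of memory"]))
```

Separating "inconclusive" from "fail" keeps a crashed or incomplete run from being recorded as evidence against the idea.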

Quick Start

Run the pilot evaluation for a specific idea slug so the system updates wiki/ideas/{slug}.md and produces a PILOT_VERDICT_REPORT from the pilot artifacts.
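
To show what updating wiki/ideas/{slug}.md might involve, here is a rough sketch. It assumes the idea page stores its fields as plain `key: value` lines, which the listing does not confirm. The helpers `fields_for_verdict` and `update_idea_fields` are hypothetical names, and the rule that a fail verdict always sets status to failed is a simplification of "when required".

```python
from typing import Dict


def fields_for_verdict(verdict: str, failure_reason: str = "") -> Dict[str, str]:
    """Build the field updates for a verdict (assumed mapping)."""
    fields = {"pilot_result": verdict}
    if verdict == "fail":
        # Assumption: a failed pilot records a reason and moves status to failed.
        fields["failure_reason"] = failure_reason
        fields["status"] = "failed"
    return fields


def update_idea_fields(page_text: str, updates: Dict[str, str]) -> str:
    """Replace existing 'key: value' lines and append any keys not yet present."""
    lines = page_text.splitlines()
    remaining = dict(updates)
    for i, line in enumerate(lines):
        if ":" not in line:
            continue
        key = line.split(":", 1)[0].strip()
        if key in remaining:
            lines[i] = f"{key}: {remaining.pop(key)}"
    lines.extend(f"{key}: {value}" for key, value in remaining.items())
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    page = "title: Example idea\nstatus: piloting\npilot_result: pending\n"
    print(update_idea_fields(page, fields_for_verdict("fail", "metric below threshold")))
```

The function works on text rather than on a file path, so the same logic can be checked without touching the wiki.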

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: Pilot result evaluation
Download link: https://github.com/duany049/skill-runtime-evolution/archive/main.zip#pilot-result-evaluation

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.