experiment-analyze
CommunityAudit experiments and propose corrective triggers
Education & Research#reproducibility#benchmarks#experiment-analysis#failure-modes#statistical-audit#backward-triggers
AuthorZhangHanbo
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill automates post-experiment analysis by reading experiment outputs, running a statistical audit, comparing observed outcomes against pre-registered hypotheses and benchmarks, and synthesizing a finding that surfaces reproducibility and failure-mode issues for human review.
Core Features & Use Cases
- Statistical Audit: Runs scripts/audit_stats.py to compute trials per condition, confidence intervals, effect sizes, variance, and flags statistical insufficiency or missing baselines.
- Mode-Aware Analysis: Supports reproduction, diagnostic, and approach modes to compute per-condition verdicts, reproducibility = {pass|partial|fail}, hypothesis status, and mechanism checks (ablations vs priors).
- Governed Action & Provenance: Appends experiment_analysis and finding records, proposes backward triggers (t4/t5/t7/t8/t14/t15) but requires human confirmation before any state transition.
- Use Case: After an experiment finishes, run this Skill to validate statistical rigor, detect new failure modes, compare to benchmarks, and produce a concise, reviewable finding with recommended next steps.
Quick Start
Run the experiment-analyze skill on the experiment directory to audit statistics, compare outcomes to hypotheses and benchmarks, and produce a finding with proposed backward triggers for human review.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: experiment-analyze Download link: https://github.com/ZhangHanbo/alpha_research/archive/main.zip#experiment-analyze Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.