experiment-run

Community

Make experiments reproducible and auditable.

Author: gyy0592
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This skill ensures that every computational experiment is launched, recorded, and validated so that its results remain reproducible, discoverable, and auditable months later. It prevents lost metrics, hidden hyperparameters, and irreproducible outputs by enforcing config-driven runs, structured output directories, and immediate recording.

Core Features & Use Cases

  • Config-first runs: All runtime parameters live in a single config file so scripts have no hardcoded magic values.
  • Recording negotiation and formats: Interactive negotiation of which scalar, intermediate, and artifact fields to record, with exact schemas for scalars.csv and events.jsonl.
  • Robust run lifecycle: Pre-flight confirmation, code snapshot, run_manifest, Slurm/local job generation, incremental recording with flushes, and post-run validation producing run_checks.json.
  • Use cases: Training jobs on GPU clusters with Slurm, local long-running data processing, repeated evaluation experiments with per-sample logging, and any workflow where checkpoints and metrics must be preserved.
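The incremental recording and post-run validation described above can be sketched roughly as follows. This is a minimal illustration, not the skill's actual implementation: the class name RunRecorder, the function validate_run, the scalars.csv columns (step, name, value), and the keys written to run_checks.json are all assumptions, since the listing names the files but does not show their exact schemas.

```python
import csv
import json
import os
from pathlib import Path


class RunRecorder:
    """Hypothetical recorder: appends to scalars.csv and events.jsonl,
    flushing after every write so a crash loses at most one record."""

    def __init__(self, run_dir):
        self.run_dir = Path(run_dir)
        self.run_dir.mkdir(parents=True, exist_ok=True)
        self._scalars = open(self.run_dir / "scalars.csv", "w", newline="")
        self._events = open(self.run_dir / "events.jsonl", "w")
        self._writer = csv.writer(self._scalars)
        # Assumed column layout; the skill's real schema may differ.
        self._writer.writerow(["step", "name", "value"])
        self._scalars.flush()

    def log_scalar(self, step, name, value):
        self._writer.writerow([step, name, value])
        self._scalars.flush()
        os.fsync(self._scalars.fileno())  # push the row to disk immediately

    def log_event(self, **fields):
        # One JSON object per line, e.g. per-sample evaluation records.
        self._events.write(json.dumps(fields) + "\n")
        self._events.flush()
        os.fsync(self._events.fileno())

    def close(self):
        self._scalars.close()
        self._events.close()


def validate_run(run_dir, expected_scalars):
    """Check that the expected scalar names were recorded and write
    the result to run_checks.json (assumed keys)."""
    run_dir = Path(run_dir)
    with open(run_dir / "scalars.csv", newline="") as f:
        rows = list(csv.DictReader(f))
    seen = {row["name"] for row in rows}
    checks = {
        "scalars_nonempty": len(rows) > 0,
        "missing_scalars": sorted(set(expected_scalars) - seen),
    }
    checks["passed"] = checks["scalars_nonempty"] and not checks["missing_scalars"]
    (run_dir / "run_checks.json").write_text(json.dumps(checks, indent=2))
    return checks
```

Flushing and syncing after each record trades some throughput for durability, which matters most for long cluster jobs that can be preempted or killed mid-run.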

Quick Start

Prepare a config, negotiate what metrics and per-item fields to record, confirm the pre-flight summary, then create the exp/ directory, snapshot the code, and generate and submit the job.
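As a rough sketch of the non-interactive part of this flow, the following Python creates a run directory under exp/, snapshots the code, writes a manifest, and generates a Slurm job script. Everything beyond the exp/ directory and the run_manifest name is an assumption made for illustration: the JSON config format, the "name" config key, the function prepare_run, the train.py entry point, and the file names config.json, run_manifest.json, and job.sh.

```python
import json
import shutil
import time
from pathlib import Path


def prepare_run(config_path, code_dir, exp_root="exp", entrypoint="train.py"):
    """Hypothetical helper: set up one run directory and return its path."""
    with open(config_path) as f:
        config = json.load(f)  # assumes a JSON config file

    name = config.get("name", "run")  # "name" is an assumed config key
    run_id = time.strftime("%Y%m%d-%H%M%S")
    run_dir = Path(exp_root) / f"{name}-{run_id}"
    run_dir.mkdir(parents=True, exist_ok=False)  # never overwrite a prior run

    # Snapshot the code so the exact source can be recovered later.
    # The experiment root is ignored to avoid copying runs into themselves.
    shutil.copytree(
        code_dir,
        run_dir / "code_snapshot",
        ignore=shutil.ignore_patterns("__pycache__", ".git", Path(exp_root).name),
    )
    shutil.copy2(config_path, run_dir / "config.json")

    manifest = {
        "run_id": run_id,
        "created_at": time.strftime("%Y-%m-%dT%H:%M:%S"),
        "config": config,
        "entrypoint": entrypoint,
    }
    (run_dir / "run_manifest.json").write_text(json.dumps(manifest, indent=2))

    # Minimal Slurm batch script; real jobs would add resource requests.
    job_lines = [
        "#!/bin/bash",
        f"#SBATCH --job-name={name}",
        f"#SBATCH --output={run_dir}/slurm-%j.out",
        f"python {run_dir}/code_snapshot/{entrypoint} --config {run_dir}/config.json",
    ]
    (run_dir / "job.sh").write_text("\n".join(job_lines) + "\n")
    return run_dir
```

On a Slurm cluster the generated script would then be submitted with sbatch, for example `sbatch exp/<run>/job.sh`; for a local run the same script can be executed directly with bash.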

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: experiment-run
Download link: https://github.com/gyy0592/claude-config/archive/main.zip#experiment-run

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.