experiment-run
CommunityMake experiments reproducible and auditable.
Authorgyy0592
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Ensures every computational experiment is launched, recorded, and validated so results are reproducible, discoverable, and auditable months later. It prevents lost metrics, hidden hyperparameters, and irreproducible outputs by enforcing config-driven runs, structured output directories, and immediate recording.
Core Features & Use Cases
- Config-first runs: All runtime parameters live in a single config file so scripts have no hardcoded magic values.
- Recording negotiation and formats: Interactive negotiation of scalar, intermediate, and artifact fields with exact scalars.csv and events.jsonl schemas.
- Robust run lifecycle: Pre-flight confirmation, code snapshot, run_manifest, Slurm/local job generation, incremental recording with flushes, and post-run validation producing run_checks.json.
- Use cases: Training jobs on GPU clusters with Slurm, local long-running data processing, repeated evaluation experiments with per-sample logging, and any workflow where checkpoints and metrics must be preserved.
Quick Start
Prepare a config, negotiate what metrics and per-item fields to record, confirm the pre-flight summary, then create the exp/ directory, snapshot the code, and generate and submit the job.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: experiment-run Download link: https://github.com/gyy0592/claude-config/archive/main.zip#experiment-run Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.