ml-intern
Community
Autonomous ML post-training and ablations.
Author: AlexiosBluffMara
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
ml-intern is an open-source AI agent that autonomously researches, writes, and ships ML-related code using the Hugging Face ecosystem. It covers post-training experiments, literature review, dataset discovery, training scripts, and iterative evaluation, and it automates Cortex Gemma-4-e4b fine-tuning, ablation studies, and hyperparameter sweeps without requiring a bespoke harness.
Core Features & Use Cases
- Single-agent loop capped at 300 iterations, with a ToolRouter covering HF docs, papers, datasets, and repos, plus GitHub search, sandbox execution, and MCP server tools
- ContextManager with auto-compaction at 170k tokens
- Doom-loop detector that injects corrective prompts when the agent repeats the same pattern
- Provider-agnostic backends via litellm
- Outputs: final session uploaded to HF Hub with trace, trained artifacts, and model cards
- Use cases: Cortex fine-tunes, overnight ablations, literature synthesis, and dataset discovery
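The loop, compaction, and doom-loop features listed above can be pictured with a minimal sketch. This is not ml-intern's actual code: the function names, the 4-characters-per-token estimate, the three-repeat window, and the corrective message are all assumptions made for illustration. Only the 300-iteration cap and the 170k-token threshold come from the feature list.

```python
from collections import deque
from typing import Callable

MAX_ITERATIONS = 300             # cap stated in the feature list
COMPACTION_THRESHOLD = 170_000   # tokens, stated in the feature list
REPEAT_WINDOW = 3                # hypothetical: identical actions in a row that count as a doom loop


def estimate_tokens(messages: list[str]) -> int:
    """Crude stand-in for a tokenizer: assume about 4 characters per token."""
    return sum(len(m) for m in messages) // 4


def compact(messages: list[str], keep_last: int = 4) -> list[str]:
    """Replace all but the last `keep_last` messages with a one-line placeholder."""
    if len(messages) <= keep_last:
        return messages
    dropped = len(messages) - keep_last
    return [f"[summary of {dropped} earlier messages]"] + messages[-keep_last:]


def is_doom_loop(recent_actions: deque) -> bool:
    """True when the window is full and every action in it is identical."""
    full = len(recent_actions) == recent_actions.maxlen
    return full and len(set(recent_actions)) == 1


def run_agent(step: Callable[[list[str]], str], task: str) -> list[str]:
    """Call `step` until it returns "DONE" or the iteration cap is reached."""
    messages = [task]
    recent = deque(maxlen=REPEAT_WINDOW)
    for _ in range(MAX_ITERATIONS):
        action = step(messages)
        if action == "DONE":
            break
        messages.append(action)
        recent.append(action)
        if is_doom_loop(recent):
            # Inject a corrective prompt and reset the window.
            messages.append("You are repeating the same action; try a different approach.")
            recent.clear()
        if estimate_tokens(messages) > COMPACTION_THRESHOLD:
            messages = compact(messages)
    return messages
```

Here `step` stands in for one model call plus tool execution. A real detector would likely compare tool names and arguments rather than raw strings, and a real compactor would summarize with the model instead of inserting a placeholder.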
Quick Start
Run ml-intern to start an autonomous post-training experiment workflow.
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: ml-intern
Download link: https://github.com/AlexiosBluffMara/mercury/archive/main.zip#ml-intern
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
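For a manual install, the same download-and-extract steps could be scripted roughly as follows. This is a sketch under assumptions: the archive layout is not documented here, so the helper simply looks for any directory named `ml-intern` inside the zip and copies its files. The function names `extract_skill` and `install_from_url` are hypothetical. The `#ml-intern` fragment of the link is dropped because URL fragments are not sent to the server.

```python
import io
import zipfile
from pathlib import Path, PurePosixPath
from urllib.request import urlopen


def extract_skill(zip_bytes: bytes, skill_name: str, skills_dir: Path) -> Path:
    """Copy files found under a directory named `skill_name` into skills_dir/skill_name."""
    target = skills_dir / skill_name
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for info in archive.infolist():
            parts = PurePosixPath(info.filename).parts
            if info.is_dir() or skill_name not in parts:
                continue
            # Keep only the path components below the skill directory.
            relative = parts[parts.index(skill_name) + 1:]
            if not relative or ".." in relative:
                continue  # skip a bare file named like the skill, or unsafe paths
            destination = target.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            destination.write_bytes(archive.read(info))
    return target


def install_from_url(url: str, skill_name: str, skills_dir: Path) -> Path:
    """Download the archive into memory and extract the skill from it."""
    with urlopen(url) as response:
        return extract_skill(response.read(), skill_name, skills_dir)
```

Calling `install_from_url("https://github.com/AlexiosBluffMara/mercury/archive/main.zip", "ml-intern", Path(".claude/skills"))` from the project root would then place the files in `.claude/skills/ml-intern/`. If the archive contains no directory with that name, nothing is written, so it is worth checking that the target folder exists afterwards.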