multiturn-rl
OfficialOrchestrate multi-turn RL training with Tinker.
Authorthinking-machines-lab
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Multi-turn RL training for interactive environments using the Tinker API, enabling agents to reason across turns, manage tool use, and optimize policies in dynamic tasks.
Core Features & Use Cases
- Orchestrates Harbor terminal RL, Search-RAG, and multiplayer RL pipelines for end-to-end training
- Provides guidance on environment types, turn structures, and configuration patterns (HarborTask, HarborDatasetBuilder, AsyncConfig, Config)
- Includes steps to run, test, and extend multi-turn RL experiments across custom environments
Quick Start
Run the multi-turn RL training workflow by executing the Harbor RL, Search-R1, or multiplayer RL trainer script.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: multiturn-rl Download link: https://github.com/thinking-machines-lab/tinker-cookbook/archive/main.zip#multiturn-rl Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.