multiturn-rl

Official

Orchestrate multi-turn RL training with Tinker.

Author: thinking-machines-lab
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This skill guides multi-turn RL training for interactive environments with the Tinker API, so that agents can reason across turns, manage tool use, and optimize policies in dynamic tasks.
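To make "multi-turn" concrete, here is a minimal sketch of one episode loop: the agent sees the conversation so far, acts, and receives an observation and a reward each turn until the episode ends. It does not use the Tinker API or any Harbor class. GuessNumberEnv, Turn, rollout, and bisect_policy are hypothetical names invented for this illustration, and a real setup would replace the hand-written policy with a sampled model.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Tuple


@dataclass
class Turn:
    """One (observation, action, reward) step of an episode."""
    observation: str
    action: str
    reward: float


@dataclass
class GuessNumberEnv:
    """Toy multi-turn environment (hypothetical): guess a hidden integer."""
    target: int
    max_turns: int = 5
    turns_taken: int = field(default=0, init=False)

    def reset(self) -> str:
        self.turns_taken = 0
        return "Guess a number between 0 and 100."

    def step(self, action: str) -> Tuple[str, float, bool]:
        """Return (next observation, reward, done) for one guess."""
        self.turns_taken += 1
        guess = int(action)
        if guess == self.target:
            return "correct", 1.0, True
        hint = "higher" if guess < self.target else "lower"
        # The episode is truncated once the turn budget is spent.
        return hint, 0.0, self.turns_taken >= self.max_turns


def rollout(env: GuessNumberEnv, policy: Callable[[List[str]], str]) -> List[Turn]:
    """Run one episode; the policy sees the full history on every turn."""
    history = [env.reset()]
    trajectory: List[Turn] = []
    done = False
    while not done:
        observation = history[-1]
        action = policy(history)
        next_observation, reward, done = env.step(action)
        trajectory.append(Turn(observation, action, reward))
        history.extend([action, next_observation])
    return trajectory


def bisect_policy(history: List[str]) -> str:
    """Stand-in for a model: replay past hints and bisect the remaining range."""
    low, high = 0, 100
    # history alternates: prompt, action, hint, action, hint, ...
    for i in range(1, len(history) - 1, 2):
        guess, hint = int(history[i]), history[i + 1]
        if hint == "higher":
            low = guess + 1
        elif hint == "lower":
            high = guess - 1
    return str((low + high) // 2)


if __name__ == "__main__":
    for turn in rollout(GuessNumberEnv(target=37), bisect_policy):
        print(turn)
```

With a target of 37, the bisecting policy guesses 50, 24, and then 37, so the episode ends after three turns with a total reward of 1.0. The list of Turn records is the kind of per-episode trajectory an RL trainer would consume, though how Tinker represents trajectories is not described here.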

Core Features & Use Cases

  • Orchestrates Harbor terminal RL, Search-RAG, and multiplayer RL pipelines for end-to-end training
  • Provides guidance on environment types, turn structures, and configuration patterns (HarborTask, HarborDatasetBuilder, AsyncConfig, Config)
  • Includes steps to run, test, and extend multi-turn RL experiments across custom environments

Quick Start

Run the multi-turn RL training workflow by executing the Harbor RL, Search-R1, or multiplayer RL trainer script.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: multiturn-rl
Download link: https://github.com/thinking-machines-lab/tinker-cookbook/archive/main.zip#multiturn-rl

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
