Name: multiturn-rl
Availability: InStock
Author: thinking-machines-lab

System Documentation

What problem does it solve?

Multi-turn RL training for interactive environments using the Tinker API, enabling agents to reason across turns, manage tool use, and optimize policies in dynamic tasks.

Core Features & Use Cases

Orchestrates Harbor terminal RL, Search-RAG, and multiplayer RL pipelines for end-to-end training
Provides guidance on environment types, turn structures, and configuration patterns (HarborTask, HarborDatasetBuilder, AsyncConfig, Config)
Includes steps to run, test, and extend multi-turn RL experiments across custom environments

Quick Start

Run the multi-turn RL training workflow by executing the Harbor RL, Search-R1, or multiplayer RL trainer script.

Please help me install this Skill: Name: multiturn-rl Download link: https://github.com/thinking-machines-lab/tinker-cookbook/archive/main.zip#multiturn-rl Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

multiturn-rl

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper