verl
CommunityVeRL setup and training guide.
Authorzzy1127
Version1.0.0
Installs0
System Documentation
What problem does it solve?
VeRL provides a structured approach to RL experiments by enforcing strict data formatting and ready-to-use templates, simplifying installation, data preparation, and training workflows.
Core Features & Use Cases
- Supports installation guidance, Parquet-based data preparation, and ready-to-use SFT and GRPO templates.
- Enables end-to-end RL training pipelines with reproducible configurations and data templates.
- Use cases include setting up SFT on GSM8K-style data and GRPO-based policy optimization tasks.
Quick Start
Follow the installation steps and begin with the Parquet data templates to start an SFT or GRPO training run.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: verl Download link: https://github.com/zzy1127/PostTrainAgent/archive/main.zip#verl Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.