zedclaw-atropos-environments
CommunityBuild agentic RL envs for Atropos training
Software Engineering#evaluation#reinforcement-learning#reward-functions#wandb logging#Atropos#tool-calling agents
AuthorZardLi1115
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps you build and debug ZedClaw reinforcement learning environments for Atropos so your reward signals and tool-using agent loop work correctly.
Core Features & Use Cases
- Agent-loop environment architecture: Guides you to implement only the required environment hooks while the base env handles multi-turn orchestration and tool resolution.
- Reward verification with tool context: Shows how to score rollouts using the full AgentResult (via message parsing) and optionally validate outcomes inside the sandbox with ToolContext.
- End-to-end evaluation modes: Provides the correct serve, process, and evaluate workflows to run training data generation and agentic benchmarking with tools.
Quick Start
Ask the AI to generate the minimal ZedClaw Atropos environment class that implements setup, get_next_item, format_prompt, compute_reward, evaluate, and wandb_log, then run evaluate mode with your chosen OpenAI-compatible inference configuration.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: zedclaw-atropos-environments Download link: https://github.com/ZardLi1115/zedclaw/archive/main.zip#zedclaw-atropos-environments Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.