nemo-gym-reward-profiling

Community

Profile Nemo Gym rewards across repeats

Authoryo-steven
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Reward profiling in Nemo Gym can be hard to run correctly and interpret when repeats, materialized inputs, and per-rollout results must be joined by task and rollout identity.

Core Features & Use Cases

  • Orchestrates the standard profiling flow: use ng_run to start servers, ng_collect_rollouts to produce rollouts.jsonl and materialized inputs, and ng_reward_profile to generate reward profiling outputs.
  • Explains how identities and artifacts connect: clarifies how _ng_task_index and _ng_rollout_index link *_materialized_inputs.jsonl, rollouts.jsonl, and *_reward_profiling.jsonl.
  • Supports practical profiling scenarios: repeated rollouts for per-task averages and variance, strict vs partial profiling (including allow_partial_rollouts), and inspecting rollout_infos for reward and token metrics.

Quick Start

Run ng_run to start the Nemo Gym services, then ng_collect_rollouts to write rollouts.jsonl and *_materialized_inputs.jsonl, and finally ng_reward_profile to produce *_reward_profiling.jsonl from the materialized inputs and rollout artifacts.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: nemo-gym-reward-profiling
Download link: https://github.com/yo-steven/skills-exploration-20260522/archive/main.zip#nemo-gym-reward-profiling

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.