nemo-gym-reward-profiling
Community
Profile Nemo Gym rewards across repeats
Data & Analytics
#token usage #nemo-gym #reward profiling #jsonl artifacts #rollout analysis #repeated rollouts #task identity
Author: yo-steven
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Reward profiling in Nemo Gym is hard to run and interpret correctly because repeated rollouts, materialized inputs, and per-rollout results must all be joined by task and rollout identity.
Core Features & Use Cases
- Orchestrates the standard profiling flow: use ng_run to start servers, ng_collect_rollouts to produce rollouts.jsonl and materialized inputs, and ng_reward_profile to generate reward profiling outputs.
- Explains how identities and artifacts connect: clarifies how _ng_task_index and _ng_rollout_index link *_materialized_inputs.jsonl, rollouts.jsonl, and *_reward_profiling.jsonl.
- Supports practical profiling scenarios: repeated rollouts for per-task averages and variance, strict vs partial profiling (including allow_partial_rollouts), and inspecting rollout_infos for reward and token metrics.
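The per-task averages and variance mentioned above can be sketched in a few lines of Python. This is a minimal illustration, not the output format of ng_reward_profile: it assumes each rollout record carries _ng_task_index and _ng_rollout_index plus a top-level numeric field named reward, and the reward field name is an assumption that may differ in real artifacts.

```python
import json
from collections import defaultdict
from statistics import mean, pvariance


def parse_jsonl(text):
    """Parse JSONL text into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def profile_by_task(rollouts):
    """Group rollout rewards by _ng_task_index and summarize each task."""
    rewards = defaultdict(list)
    for row in rollouts:
        # "reward" is an assumed field name; adjust it to the real schema.
        rewards[row["_ng_task_index"]].append(float(row["reward"]))
    return {
        task: {"n": len(vals), "mean": mean(vals), "variance": pvariance(vals)}
        for task, vals in sorted(rewards.items())
    }


# Hypothetical sample: two tasks, two repeats each.
sample = "\n".join([
    '{"_ng_task_index": 0, "_ng_rollout_index": 0, "reward": 1.0}',
    '{"_ng_task_index": 0, "_ng_rollout_index": 1, "reward": 0.0}',
    '{"_ng_task_index": 1, "_ng_rollout_index": 0, "reward": 1.0}',
    '{"_ng_task_index": 1, "_ng_rollout_index": 1, "reward": 1.0}',
])
profile = profile_by_task(parse_jsonl(sample))
```

Population variance (pvariance) is used here so that a task with a single repeat still yields a value; swap in statistics.variance if sample variance is preferred and every task has at least two repeats.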
Quick Start
Run ng_run to start the Nemo Gym services, then ng_collect_rollouts to write rollouts.jsonl and *_materialized_inputs.jsonl, and finally ng_reward_profile to produce *_reward_profiling.jsonl from the materialized inputs and rollout artifacts.
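Before the final step, it can help to confirm that the two input artifacts line up. The sketch below compares the identities found in the materialized inputs with those found in the rollouts. It assumes every record in both files carries _ng_task_index and _ng_rollout_index. The helper name find_missing_rollouts and its allow_partial parameter are hypothetical; they only mirror the strict versus partial idea and do not reproduce what ng_reward_profile does internally.

```python
def identity(row):
    """Return the (task, rollout) pair that identifies one rollout."""
    return (row["_ng_task_index"], row["_ng_rollout_index"])


def find_missing_rollouts(materialized_inputs, rollouts, allow_partial=False):
    """List expected identities that have no matching rollout record.

    In strict mode (allow_partial=False) any gap raises ValueError.
    In partial mode the gaps are returned so the caller can decide.
    """
    expected = {identity(r) for r in materialized_inputs}
    seen = {identity(r) for r in rollouts}
    missing = sorted(expected - seen)
    if missing and not allow_partial:
        raise ValueError(f"{len(missing)} expected rollouts are missing: {missing}")
    return missing


# Hypothetical records, as if already parsed from the two JSONL files.
inputs = [
    {"_ng_task_index": 0, "_ng_rollout_index": 0},
    {"_ng_task_index": 0, "_ng_rollout_index": 1},
    {"_ng_task_index": 1, "_ng_rollout_index": 0},
]
rollouts = [
    {"_ng_task_index": 0, "_ng_rollout_index": 0, "reward": 1.0},
    {"_ng_task_index": 1, "_ng_rollout_index": 0, "reward": 0.0},
]
missing = find_missing_rollouts(inputs, rollouts, allow_partial=True)
```

Here the second repeat of task 0 has no rollout record, so partial mode reports it and strict mode would raise.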
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: nemo-gym-reward-profiling
Download link: https://github.com/yo-steven/skills-exploration-20260522/archive/main.zip#nemo-gym-reward-profiling
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.