update-golden-values

Community

Refresh goldens with diff scoring + PR blurb

Authoryo-steven
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill refreshes Megatron-Core golden-value artifacts from a specified GitHub Actions workflow run and produces a reviewer-friendly summary of how much the new goldens differ from the previous ones.

Core Features & Use Cases

  • Golden refresh from CI artifacts: Downloads golden-value JSON artifacts generated by a workflow run and overwrites the local tests/functional_tests/test_cases/**/golden_values_*.json files.
  • Failing-only or full refresh: Supports only-failing mode to pull artifacts from failing/cancelled jobs (or all mode to pull from every job that produced goldens).
  • Per-metric relative-difference scoring: Computes a per-(file, metric) signed avg_rel_diff = mean((old − new) / old) over shared steps and aggregates it into a PR-ready interpretation.
  • PR-ready summary generation: Produces a structured blurb including headline per-metric statistics and a distribution table for |avg_rel_diff|.

Quick Start

Provide a GitHub Actions workflow run ID and choose scope only-failing or all, then run the skill to download goldens, compute relative-diff scoring, and generate a PR-ready summary for the resulting golden-value changes.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: update-golden-values
Download link: https://github.com/yo-steven/skills-exploration-20260522/archive/main.zip#update-golden-values

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.