eval-metrics

Community

Rank-based metrics for ordinal grading.

Author: AKCqhzdy
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Computes evaluation metrics for ordinal predictions: rank correlations (Spearman ρ, Kendall τ), exact-match and adjacent-match percentages, mean scores per level, and a confusion matrix. Results are written to evaluation/{year}/metrics.json. The computation is purely statistical and makes no LLM calls.
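
The rank correlations and match percentages named above can be sketched in plain Python as follows. This is a minimal illustration, not the skill's actual implementation: the function names are hypothetical, grades are assumed to be integer levels, Kendall τ is assumed to be the tie-corrected τ-b variant, and "adjacent match" is assumed to mean a prediction within one level of the ground truth (exact matches included).

```python
from itertools import combinations
from math import sqrt


def average_ranks(values):
    """Assign 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; NaN when either input has zero variance."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return float("nan")
    return cov / sqrt(vx * vy)


def spearman_rho(truth, pred):
    """Spearman rho: Pearson correlation of the tie-averaged ranks."""
    return pearson(average_ranks(truth), average_ranks(pred))


def kendall_tau_b(truth, pred):
    """Kendall tau-b, computed over all pairs (O(n^2), fine for small sets)."""
    concordant = discordant = ties_truth = ties_pred = 0
    for (t1, p1), (t2, p2) in combinations(zip(truth, pred), 2):
        dt, dp = t1 - t2, p1 - p2
        if dt == 0 and dp == 0:
            continue  # tied in both variables: excluded from both denominator terms
        elif dt == 0:
            ties_truth += 1
        elif dp == 0:
            ties_pred += 1
        elif dt * dp > 0:
            concordant += 1
        else:
            discordant += 1
    untied = concordant + discordant
    denom = sqrt((untied + ties_truth) * (untied + ties_pred))
    return (concordant - discordant) / denom if denom else float("nan")


def match_percentages(truth, pred):
    """Return (exact %, adjacent %), where adjacent means |truth - pred| <= 1."""
    n = len(truth)
    exact = sum(t == p for t, p in zip(truth, pred))
    adjacent = sum(abs(t - p) <= 1 for t, p in zip(truth, pred))
    return 100 * exact / n, 100 * adjacent / n
```

For example, with ground truth [1, 2, 3, 4, 5] and predictions [1, 3, 2, 4, 5], two neighbouring grades are swapped: three of five predictions match exactly, and all five fall within one level.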

Core Features & Use Cases

  • Compute rank-based correlations for ordinal grading data.
  • Generate per-level average predicted scores and a confusion matrix for error analysis.
  • Support optional level divisions and resume-safe operation across grading years.
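
As an illustration of the error-analysis features listed above, the sketch below computes per-level mean predicted scores and a confusion matrix. It makes several assumptions that the listing does not confirm: the function names are hypothetical, scores are grouped by ground-truth level, and the confusion matrix uses rows for ground-truth levels and columns for predicted levels.

```python
from collections import defaultdict


def mean_score_per_level(truth_levels, pred_scores):
    """Average predicted score for each ground-truth level (assumed grouping)."""
    buckets = defaultdict(list)
    for level, score in zip(truth_levels, pred_scores):
        buckets[level].append(score)
    return {level: sum(s) / len(s) for level, s in sorted(buckets.items())}


def confusion_matrix(truth_levels, pred_levels, levels):
    """Count matrix: rows are ground-truth levels, columns are predicted levels."""
    index = {level: i for i, level in enumerate(levels)}
    matrix = [[0] * len(levels) for _ in levels]
    for t, p in zip(truth_levels, pred_levels):
        matrix[index[t]][index[p]] += 1
    return matrix
```

Off-diagonal counts show where predictions diverge from the ground truth; counts directly next to the diagonal correspond to adjacent-level errors.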

Quick Start

Run the evaluation workflow to generate metrics from ground truth and final scores for a specific subject/year.
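
The listing does not show the actual command or script interface, so the following is only a sketch of how the final step might persist results to evaluation/{year}/metrics.json. The function name `write_metrics`, its parameters, and the interpretation of "resume-safe" as "skip a year whose metrics file already exists" are all assumptions.

```python
import json
from pathlib import Path


def write_metrics(base_dir, year, metrics, overwrite=False):
    """Write metrics to <base_dir>/evaluation/<year>/metrics.json.

    Returns True if the file was written, False if an existing file was kept.
    """
    out_path = Path(base_dir) / "evaluation" / str(year) / "metrics.json"
    if out_path.exists() and not overwrite:
        return False  # assumed resume behaviour: leave a finished year untouched
    out_path.parent.mkdir(parents=True, exist_ok=True)
    out_path.write_text(
        json.dumps(metrics, indent=2, ensure_ascii=False), encoding="utf-8"
    )
    return True
```

Under this assumption, rerunning the workflow over several years would recompute only the years that have no metrics.json yet, unless `overwrite=True` is passed.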

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy and paste the text below into Claude Code.

Please help me install this Skill:
Name: eval-metrics
Download link: https://github.com/AKCqhzdy/dse-subject-grading/archive/main.zip#eval-metrics

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
