gtkb-benchmarks

Official

Measure GT-KB evidence quality fast

Author: Remaker-Digital
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

GT-KB needs repeatable, read-only measurement benchmarks to quantify how well governance and evidence practices are working over time.

Core Features & Use Cases

  • Run GT-KB read-only benchmarks to compute structured observations (headline scalars plus per-dimension breakdowns) and produce both JSON and human-readable markdown summaries.
  • Assess assertion and evidence quality, including linkage survival across artifacts, recall coverage in change reasoning, attribution presence, and semantic recall of deliberations.
  • Compare benchmark runs by diffing against prior results, to check idempotency and track changes in benchmark values under the same window and commit inputs.
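The features above can be pictured with a minimal sketch. It assumes a hypothetical `Observation` shape (one headline scalar plus a per-dimension breakdown) and hypothetical helper names; none of this is the actual GT-KB schema or API, and the benchmark name and values used with it are made up for illustration.

```python
import json
from dataclasses import asdict, dataclass, field
from typing import Dict, Optional, Tuple


@dataclass
class Observation:
    """Hypothetical result shape; NOT the real GT-KB schema."""
    benchmark: str
    headline: float
    dimensions: Dict[str, float] = field(default_factory=dict)


def to_json(obs: Observation) -> str:
    """Machine-readable summary with stable key order, so reruns diff cleanly."""
    return json.dumps(asdict(obs), indent=2, sort_keys=True)


def to_markdown(obs: Observation) -> str:
    """Human-readable summary: a heading, the headline value, a dimension table."""
    lines = [f"## {obs.benchmark}", "", f"Headline: {obs.headline:.3f}", ""]
    lines += ["| Dimension | Value |", "| --- | --- |"]
    lines += [f"| {name} | {value:.3f} |"
              for name, value in sorted(obs.dimensions.items())]
    return "\n".join(lines)


def diff(prior: Observation, current: Observation
         ) -> Dict[str, Tuple[Optional[float], Optional[float]]]:
    """Map each changed value to (old, new); an empty dict means no change."""
    changes = {}
    if prior.headline != current.headline:
        changes["headline"] = (prior.headline, current.headline)
    for name in sorted(set(prior.dimensions) | set(current.dimensions)):
        old = prior.dimensions.get(name)
        new = current.dimensions.get(name)
        if old != new:
            changes[name] = (old, new)
    return changes
```

Under these assumptions, two runs with the same window and commit inputs would be expected to produce an empty `diff`, which is one way to read the idempotency check; a non-empty result lists exactly which headline or dimension values moved.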

Quick Start

Run the read-only benchmarks over the default one-year window:

    python -m scripts.benchmarks.cli run --all
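The Quick Start command can also be launched from a script, for example in a scheduled job. The sketch below only wraps the command exactly as given above; the helper names are hypothetical, and it assumes the working directory is the repository root so that the `scripts` package is importable.

```python
import subprocess
import sys


def build_run_command():
    """Argument list for the Quick Start command, using the current interpreter."""
    return [sys.executable, "-m", "scripts.benchmarks.cli", "run", "--all"]


def run_benchmarks(repo_root="."):
    """Run the benchmarks from repo_root (assumed repository root); return the exit code."""
    completed = subprocess.run(build_run_command(), cwd=repo_root, check=False)
    return completed.returncode
```

Passing the arguments as a list avoids shell quoting issues, and `check=False` leaves the decision about a non-zero exit code to the caller.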

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: gtkb-benchmarks
Download link: https://github.com/Remaker-Digital/groundtruth-kb/archive/main.zip#gtkb-benchmarks

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
