gtkb-benchmarks
OfficialMeasure GT-KB evidence quality fast
Data & Analytics#recall#benchmarking#governance#assertion quality#evidence quality#run comparison#gt cli
AuthorRemaker-Digital
Version1.0.0
Installs0
System Documentation
What problem does it solve?
GT-KB needs repeatable, read-only measurement benchmarks to quantify how well governance and evidence practices are working over time.
Core Features & Use Cases
- Run GT-KB read-only benchmarks to compute structured observations (headline scalars plus per-dimension breakdowns) and produce both JSON and human-readable markdown summaries.
- Assess assertion and evidence quality such as linkage survival across artifacts, recall coverage in change reasoning, attribution presence, and semantic recall of deliberations.
- Compare benchmark runs by diffing prior results to understand changes in idempotency and benchmark values under consistent window and commit inputs.
Quick Start
Run read-only benchmarks for the default one-year window by executing: python -m scripts.benchmarks.cli run --all
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: gtkb-benchmarks Download link: https://github.com/Remaker-Digital/groundtruth-kb/archive/main.zip#gtkb-benchmarks Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.