generalization-theory
CommunityDiagnose memorization vs signal learning fast.
Education & Research#generalization#grokking#memorization#training dynamics#eNTK#double descent#SNR preconditioning
Authorthistleknot
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Generalization-theory helps you determine why a model’s training loss is improving while test performance stalls or worsens, distinguishing memorization-driven behavior from genuine signal learning.
Core Features & Use Cases
- Signal/noise diagnosis (eNTK partitioning): classifies training dynamics into coherent “signal” directions versus trapped “noise” residuals to explain overfitting, grokking, and double descent.
- Unified interpretation of training phenomena: maps benign overfitting, implicit bias, and grokking to the same empirical partition geometry instead of treating them as unrelated quirks.
- Intervention selection ladder: recommends the lightest effective change among data, architecture, and optimizer surfaces, including an SNR-style preconditioning strategy.
Quick Start
Use generalization-theory to analyze your run checkpoints and telemetry to decide whether to intervene through data cleanup, architectural bias, or an SNR-aware optimizer adjustment.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: generalization-theory Download link: https://github.com/thistleknot/skills/archive/main.zip#generalization-theory Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.