logit-lens
Official
Decode layer predictions across model layers
Data & Analytics
#debugging #visualization #interpretability #transformer #model-interpretation #logit-lens #layer-wise
Author: ndif-team
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
Decodes intermediate layer activations into vocabulary predictions to reveal what the model "thinks" at each processing step, offering insight beyond the final output.
Core Features & Use Cases
- Decode per-layer predictions by applying the final normalization and unembedding to each layer's hidden states, revealing how the predicted token evolves with depth.
- Visualize and compare layer-wise predictions to diagnose information flow and debug model behavior.
- Use for interpretability research, model auditing, and understanding recall dynamics across Transformer layers.
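The decoding step described in the first bullet can be sketched in a few lines. This is a minimal illustration, not the skill's own implementation: it assumes the final normalization is a standard LayerNorm (some models use RMSNorm instead), and the names `layer_norm`, `logit_lens`, `gamma`, `beta`, and `W_U` are hypothetical, with random toy weights standing in for a real model.

```python
import numpy as np

def layer_norm(x, gamma, beta, eps=1e-5):
    # Normalize over the feature dimension, then apply learned scale and shift.
    mean = x.mean(axis=-1, keepdims=True)
    var = x.var(axis=-1, keepdims=True)
    return (x - mean) / np.sqrt(var + eps) * gamma + beta

def logit_lens(hidden, gamma, beta, W_U):
    """Decode one hidden state (d_model,) into a distribution over the vocabulary.

    W_U is the unembedding matrix with shape (d_model, vocab_size).
    """
    logits = layer_norm(hidden, gamma, beta) @ W_U
    logits = logits - logits.max()  # shift for a numerically stable softmax
    probs = np.exp(logits)
    return probs / probs.sum()

# Toy weights (assumption: random values in place of a trained model).
rng = np.random.default_rng(0)
d_model, vocab_size = 8, 5
W_U = rng.normal(size=(d_model, vocab_size))
gamma, beta = np.ones(d_model), np.zeros(d_model)

hidden = rng.normal(size=d_model)  # stands in for an intermediate layer's output
probs = logit_lens(hidden, gamma, beta, W_U)
top_token = int(probs.argmax())    # index of the most likely token at this layer
```

With a real model, `hidden` would be the residual-stream activation captured at a given layer and token position, and `gamma`, `beta`, and `W_U` would come from the model's final norm and output head.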
Quick Start
Prompt the model with a sample input and run a layer-wise trace to display the top token predictions at each layer.
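To give a rough sense of what such a layer-wise trace produces, the sketch below runs the same idea on a toy residual stack. Everything here is an assumption made for illustration: the vocabulary `VOCAB`, the helper `trace_top_tokens`, and the random "layer updates" are hypothetical stand-ins, and a real run would read hidden states from an actual Transformer rather than generating them.

```python
import numpy as np

VOCAB = ["Paris", "London", "the", "a", "city"]  # hypothetical toy vocabulary

def final_norm(x, eps=1e-5):
    # LayerNorm without learned scale/shift (assumption made for brevity).
    return (x - x.mean()) / np.sqrt(x.var() + eps)

def trace_top_tokens(embedding, layer_updates, W_U, k=2):
    """Return (layer_index, top-k tokens) for each layer of a toy residual stack."""
    rows = []
    resid = embedding.copy()
    for i, update in enumerate(layer_updates):
        resid = resid + update              # each layer adds to the residual stream
        logits = final_norm(resid) @ W_U    # decode the intermediate state
        top = np.argsort(logits)[::-1][:k]  # indices of the k largest logits
        rows.append((i, [VOCAB[j] for j in top]))
    return rows

# Random stand-ins for a model's weights and per-layer contributions.
rng = np.random.default_rng(0)
d_model, n_layers = 16, 4
W_U = rng.normal(size=(d_model, len(VOCAB)))
embedding = rng.normal(size=d_model)
layer_updates = rng.normal(size=(n_layers, d_model))

rows = trace_top_tokens(embedding, layer_updates, W_U)
for layer, tokens in rows:
    print(f"layer {layer}: {tokens}")
```

On a trained model, the interesting signal is where in the stack the final answer first appears among the top tokens; with random weights the printed tokens carry no meaning.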
Dependency Matrix
Required Modules
None required
Components
Standard package
Claude Code Installation
Recommended: let Claude install it automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill: Name: logit-lens Download link: https://github.com/ndif-team/skills/archive/main.zip#logit-lens Please download this .zip file, extract it, and install it in the .claude/skills/ directory.