logit-lens

Official

Decode layer predictions across model layers

Authorndif-team
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Decodes intermediate layer activations into vocabulary predictions to reveal what the model "thinks" at each processing step, offering insight beyond the final output.

Core Features & Use Cases

  • Decode per-layer predictions by applying final normalization and unembedding to hidden states to reveal evolving tokens.
  • Visualize and compare layer-wise predictions to diagnose information flow and debugging tasks.
  • Use for interpretability research, model auditing, and understanding recall dynamics across Transformer layers.

Quick Start

Prompt the model with a sample input and run a layer-wise trace to display the top token predictions at each layer.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: logit-lens
Download link: https://github.com/ndif-team/skills/archive/main.zip#logit-lens

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.