pyvene-interventions

Official

Causal interventions for PyTorch models

AuthorOrchestra-Research
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill enables researchers to perform causal interventions on PyTorch models, allowing for a deeper understanding of model behavior and the identification of causal relationships within neural networks.

Core Features & Use Cases

  • Causal Tracing: Pinpoint where specific factual associations are stored in a model.
  • Activation Patching: Test the necessity of specific model components for observed behaviors.
  • Interchange Intervention Training (IIT): Train interventions to discover causal structure.
  • Model Steering: Guide model generation towards desired outputs.
  • Use Case: You want to understand which specific neurons in a large language model are responsible for recalling factual information, like the capital of a country. This Skill allows you to systematically test hypotheses about these causal pathways.

Quick Start

Use the pyvene-interventions skill to perform causal tracing on a GPT-2 model by restoring activations at layer 8, position 5.

Dependency Matrix

Required Modules

pyvenetorchtransformers

Components

scriptsreferences

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: pyvene-interventions
Download link: https://github.com/Orchestra-Research/AI-Research-SKILLs/archive/main.zip#pyvene-interventions

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.