pyvene-interventions
OfficialCausal interventions for PyTorch models
Education & Research#pytorch#causal inference#llm analysis#interpretability#activation patching#causal tracing
AuthorOrchestra-Research
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill enables researchers to perform causal interventions on PyTorch models, allowing for a deeper understanding of model behavior and the identification of causal relationships within neural networks.
Core Features & Use Cases
- Causal Tracing: Pinpoint where specific factual associations are stored in a model.
- Activation Patching: Test the necessity of specific model components for observed behaviors.
- Interchange Intervention Training (IIT): Train interventions to discover causal structure.
- Model Steering: Guide model generation towards desired outputs.
- Use Case: You want to understand which specific neurons in a large language model are responsible for recalling factual information, like the capital of a country. This Skill allows you to systematically test hypotheses about these causal pathways.
Quick Start
Use the pyvene-interventions skill to perform causal tracing on a GPT-2 model by restoring activations at layer 8, position 5.
Dependency Matrix
Required Modules
pyvenetorchtransformers
Components
scriptsreferences
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: pyvene-interventions Download link: https://github.com/Orchestra-Research/AI-Research-SKILLs/archive/main.zip#pyvene-interventions Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.