V-JEPA 2 Self-Supervised Training
CommunityTrain V-JEPA 2 with self-supervised video
Authorsovr610
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Enables end-to-end setup and execution of the V-JEPA 2 self-supervised training workflow, including context/predictor architectures, EMA target encoder management, loss computation, and checkpointing for reproducible experiments.
Core Features & Use Cases
- Self-supervised training orchestration: Seamlessly train V-JEPA 2 models using latent space predictions and EMA targets.
- EMA target management: Integrates an exponentially moving average target encoder to stabilize training and prevent collapse.
- Configurable DROID support: Supports DROID-style fine-tuning with autoregressive rollout and optional normalization of representations.
- Checkpointing & resumption: Handles save/load of all components (encoder, predictor, EMA target, optimizer, scaler) for reliable experiments.
- Use Case: Prototype a JEPA training loop on synthetic data, then scale to real video datasets with progressively larger setups.
Quick Start
Run a minimal training loop with a tiny encoder and predictor to validate the JEPA workflow.
Dependency Matrix
Required Modules
torch
Components
scriptsreferencesassets
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: V-JEPA 2 Self-Supervised Training Download link: https://github.com/sovr610/refffiy/archive/main.zip#v-jepa-2-self-supervised-training Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.