tabpfn-explore

Community

Start Kaggle comps with robust EDA & CV

Authordianaprior
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Enables rapid, systematic exploration and validation of tabular competition datasets so you can detect leakage, distribution shift, and prepare consistent cross-validation splits before any modeling or API calls.

Core Features & Use Cases

  • Competition reconnaissance: document evaluation metric, task type, dataset size, and known pitfalls to guide modeling choices.
  • Exploratory Data Analysis: profile missingness, class balance, high-cardinality categoricals, duplicates, and high correlations that affect model design.
  • Adversarial validation: run train-vs-test classifiers to quantify distribution shift and surface the features driving it.
  • CV scheme and budget checks: define and save reproducible folds (StratifiedKFold, GroupKFold, TimeSeriesSplit) and verify TabPFN API cell budget constraints.
  • Deliverables: cleaned X_train/X_test/y_train DataFrames, a reusable folds object, notes/competition_overview.md, and an issues checklist for leakage, imbalance, and high-cardinality features.

Quick Start

Run tabpfn-explore on your train and test CSVs to generate cleaned DataFrames, a saved folds object, adversarial validation diagnostics, and an API cell budget check.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: tabpfn-explore
Download link: https://github.com/dianaprior/kaggle-competition-agent-skill/archive/main.zip#tabpfn-explore

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.