tabpfn-explore
CommunityStart Kaggle comps with robust EDA & CV
Data & Analytics#cross-validation#eda#data-profiling#kaggle#adversarial-validation#tabpfn#api-budget
Authordianaprior
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Enables rapid, systematic exploration and validation of tabular competition datasets so you can detect leakage, distribution shift, and prepare consistent cross-validation splits before any modeling or API calls.
Core Features & Use Cases
- Competition reconnaissance: document evaluation metric, task type, dataset size, and known pitfalls to guide modeling choices.
- Exploratory Data Analysis: profile missingness, class balance, high-cardinality categoricals, duplicates, and high correlations that affect model design.
- Adversarial validation: run train-vs-test classifiers to quantify distribution shift and surface the features driving it.
- CV scheme and budget checks: define and save reproducible folds (StratifiedKFold, GroupKFold, TimeSeriesSplit) and verify TabPFN API cell budget constraints.
- Deliverables: cleaned X_train/X_test/y_train DataFrames, a reusable folds object, notes/competition_overview.md, and an issues checklist for leakage, imbalance, and high-cardinality features.
Quick Start
Run tabpfn-explore on your train and test CSVs to generate cleaned DataFrames, a saved folds object, adversarial validation diagnostics, and an API cell budget check.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: tabpfn-explore Download link: https://github.com/dianaprior/kaggle-competition-agent-skill/archive/main.zip#tabpfn-explore Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.