datarobot-data-preparation
OfficialUpload, validate, and prep data for DataRobot.
Data & Analytics#data quality#data validation#python sdk#dataset upload#datarobot#schema checks#dataset versioning
Authordatarobot-oss
Version1.0.0
Installs0
System Documentation
What problem does it solve?
DataRobot-ready datasets are hard to prepare because uploads often fail late due to schema issues, formatting problems, or poor data quality that could have been caught earlier.
Core Features & Use Cases
- Dataset Upload: Upload CSV/Parquet files or source data into DataRobot and capture dataset metadata for downstream workflows.
- Data Validation: Validate structure and quality signals such as missing values, schema/type mismatches, and common data issues before model training or predictions.
- Dataset Management & Versioning: List, search, and manage dataset lifecycles, including updating metadata and creating new dataset versions.
- Data Preparation for Training/Predictions: Clean and format data so it matches DataRobot requirements and supports prediction datasets aligned to training structure.
Quick Start
Upload your dataset file and then validate it by instructing the agent to upload the file sales_data.csv as “Sales Data Q4 2024”, validate the resulting dataset, and return any schema or data quality issues.
Dependency Matrix
Required Modules
datarobot
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: datarobot-data-preparation Download link: https://github.com/datarobot-oss/datarobot-agent-skills/archive/main.zip#datarobot-data-preparation Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.