Data Cleaning Pipeline
CommunityTurn messy data into clean, analysis-ready pipelines.
Authorerlebach
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Data cleaning pipelines transform raw, messy data into clean, standardized formats suitable for analysis and modeling through systematic handling of missing values, outliers, and data quality issues.
Core Features & Use Cases
- Missing Value Handling: Imputation and removal strategies to prepare datasets for modeling.
- Outlier Detection & Treatment: Identify and mitigate anomalies to prevent skewed analyses.
- Data Standardization & Validation: Ensure consistent data types and integrity checks across pipelines.
- Use Case: Prepare a retail dataset for ML by washing, imputing, and normalizing features, then validating results before modeling.
Quick Start
Run a minimal end-to-end cleaning pipeline on your dataset to produce a ready-for-analysis dataframe.
Dependency Matrix
Required Modules
None requiredComponents
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: Data Cleaning Pipeline Download link: https://github.com/erlebach/gordon/archive/main.zip#data-cleaning-pipeline Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.