Data Cleaning Pipeline

Community

Turn messy data into clean, analysis-ready pipelines.

Authorerlebach
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Data cleaning pipelines transform raw, messy data into clean, standardized formats suitable for analysis and modeling through systematic handling of missing values, outliers, and data quality issues.

Core Features & Use Cases

  • Missing Value Handling: Imputation and removal strategies to prepare datasets for modeling.
  • Outlier Detection & Treatment: Identify and mitigate anomalies to prevent skewed analyses.
  • Data Standardization & Validation: Ensure consistent data types and integrity checks across pipelines.
  • Use Case: Prepare a retail dataset for ML by washing, imputing, and normalizing features, then validating results before modeling.

Quick Start

Run a minimal end-to-end cleaning pipeline on your dataset to produce a ready-for-analysis dataframe.

Dependency Matrix

Required Modules

None required

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: Data Cleaning Pipeline
Download link: https://github.com/erlebach/gordon/archive/main.zip#data-cleaning-pipeline

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.