data-engineering-and-pipelines
CommunityResilient data pipelines with safety and replay.
AuthorTiepbm
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Organizations struggle to move data reliably between systems while maintaining data quality, observability, and recoverability in the face of schema drift, late-arriving data, and failures.
Core Features & Use Cases
- End-to-end data pipeline design for ETL/ELT, batch, streaming, CDC, and event-driven flows with explicit handling for replay, backfill, and schema evolution.
- Quality, lineage, and recoverability controls across ingestion, validation, transformation, and publication stages.
- Use Case Example: Build a warehouse loading pipeline that ingests source data, validates schema, deduplicates, handles late data, and provides replayable runs with run IDs and check-pointing.
Quick Start
Create a simple end-to-end pipeline that ingests data, validates schema, and writes idempotently to the sink with run-tracking.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: data-engineering-and-pipelines Download link: https://github.com/Tiepbm/software-engineering-agent/archive/main.zip#data-engineering-and-pipelines Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.