data-engineering-and-pipelines

Community

Resilient data pipelines with safety and replay.

Author: Tiepbm
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Organizations struggle to move data reliably between systems while maintaining data quality, observability, and recoverability in the face of schema drift, late-arriving data, and failures.

Core Features & Use Cases

  • End-to-end data pipeline design for ETL/ELT, batch, streaming, CDC, and event-driven flows with explicit handling for replay, backfill, and schema evolution.
  • Quality, lineage, and recoverability controls across ingestion, validation, transformation, and publication stages.
  • Use Case Example: Build a warehouse loading pipeline that ingests source data, validates its schema, deduplicates records, handles late-arriving data, and supports replayable runs through run IDs and checkpointing.
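The deduplication, late-data, and checkpointing steps in the use case above can be sketched as follows. This is a minimal illustration and not the skill's own implementation: the `Checkpoint` class, the `start_run` and `process_batch` functions, the record fields `id` and `event_time`, and the `allowed_lateness` parameter are all hypothetical names chosen for the example.

```python
from dataclasses import dataclass, field
from datetime import datetime, timedelta, timezone
import uuid


@dataclass
class Checkpoint:
    """Progress of one run (hypothetical structure, held in memory here)."""
    run_id: str
    watermark: datetime                      # highest event time accepted so far
    seen_keys: set = field(default_factory=set)


def start_run(initial_watermark):
    """Create a checkpoint with a fresh run ID."""
    return Checkpoint(run_id=str(uuid.uuid4()), watermark=initial_watermark)


def process_batch(records, checkpoint, allowed_lateness):
    """Drop duplicates by key and set aside records older than the lateness bound."""
    accepted, too_late = [], []
    for rec in records:
        if rec["id"] in checkpoint.seen_keys:
            continue                         # duplicate of an already accepted record
        if rec["event_time"] < checkpoint.watermark - allowed_lateness:
            too_late.append(rec)             # route to a side output for later backfill
            continue
        checkpoint.seen_keys.add(rec["id"])
        checkpoint.watermark = max(checkpoint.watermark, rec["event_time"])
        accepted.append(rec)
    return accepted, too_late
```

Because accepted keys are recorded in the checkpoint, feeding the same batch through `process_batch` again with that checkpoint accepts nothing new, which is the property a replay relies on. A real pipeline would persist the checkpoint to durable storage rather than keep it in memory.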

Quick Start

Create a simple end-to-end pipeline that ingests data, validates schema, and writes idempotently to the sink with run-tracking.
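As a rough sketch of what such a pipeline might look like, the example below uses Python's standard `sqlite3` module as a stand-in sink. The schema in `EXPECTED_SCHEMA`, the `sink` and `runs` table layouts, and the function names `validate` and `run_pipeline` are assumptions made for illustration; the skill itself does not prescribe them.

```python
import sqlite3
import uuid

# Hypothetical schema: field name -> required Python type.
EXPECTED_SCHEMA = {"id": int, "name": str, "amount": float}


def validate(record):
    """Return True if the record has exactly the expected fields and types."""
    if set(record) != set(EXPECTED_SCHEMA):
        return False
    return all(isinstance(record[k], t) for k, t in EXPECTED_SCHEMA.items())


def run_pipeline(conn, records, run_id=None):
    """Validate records, write them idempotently, and log the run."""
    run_id = run_id or str(uuid.uuid4())
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sink "
        "(id INTEGER PRIMARY KEY, name TEXT, amount REAL, run_id TEXT)"
    )
    conn.execute(
        "CREATE TABLE IF NOT EXISTS runs "
        "(run_id TEXT PRIMARY KEY, written INTEGER, rejected INTEGER)"
    )
    valid = [r for r in records if validate(r)]
    rejected = len(records) - len(valid)
    with conn:  # one transaction: data and run log commit together or not at all
        conn.executemany(
            # Keyed on the primary key, so a replay overwrites instead of duplicating.
            "INSERT OR REPLACE INTO sink (id, name, amount, run_id) "
            "VALUES (:id, :name, :amount, :run_id)",
            [{**r, "run_id": run_id} for r in valid],
        )
        conn.execute(
            "INSERT OR REPLACE INTO runs (run_id, written, rejected) VALUES (?, ?, ?)",
            (run_id, len(valid), rejected),
        )
    return run_id
```

Calling `run_pipeline(sqlite3.connect(":memory:"), records, run_id="r1")` twice with the same input leaves the sink in the same state as calling it once, since each row is keyed by `id` and each run by `run_id`.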

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: data-engineering-and-pipelines
Download link: https://github.com/Tiepbm/software-engineering-agent/archive/main.zip#data-engineering-and-pipelines

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.