data-engineering-data-pipeline

Community

Build robust data pipelines.

Author: bcastelino
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill provides guidance and best practices for designing, implementing, and optimizing scalable, reliable, and cost-effective data pipelines, covering both batch and streaming workloads.

Core Features & Use Cases

  • Architecture Design: Choose and design ETL/ELT, Lambda, Kappa, or Lakehouse architectures.
  • Implementation: Build ingestion, transformation (dbt, Spark), orchestration (Airflow, Prefect), and storage (Delta Lake, Iceberg) layers.
  • Data Quality & Monitoring: Implement data quality frameworks and set up robust monitoring and cost optimization strategies.
  • Use Case: Design a streaming data pipeline that ingests real-time user activity, processes it, and stores it in Delta Lake for immediate analytics.
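
As an illustration of the data quality item above, here is a minimal sketch of rule-based row checks in plain Python. It is not taken from the Skill itself: the `Check` class, the `run_checks` function, the rule names, and the sample events are all hypothetical, and a real pipeline would more likely rely on a dedicated data quality tool or on tests in the transformation layer.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterable, List

Row = Dict[str, object]


@dataclass(frozen=True)
class Check:
    """A named rule that a single row either passes or fails (hypothetical helper)."""
    name: str
    predicate: Callable[[Row], bool]


def run_checks(rows: Iterable[Row], checks: List[Check]) -> Dict[str, int]:
    """Return a mapping of check name -> number of rows that failed it."""
    failures = {check.name: 0 for check in checks}
    for row in rows:
        for check in checks:
            if not check.predicate(row):
                failures[check.name] += 1
    return failures


# Hypothetical user-activity events; the second and third rows are deliberately bad.
events = [
    {"user_id": "u1", "event": "click", "ts": 1700000000},
    {"user_id": None, "event": "view", "ts": 1700000005},
    {"user_id": "u2", "event": "click", "ts": -1},
]

checks = [
    Check("user_id_not_null", lambda r: r.get("user_id") is not None),
    Check("ts_positive", lambda r: isinstance(r.get("ts"), int) and r["ts"] > 0),
]

report = run_checks(events, checks)
print(report)
```

A failure count per rule, rather than a single pass/fail flag, makes it easier to feed the result into monitoring and to alert only when a threshold is crossed.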

Quick Start

Design a batch data pipeline for processing daily sales orders, including ingestion, transformation with dbt, and storage in Delta Lake.
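
To make the shape of such a pipeline concrete, here is a minimal sketch of the three stages using only the Python standard library. It is an assumption-laden stand-in, not the Skill's output: the CSV layout, the function names `ingest`, `transform`, and `load`, and the in-memory `warehouse` dict are all hypothetical. In a real pipeline the transformation would live in dbt models and the load step would write to a Delta Lake table.

```python
import csv
import io
from collections import defaultdict
from decimal import Decimal
from typing import Dict, List

# Hypothetical raw extract; order 1002 appears twice to simulate a duplicate delivery.
RAW_ORDERS_CSV = """order_id,order_date,amount
1001,2024-01-15,19.99
1002,2024-01-15,5.01
1002,2024-01-15,5.01
1003,2024-01-16,42.00
"""


def ingest(raw_csv: str) -> List[Dict[str, str]]:
    """Ingestion: parse the raw CSV extract into rows of strings."""
    return list(csv.DictReader(io.StringIO(raw_csv)))


def transform(rows: List[Dict[str, str]]) -> Dict[str, Decimal]:
    """Transformation: drop duplicate order_ids, then sum revenue per order date."""
    seen = set()
    totals: Dict[str, Decimal] = defaultdict(Decimal)
    for row in rows:
        if row["order_id"] in seen:
            continue
        seen.add(row["order_id"])
        totals[row["order_date"]] += Decimal(row["amount"])
    return dict(totals)


def load(table: Dict[str, Decimal], daily_totals: Dict[str, Decimal]) -> None:
    """Storage: overwrite each date partition so that reruns do not double count."""
    for order_date, total in daily_totals.items():
        table[order_date] = total


warehouse: Dict[str, Decimal] = {}  # stand-in for a partitioned table
load(warehouse, transform(ingest(RAW_ORDERS_CSV)))
load(warehouse, transform(ingest(RAW_ORDERS_CSV)))  # rerunning the same day is safe
print(warehouse)
```

Overwriting a whole date partition instead of appending is one common way to keep a daily batch job idempotent, so a retried or backfilled run produces the same table state.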

Dependency Matrix

Required Modules

None required

Components

scripts, references, assets

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: data-engineering-data-pipeline
Download link: https://github.com/bcastelino/agent-skills-kit/archive/main.zip#data-engineering-data-pipeline

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
