data-engineering-data-pipeline
Community
Build robust data pipelines.
Author: bcastelino
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides guidance and best practices for designing, implementing, and optimizing scalable, reliable, and cost-effective data pipelines, covering both batch and streaming workloads.
Core Features & Use Cases
- Architecture Design: Choose and design ETL/ELT, Lambda, Kappa, or Lakehouse architectures.
- Implementation: Build ingestion, transformation (dbt, Spark), orchestration (Airflow, Prefect), and storage (Delta Lake, Iceberg) layers.
- Data Quality & Monitoring: Implement data quality frameworks and set up robust monitoring and cost optimization strategies.
- Use Case: Design a streaming data pipeline that ingests real-time user activity, processes it, and stores it in Delta Lake for immediate analytics.
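To make the streaming use case and the data quality feature above more concrete, here is a minimal sketch in plain Python. It is not taken from the Skill itself. The event schema (user_id, event_type, ts), the function names, and the 60-second tumbling window are all assumptions chosen for illustration, and the Delta Lake write is replaced by returning in-memory rows.

```python
from collections import defaultdict
from dataclasses import dataclass

# Hypothetical event schema; the Skill does not prescribe these field names.
REQUIRED_FIELDS = ("user_id", "event_type", "ts")


@dataclass(frozen=True)
class WindowCount:
    window_start: int  # epoch seconds, inclusive start of the tumbling window
    event_type: str
    count: int


def validate(event: dict) -> bool:
    """Data quality gate: required fields are present and ts is a non-negative int."""
    if any(event.get(field) is None for field in REQUIRED_FIELDS):
        return False
    return isinstance(event["ts"], int) and event["ts"] >= 0


def process_micro_batch(events, window_seconds=60):
    """Split events into valid and rejected, then count valid events per window."""
    counts = defaultdict(int)
    rejected = []
    for event in events:
        if not validate(event):
            rejected.append(event)  # keep bad records for inspection
            continue
        # Align the timestamp to the start of its tumbling window.
        window_start = event["ts"] - event["ts"] % window_seconds
        counts[(window_start, event["event_type"])] += 1
    rows = [WindowCount(w, t, c) for (w, t), c in sorted(counts.items())]
    return rows, rejected
```

In a real deployment the returned rows would be appended to a table and the rejected records routed to a quarantine location. Keeping rejected records rather than dropping them silently is what makes the quality check observable to monitoring.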
Quick Start
Design a batch data pipeline for processing daily sales orders, including ingestion, transformation with dbt, and storage in Delta Lake.
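As a rough illustration of the three stages named in this prompt, the sketch below walks a daily sales file through ingestion, transformation, and storage using only the Python standard library. It is an assumption-laden stand-in, not the Skill's output: the column names (order_id, order_date, status, unit_price, quantity) are hypothetical, the transform function plays the role a dbt model would play in SQL, and a plain dict keyed by order_date stands in for a partitioned Delta Lake table.

```python
import csv
import io
from collections import defaultdict
from decimal import Decimal


def ingest(csv_text):
    """Ingestion: parse raw CSV text into a list of row dicts."""
    return list(csv.DictReader(io.StringIO(csv_text)))


def transform(rows):
    """Transformation: drop cancelled orders and compute a total per order."""
    cleaned = []
    for row in rows:
        if row["status"] == "cancelled":  # hypothetical status value
            continue
        cleaned.append({
            "order_id": row["order_id"],
            "order_date": row["order_date"],
            # Decimal avoids float rounding errors on money.
            "total": Decimal(row["unit_price"]) * int(row["quantity"]),
        })
    return cleaned


def load(table, rows):
    """Storage: overwrite each order_date partition so reruns do not duplicate data."""
    partitions = defaultdict(list)
    for row in rows:
        partitions[row["order_date"]].append(row)
    for order_date, part_rows in partitions.items():
        table[order_date] = part_rows  # replace the partition, never append
    return table
```

Overwriting whole partitions makes the daily job idempotent: running load(table, transform(ingest(raw))) twice for the same day leaves the table in the same state as running it once, which simplifies retries in an orchestrator.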
Dependency Matrix
Required Modules
None required
Components
scripts, references, assets
💻 Claude Code Installation
Recommended: Let Claude install the Skill automatically. Copy the text below and paste it into Claude Code.
Please help me install this Skill:
Name: data-engineering-data-pipeline
Download link: https://github.com/bcastelino/agent-skills-kit/archive/main.zip#data-engineering-data-pipeline
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.