trino-airflow-lakehouse-pipelines
CommunityOrchestrate Iceberg ETL with Airflow quality gates
Software Engineering#trino#airflow#medallion architecture#iceberg#data quality gates#lakehouse etl#snapshot maintenance
Authorivanshamaev
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps you reliably orchestrate multi-layer Lakehouse ETL (Bronze→Silver→Gold) on Iceberg tables while ensuring data quality, handling late-arriving data, and keeping table metadata healthy through scheduled maintenance.
Core Features & Use Cases
- Medallion pipeline DAG (Bronze/Silver/Gold): Implements ingestion freshness checks, Silver merge/dedup logic, Gold aggregations, and optimizer-friendly
ANALYZEsteps, all executed via Airflow with Trino operators/hooks. - Iceberg maintenance jobs as Airflow tasks: Schedules compaction and operational housekeeping using Trino SQL for
optimize,expire_snapshots, andremove_orphan_fileswith correct task chaining. - Partition backfill with dynamic task mapping: Generates partition date ranges and backfills partition-by-partition with idempotent deletes and reloads into the Silver layer.
- Late-arriving data handling via watermark table: Tracks last processed watermark per pipeline/table and provides the pattern to query/update watermarks safely.
- Post-load data quality gates: Enforces row-count, freshness, and null-rate checks using Trino queries (patterned with a TrinoHook-based approach), preventing promotion of bad data.
Quick Start
Ask the agent to generate an Airflow DAG for an Iceberg medallion pipeline that runs Trino-based Bronze→Silver merges, executes DQ gates per day, loads Gold aggregates, and schedules daily Iceberg maintenance jobs after completion.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: trino-airflow-lakehouse-pipelines Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#trino-airflow-lakehouse-pipelines Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.