trino-airflow-lakehouse-pipelines

Community

Orchestrate Iceberg ETL with Airflow quality gates

Author: ivanshamaev
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill helps you reliably orchestrate multi-layer Lakehouse ETL (Bronze→Silver→Gold) on Iceberg tables while ensuring data quality, handling late-arriving data, and keeping table metadata healthy through scheduled maintenance.

Core Features & Use Cases

  • Medallion pipeline DAG (Bronze/Silver/Gold): Implements ingestion freshness checks, Silver merge/dedup logic, Gold aggregations, and optimizer-friendly ANALYZE steps, all executed via Airflow with Trino operators/hooks.
  • Iceberg maintenance jobs as Airflow tasks: Schedules compaction and operational housekeeping using Trino SQL for optimize, expire_snapshots, and remove_orphan_files with correct task chaining.
  • Partition backfill with dynamic task mapping: Generates partition date ranges and backfills partition-by-partition with idempotent deletes and reloads into the Silver layer.
  • Late-arriving data handling via watermark table: Tracks the last processed watermark per pipeline and table, and provides a pattern for querying and updating watermarks safely.
  • Post-load data quality gates: Enforces row-count, freshness, and null-rate checks with Trino queries (following a TrinoHook-based pattern), so that bad data is not promoted to the next layer.
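
The backfill and quality-gate features above can be illustrated with a minimal, dependency-free sketch. It is not the Skill's actual output: the table names, the event_date partition column, and the threshold defaults are all hypothetical. In a real DAG the date list would typically feed Airflow dynamic task mapping (.expand), and the SQL strings and metric values would go through or come from Trino.

```python
from dataclasses import dataclass
from datetime import date, timedelta


def partition_dates(start: date, end: date) -> list[date]:
    """Return every daily partition from start to end, inclusive."""
    if end < start:
        raise ValueError("end must not precede start")
    return [start + timedelta(days=i) for i in range((end - start).days + 1)]


def backfill_statements(table: str, source: str, day: date) -> list[str]:
    """Delete-then-insert for one partition, so a rerun does not duplicate rows.

    The event_date column name is an assumption made for illustration.
    """
    d = day.isoformat()
    return [
        f"DELETE FROM {table} WHERE event_date = DATE '{d}'",
        f"INSERT INTO {table} SELECT * FROM {source} WHERE event_date = DATE '{d}'",
    ]


@dataclass(frozen=True)
class GateResult:
    passed: bool
    failures: list[str]


def evaluate_gates(
    row_count: int,
    null_rate: float,
    lag_hours: float,
    min_rows: int = 1,              # hypothetical threshold
    max_null_rate: float = 0.01,    # hypothetical threshold
    max_lag_hours: float = 24.0,    # hypothetical threshold
) -> GateResult:
    """Compare metrics (already fetched from Trino) against thresholds."""
    failures = []
    if row_count < min_rows:
        failures.append(f"row count {row_count} below minimum {min_rows}")
    if null_rate > max_null_rate:
        failures.append(f"null rate {null_rate:.2%} above {max_null_rate:.2%}")
    if lag_hours > max_lag_hours:
        failures.append(f"data is {lag_hours:.1f}h old, limit {max_lag_hours:.1f}h")
    return GateResult(passed=not failures, failures=failures)
```

A task wrapping evaluate_gates could raise an exception when passed is False, which fails the task and keeps downstream Gold loads from running on bad data.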

Quick Start

Ask the agent to generate an Airflow DAG for an Iceberg medallion pipeline that runs Trino-based Bronze→Silver merges, executes DQ gates per day, loads Gold aggregates, and schedules daily Iceberg maintenance jobs after completion.
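
For orientation, the maintenance part of such a request might reduce to an ordered list of Trino statements like the sketch below. The helper name, the table name in the usage note, and the default values are assumptions; the ALTER TABLE ... EXECUTE forms follow the Trino Iceberg connector, where the catalog may enforce a minimum retention period.

```python
def maintenance_statements(
    table: str,
    retention: str = "7d",                 # assumed default, adjust per catalog policy
    file_size_threshold: str = "128MB",    # assumed default
) -> list[str]:
    """Build Iceberg maintenance SQL in the order the tasks would be chained.

    Compaction runs first, then old snapshots are expired, and only then are
    orphan files removed.
    """
    return [
        f"ALTER TABLE {table} EXECUTE optimize(file_size_threshold => '{file_size_threshold}')",
        f"ALTER TABLE {table} EXECUTE expire_snapshots(retention_threshold => '{retention}')",
        f"ALTER TABLE {table} EXECUTE remove_orphan_files(retention_threshold => '{retention}')",
    ]
```

Calling maintenance_statements("iceberg.silver.events") yields three statements that could each become one Airflow task, chained in list order after the Gold load completes.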

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: trino-airflow-lakehouse-pipelines
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#trino-airflow-lakehouse-pipelines

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
