airflow-starrocks-cdc-orchestrator

Community

Run safe StarRocks CDC with guardrails

Authorivanshamaev
Version1.0.0
Installs0

System Documentation

What problem does it solve?

It solves the operational challenge of keeping StarRocks incrementally in sync with upstream CDC streams while preventing data loss during schema changes and handling load failures safely.

Core Features & Use Cases

  • Watermark-based incremental sync: Computes a StarRocks watermark and loads only the delta from the source on a schedule.
  • Routine Load health monitoring: Checks Kafka Routine Load state, resumes NEED_SCHEDULE jobs, and surfaces PAUSED/CANCELLED conditions.
  • Flink CDC job submission & control: Starts Flink CDC jobs via REST, waits for RUNNING/terminal states, and supports cancellation for safe transitions.
  • DLQ reprocessing for parse failures: Reads DLQ messages, applies fix transformations, and reloads corrected events.
  • Schema change coordination: Pauses Routine Load, applies ALTER, restarts Debezium, then resumes ingestion to reduce downtime and inconsistencies.

Quick Start

Ask the agent to generate an Airflow deployment plan that wires this skill into your StarRocks CDC workflow with the required DAG schedules and placeholders for your Airflow connections (StarRocks, source DB, Kafka, and Flink REST API).

Dependency Matrix

Required Modules

requestsconfluent-kafka

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: airflow-starrocks-cdc-orchestrator
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#airflow-starrocks-cdc-orchestrator

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.