airflow-starrocks-cdc-orchestrator
Community
Run safe StarRocks CDC with guardrails
Software Engineering
#airflow, #starrocks, #routine load, #cdc orchestration, #flink rest api, #dlq reprocessing, #schema change management
Author: ivanshamaev
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
It keeps StarRocks incrementally in sync with upstream CDC streams, prevents data loss during schema changes, and handles load failures safely.
Core Features & Use Cases
- Watermark-based incremental sync: Computes a StarRocks watermark and loads only the delta from the source on a schedule.
- Routine Load health monitoring: Checks Kafka Routine Load state, resumes NEED_SCHEDULE jobs, and surfaces PAUSED/CANCELLED conditions.
- Flink CDC job submission & control: Starts Flink CDC jobs via REST, waits for RUNNING/terminal states, and supports cancellation for safe transitions.
- DLQ reprocessing for parse failures: Reads DLQ messages, applies fix transformations, and reloads corrected events.
- Schema change coordination: Pauses Routine Load, applies ALTER, restarts Debezium, then resumes ingestion to reduce downtime and inconsistencies.
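The watermark-based sync in the first bullet can be sketched roughly as follows. This is a minimal illustration and not the skill's actual code: the function names, the `updated_at` column, the table names, and the `%s` placeholder style (as used by common MySQL-protocol DB-API drivers) are all assumptions made for the example.

```python
from typing import Optional, Tuple


def watermark_query(target_table: str, watermark_column: str) -> str:
    """Build the SQL that reads the current high-water mark from the target.

    Hypothetical helper: the skill's real query may differ.
    """
    return f"SELECT MAX({watermark_column}) FROM {target_table}"


def delta_query(
    source_table: str,
    watermark_column: str,
    watermark: Optional[str],
) -> Tuple[str, tuple]:
    """Build the source-side query that selects only rows newer than the watermark.

    Returns the SQL text plus its bind parameters. When the target is empty
    (watermark is None), the query falls back to a full initial load.
    """
    base = f"SELECT * FROM {source_table}"
    if watermark is None:
        # No watermark yet: load everything once.
        return f"{base} ORDER BY {watermark_column}", ()
    # Bind the watermark as a parameter instead of formatting it into the SQL.
    return (
        f"{base} WHERE {watermark_column} > %s ORDER BY {watermark_column}",
        (watermark,),
    )


if __name__ == "__main__":
    # Assumed table and column names, for illustration only.
    print(watermark_query("dwh.orders", "updated_at"))
    sql, params = delta_query("shop.orders", "updated_at", "2024-01-01 00:00:00")
    print(sql, params)
```

In an Airflow DAG, one task would run the first query against StarRocks and a downstream task would run the second against the source. Note that a strict `>` comparison can skip rows that share the watermark's exact timestamp, so a real pipeline may prefer `>=` combined with an idempotent (primary key) load.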
Quick Start
Ask the agent to generate an Airflow deployment plan that wires this skill into your StarRocks CDC workflow with the required DAG schedules and placeholders for your Airflow connections (StarRocks, source DB, Kafka, and Flink REST API).
Dependency Matrix
Required Modules
- requests
- confluent-kafka
Components
Standard package
Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: airflow-starrocks-cdc-orchestrator
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#airflow-starrocks-cdc-orchestrator
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.