starrocks-broker-load
Community
Load lake files into StarRocks reliably
Category: Data & Analytics
Tags: #starrocks, #airflow integration, #broker load, #bulk etl, #data lake ingestion, #s3 parquet, #load monitoring
Author: ivanshamaev
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill bulk-loads large datasets from object storage or HDFS into StarRocks, with repeatable, production-friendly control over file formats, authentication, and load monitoring.
Core Features & Use Cases
- Bulk Broker Load into StarRocks: Perform LOAD LABEL from S3/HDFS/GCS/Azure/MinIO sources for large batch ETL without tying up the caller synchronously.
- Multi-format ingestion: Load CSV, Parquet, and ORC with schema mapping expressions and format-specific configuration.
- Operational safety & observability: Track progress via SHOW LOAD, cancel jobs with CANCEL LOAD, and tune parallelism and timeouts for predictable throughput.
- Use Case: Daily/hourly landing-zone batch ETL where partitioned lake files (e.g., dt=YYYY-MM-DD/*.parquet) must be ingested into one or multiple StarRocks tables and then followed by ANALYZE for fresh optimizer stats.
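Because Broker Load runs asynchronously, the caller typically polls SHOW LOAD until the job reaches a final state. The following is a minimal Python sketch of that loop, not the Skill's own implementation. The helper names (`show_load_sql`, `cancel_load_sql`, `wait_for_load`) are hypothetical, and the set of terminal states (`FINISHED`, `CANCELLED`) is an assumption to verify against the SHOW LOAD documentation for your StarRocks version. The database access itself is left to a caller-supplied `fetch_state` function so the sketch does not assume any particular driver.

```python
import time
from typing import Callable

# Assumed terminal values of the State column returned by SHOW LOAD.
TERMINAL_STATES = {"FINISHED", "CANCELLED"}


def show_load_sql(database: str, label: str) -> str:
    """Statement that reports the status of one labelled load job."""
    return f'SHOW LOAD FROM {database} WHERE LABEL = "{label}";'


def cancel_load_sql(database: str, label: str) -> str:
    """Statement that cancels one labelled load job."""
    return f'CANCEL LOAD FROM {database} WHERE LABEL = "{label}";'


def wait_for_load(
    fetch_state: Callable[[], str],
    poll_s: float = 10.0,
    timeout_s: float = 3600.0,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll fetch_state() until it returns a terminal state or the deadline passes.

    fetch_state is expected to run the SHOW LOAD statement and return the
    State column of the matching row (hypothetical contract for this sketch).
    """
    deadline = time.monotonic() + timeout_s
    while True:
        state = fetch_state()
        if state in TERMINAL_STATES:
            return state
        if time.monotonic() >= deadline:
            raise TimeoutError(f"load still in state {state!r} after {timeout_s}s")
        sleep(poll_s)
```

On a `TimeoutError`, a caller such as an Airflow task could issue the statement from `cancel_load_sql` before failing, so that a stalled job does not keep holding resources.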
Quick Start
Use the starrocks-broker-load Skill to load Parquet partition files from S3 into a target table by providing the correct LOAD LABEL statement, broker authentication, and PROPERTIES such as timeout for an async batch run.
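As an illustration of the kind of statement described above, here is a minimal Python sketch that renders a LOAD LABEL statement for Parquet files on S3. It is a sketch under stated assumptions rather than the Skill's actual output: the function name `build_broker_load_sql`, the database, table, bucket, and label names are hypothetical, and the authentication property key (`aws.s3.region`) should be checked against the StarRocks documentation for your version and storage system. Inputs are interpolated without escaping, so the sketch assumes trusted values only.

```python
def _render_props(props: dict) -> str:
    """Render a dict as comma-separated "key" = "value" lines."""
    return ",\n".join(f'    "{key}" = "{value}"' for key, value in props.items())


def build_broker_load_sql(
    database: str,
    label: str,
    source_uri: str,
    table: str,
    file_format: str,
    auth_props: dict,
    timeout_s: int = 3600,
) -> str:
    """Build a single-source Broker Load statement as a string.

    All arguments are assumed to be trusted: nothing is quoted or escaped.
    """
    return (
        f"LOAD LABEL {database}.{label}\n"
        "(\n"
        f'    DATA INFILE("{source_uri}")\n'
        f"    INTO TABLE {table}\n"
        f'    FORMAT AS "{file_format}"\n'
        ")\n"
        "WITH BROKER\n"
        "(\n"
        f"{_render_props(auth_props)}\n"
        ")\n"
        "PROPERTIES\n"
        "(\n"
        f'    "timeout" = "{timeout_s}"\n'
        ");"
    )


# Hypothetical example: one daily partition of an "orders" table.
example_sql = build_broker_load_sql(
    database="lake",
    label="orders_20240101",
    source_uri="s3a://example-bucket/orders/dt=2024-01-01/*.parquet",
    table="orders",
    file_format="parquet",
    auth_props={"aws.s3.region": "us-east-1"},  # add credentials as required
    timeout_s=1800,
)
print(example_sql)
```

Deriving the label from the partition date, as in the example, keeps reruns of the same partition identifiable by label.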
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Copy and paste the text below into Claude Code.
Please help me install this Skill:
Name: starrocks-broker-load
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#starrocks-broker-load
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.