starrocks-files-ingestion
CommunityIngest files into StarRocks via FILES() and Iceberg catalogs
Data & Analytics#partition pruning#starrocks#files()#iceberg catalog#s3 ingestion#hdfs csv#etl patterns
Authorivanshamaev
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill solves the challenge of loading and querying data stored in external files (S3/HDFS) without first creating and managing explicit tables, while also enabling efficient integration with Iceberg external catalogs in StarRocks.
Core Features & Use Cases
- FILEs() on-demand querying: Read Parquet/ORC/CSV directly from paths (S3/HDFS) for quick exploration or lightweight ingestion.
- External Iceberg catalog workflows: Create Iceberg external catalogs (Hive Metastore, AWS Glue, REST, MinIO+HMS), query them, and move data into StarRocks internal tables.
- Partition-aware performance: Use partition filter pushdown patterns with
EXPLAINto avoid scanning unnecessary partitions and to speed up ETL. - Schema and DDL patterns: Handle schema evolution and use safe DDL patterns like
SHOW CREATE CATALOG/SHOW CREATE TABLEafter CTAS.
Quick Start
Use the starrocks-files-ingestion skill to query Parquet files on S3 with an inferred schema using StarRocks FILES() without creating a table first.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: starrocks-files-ingestion Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#starrocks-files-ingestion Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.