starrocks-files-ingestion

Community

Ingest files into StarRocks via FILES() and Iceberg catalogs

Author: ivanshamaev
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This Skill solves the challenge of loading and querying data stored in external files (S3/HDFS) without first creating and managing explicit tables, while also enabling efficient integration with Iceberg external catalogs in StarRocks.

Core Features & Use Cases

  • FILES() on-demand querying: Read Parquet/ORC/CSV directly from paths (S3/HDFS) for quick exploration or lightweight ingestion.
  • External Iceberg catalog workflows: Create Iceberg external catalogs (Hive Metastore, AWS Glue, REST, MinIO+HMS), query them, and move data into StarRocks internal tables.
  • Partition-aware performance: Use partition filter pushdown patterns with EXPLAIN to avoid scanning unnecessary partitions and to speed up ETL.
  • Schema and DDL patterns: Handle schema evolution and use safe DDL patterns like SHOW CREATE CATALOG / SHOW CREATE TABLE after CTAS.
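The Iceberg catalog, partition-filter, and DDL patterns listed above can be sketched as a handful of SQL statements. The Python below only renders the SQL text; it is a minimal illustration, not the Skill's own implementation. The catalog name `iceberg_hms`, the metastore URI, the `sales.orders` table, the `order_date` partition column, and the target table `analytics.orders_local` are all hypothetical, and a Hive Metastore backed catalog is assumed (Glue or REST catalogs take different properties).

```python
def create_iceberg_catalog(name, metastore_uri):
    """Render CREATE EXTERNAL CATALOG for an Iceberg catalog on a Hive Metastore."""
    return (
        f"CREATE EXTERNAL CATALOG {name}\n"
        "PROPERTIES (\n"
        '    "type" = "iceberg",\n'
        '    "iceberg.catalog.type" = "hive",\n'
        f'    "hive.metastore.uris" = "{metastore_uri}"\n'
        ");"
    )


def partition_filter(source, partition_column, value):
    """Render a SELECT that filters on the partition column so it can be pruned."""
    return f"SELECT * FROM {source}\nWHERE {partition_column} = '{value}'"


def explain(select_sql):
    """Prefix a query with EXPLAIN to inspect the plan before running it."""
    return f"EXPLAIN {select_sql};"


def ctas(target, select_sql):
    """Render CREATE TABLE ... AS SELECT into a StarRocks internal table."""
    return f"CREATE TABLE {target} AS\n{select_sql};"


if __name__ == "__main__":
    # Hypothetical names throughout; adjust to the actual environment.
    source = "iceberg_hms.sales.orders"
    query = partition_filter(source, "order_date", "2024-01-01")
    statements = [
        create_iceberg_catalog("iceberg_hms", "thrift://metastore.example.internal:9083"),
        "SHOW CREATE CATALOG iceberg_hms;",
        explain(query),  # check the plan for partition pruning first
        ctas("analytics.orders_local", query),
        "SHOW CREATE TABLE analytics.orders_local;",  # review the DDL that CTAS produced
    ]
    print("\n\n".join(statements))
```

Running EXPLAIN before the CTAS is the point of the ordering: if the plan shows the whole table being scanned, the filter is probably not on the partition column and should be fixed before any data is moved.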

Quick Start

Use the starrocks-files-ingestion skill to query Parquet files on S3 with an inferred schema using StarRocks FILES() without creating a table first.
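As a rough idea of the kind of statement this prompt leads to, the sketch below renders a SELECT over FILES() as SQL text. It is an illustration under stated assumptions, not the Skill's actual output: the bucket path and region are hypothetical, credentials are deliberately left out, and the helper names `quote` and `files_query` are made up for this example. Sending the text to StarRocks (for instance through a MySQL-protocol client) is not shown.

```python
def quote(value):
    """Wrap a value in double quotes, escaping embedded backslashes and quotes."""
    text = str(value).replace("\\", "\\\\").replace('"', '\\"')
    return f'"{text}"'


def files_query(path, file_format, properties=None, limit=10):
    """Render a SELECT over StarRocks FILES() as SQL text.

    "path" and "format" are the core FILES() properties; anything in
    `properties` (for example storage credentials) is appended as-is.
    """
    options = {"path": path, "format": file_format}
    options.update(properties or {})
    body = ",\n".join(f"    {quote(k)} = {quote(v)}" for k, v in options.items())
    return f"SELECT *\nFROM FILES(\n{body}\n)\nLIMIT {int(limit)};"


if __name__ == "__main__":
    # Hypothetical bucket and region; real use also needs access credentials.
    print(files_query(
        "s3://example-bucket/events/*.parquet",
        "parquet",
        {"aws.s3.region": "us-east-1"},
    ))
```

The LIMIT keeps the first exploratory query cheap; the schema is inferred from the Parquet files, so no table has to exist beforehand.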

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: starrocks-files-ingestion
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#starrocks-files-ingestion

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.