implement-comet-expression

Community

Add Spark expressions to Comet faster

Authorsp-202
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Implementing a new Spark expression in DataFusion Comet is complex because it requires matching Spark’s exact semantics, wiring the serde and native execution layers, and proving correctness against SQL tests and an audit comparison to multiple Spark versions.

Core Features & Use Cases

  • Spark-first semantics study: Locates the canonical Spark master implementation and its tests to understand input types, null handling, ANSI branches, and guards.
  • Upstream reuse when possible: Checks the datafusion-spark crate for an existing implementation to avoid re-implementing behavior that already matches Spark.
  • Full Comet wiring workflow: Guides adding the Scala serde, registering the protobuf mapping, creating the native physical function, and validating via Comet SQL tests plus an audit-driven iteration loop.
  • Use Case: You want Comet to support a missing Spark SQL expression end-to-end so Spark SQL/DataFrame queries can run with Comet acceleration without semantic drift.

Quick Start

Use the implement-comet-expression skill with the target Spark expression name you want to add, so it produces a step-by-step plan from Spark source study through serde/native wiring, SQL test creation, and audit-driven fixes.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: implement-comet-expression
Download link: https://github.com/sp-202/datafusion-comet-geo/archive/main.zip#implement-comet-expression

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.