wire-datafusion-function
CommunityWire DataFusion functions into Comet
System Documentation
What problem does it solve?
It solves the challenge of matching Spark expression semantics to Comet by wiring an existing DataFusion or datafusion-spark function into the Comet Scala serde layer and (when needed) the native UDF registration path.
Core Features & Use Cases
- Semantics-matched wiring patterns: Selects the lightest correct integration approach (passthrough, register + passthrough, or custom serde) based on Spark contract requirements.
- Correct Spark-to-Comet behavior: Ensures return types, null handling, ANSI error paths, type restrictions, and foldable-only argument rules are preserved.
- Testable integration: Adds Spark SQL file tests that exercise validity bitmaps, edge cases (NaN, infinities, boundary values), and fallback behavior when serde returns None.
- Documentation updates: Updates the expressions user guide and support matrix while relying on generated compatibility pages for per-expression details.
Quick Start
Use wire-datafusion-function to integrate the existing upstream $ARGUMENTS function into Comet by following the wiring-pattern decision, applying the Scala serde wiring, registering the UDF only when required, and adding the corresponding SQL file tests for the directly supported input types.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: wire-datafusion-function Download link: https://github.com/sp-202/datafusion-comet-geo/archive/main.zip#wire-datafusion-function Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.