spark-native-bronze
Community
Generate Databricks Bronze data at scale.
Software Engineering
#data engineering, #synthetic data, #spark, #databricks, #delta lake, #bronze table, #interview demo
Author: slysik
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill provides a standardized, scalable, and efficient method for generating synthetic data and creating Bronze Delta tables in Databricks, replacing inefficient manual or library-based approaches.
Core Features & Use Cases
- Scalable Synthetic Data Generation: Creates large datasets (100K to 1M+ rows) using spark.range() for distributed performance.
- Canonical Bronze Pattern: Enforces best practices for Bronze table creation, including metadata columns and direct Delta Lake writes.
- Interview Demo Ready: Optimized for speed and clarity, making it ideal for demonstrating data engineering patterns in interviews.
- Use Case: Generate a 100,000-row retail transaction dataset for a Databricks interview demo, ensuring it's ready for downstream Silver and Gold transformations.
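The pattern described above (spark.range() for distributed row generation, metadata columns, and a direct Delta write) can be sketched roughly as follows. This is a minimal illustration, not the Skill's actual implementation: the column names, the metadata fields (_ingested_at, _source_system), the table name, and both helper functions are hypothetical, and it assumes a Databricks notebook or job where a SparkSession named spark already exists and Delta Lake is available.

```python
def bronze_select_exprs(n_stores=50, source="synthetic_generator"):
    """Return Spark SQL expressions that derive columns from the `id`
    column produced by spark.range(). All column names are illustrative."""
    return [
        "id AS transaction_id",
        # Spread rows across a fixed number of hypothetical stores.
        f"concat('store_', cast(pmod(id, {n_stores}) AS string)) AS store_id",
        # Seeded pseudo-random amount between 0 and 500.
        "round(rand(42) * 500, 2) AS amount",
        # Cycle transaction dates over one year starting 2024-01-01.
        "date_add(DATE'2024-01-01', cast(pmod(id, 365) AS int)) AS txn_date",
        # Hypothetical Bronze metadata columns.
        "current_timestamp() AS _ingested_at",
        f"'{source}' AS _source_system",
    ]


def write_bronze(spark, n_rows=100_000, table="bronze_retail_transactions"):
    """Generate n_rows synthetic rows on the cluster and write them
    straight to a Delta table (assumed table name)."""
    df = spark.range(n_rows).selectExpr(*bronze_select_exprs())
    df.write.format("delta").mode("overwrite").saveAsTable(table)
    return df
```

In a Databricks notebook this would be called as write_bronze(spark). Because every column is derived from the id column with Spark SQL expressions, the rows are produced on the executors rather than built on the driver, which is the scaling property the first bullet refers to.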
Quick Start
Use the spark-native-bronze skill to generate 100,000 synthetic events for the retail domain.
Dependency Matrix
Required Modules
None required
Components
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: spark-native-bronze
Download link: https://github.com/slysik/databricks-claude-coding/archive/main.zip#spark-native-bronze
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.