docker-data-environments
CommunityBuild secure Docker stacks for data tools
Software Engineering#data engineering#docker#docker compose#airflow#spark#buildkit#multi-stage dockerfile
Authorivanshamaev
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Provides production-minded, reproducible Docker builds and local containerized stacks for data engineering tools, reducing CI/CD build time, configuration drift, and insecure image practices.
Core Features & Use Cases
- Multi-stage Dockerfiles for data tools: Efficient dbt, Spark, and Airflow images with clear build vs runtime separation.
- Fast CI/CD via layer caching and BuildKit: Uses cache mounts for pip/apt to speed iterative builds.
- Private registry and multi-platform delivery: Supports docker buildx for amd64/arm64 and pushing to ghcr.io or Harbor.
- Local data engineering environment with Docker Compose: Includes a ready-to-run stack (Spark, Airflow, Kafka, MinIO, Postgres) for integration testing.
- Build-time secrets and security hardening: Uses BuildKit secrets for private PyPI and recommends non-root, slim, read-only filesystem patterns.
Quick Start
Ask your agent to generate a multi-stage Dockerfile for your dbt project and a compose.yaml that brings up Spark + Airflow + Kafka + MinIO + Postgres locally with secure BuildKit secret usage.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: docker-data-environments Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#docker-data-environments Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.