docker-data-environments

Community

Build secure Docker stacks for data tools

Authorivanshamaev
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Provides production-minded, reproducible Docker builds and local containerized stacks for data engineering tools, reducing CI/CD build time, configuration drift, and insecure image practices.

Core Features & Use Cases

  • Multi-stage Dockerfiles for data tools: Efficient dbt, Spark, and Airflow images with clear build vs runtime separation.
  • Fast CI/CD via layer caching and BuildKit: Uses cache mounts for pip/apt to speed iterative builds.
  • Private registry and multi-platform delivery: Supports docker buildx for amd64/arm64 and pushing to ghcr.io or Harbor.
  • Local data engineering environment with Docker Compose: Includes a ready-to-run stack (Spark, Airflow, Kafka, MinIO, Postgres) for integration testing.
  • Build-time secrets and security hardening: Uses BuildKit secrets for private PyPI and recommends non-root, slim, read-only filesystem patterns.

Quick Start

Ask your agent to generate a multi-stage Dockerfile for your dbt project and a compose.yaml that brings up Spark + Airflow + Kafka + MinIO + Postgres locally with secure BuildKit secret usage.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: docker-data-environments
Download link: https://github.com/ivanshamaev/de-agent-skills/archive/main.zip#docker-data-environments

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.