synthdata-generate

Community

Generate synthetic tabular data from YAML schemas.

Authorrappdw
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Generate synthetic datasets from YAML schemas to accelerate testing, prototyping, and privacy-safe data exploration across multiple domains.

Core Features & Use Cases

  • Domain-template driven generation: HR directories, ecommerce orders, SaaS metrics, healthcare records, and more.
  • YAML-schema driven engines with Faker-backed fields, distributions (normal, lognormal, zipf, poisson), foreign-key integrity, behavioral profiles, and temporal event generation.
  • Output formats include xlsx, csv, json, sql, and parquet for easy integration into analytics pipelines and apps.

Quick Start

Provide a YAML schema or select a built-in template to generate a synthetic dataset.

Dependency Matrix

Required Modules

numpypandaspyyamlfaker

Components

scripts

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: synthdata-generate
Download link: https://github.com/rappdw/synthdata/archive/main.zip#synthdata-generate

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.