synthdata-generate
CommunityGenerate synthetic tabular data from YAML schemas.
Authorrappdw
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Generate synthetic datasets from YAML schemas to accelerate testing, prototyping, and privacy-safe data exploration across multiple domains.
Core Features & Use Cases
- Domain-template driven generation: HR directories, ecommerce orders, SaaS metrics, healthcare records, and more.
- YAML-schema driven engines with Faker-backed fields, distributions (normal, lognormal, zipf, poisson), foreign-key integrity, behavioral profiles, and temporal event generation.
- Output formats include xlsx, csv, json, sql, and parquet for easy integration into analytics pipelines and apps.
Quick Start
Provide a YAML schema or select a built-in template to generate a synthetic dataset.
Dependency Matrix
Required Modules
numpypandaspyyamlfaker
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: synthdata-generate Download link: https://github.com/rappdw/synthdata/archive/main.zip#synthdata-generate Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.