skill-system-eda
CommunityDeterministic Polars-first EDA for datasets
Authorarthur0824hao
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Profile and validate tabular datasets end-to-end by generating deterministic profiles and human-readable reports, with optional memory writeback to a dedicated memory store.
Core Features & Use Cases
- Deterministic profiling of CSV/Parquet datasets using Polars with lazy scanning for large files.
- Generation of machine-readable profile.yaml and human-readable report.md, plus optional memory writeback integration for traceability.
- Use cases include profiling new datasets, checking data quality, drift detection, anomaly analysis, and saving/validating data contracts.
Quick Start
Run profile-dataset on a CSV file to generate profile.yaml and report.md.
Dependency Matrix
Required Modules
numpypolarspyyamlscipyscikit-learn
Components
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: skill-system-eda Download link: https://github.com/arthur0824hao/ExperimentPipeline/archive/main.zip#skill-system-eda Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.