gen-data-dict

Community

Create BIDS-style data dictionaries for datasets.

Authorbcmcpher
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This tool automatically generates a standardized, human-readable data dictionary for every variable in a merged or analysis-ready dataset, enabling consistent metadata documentation and sharing in BIDS-style formats.

Core Features & Use Cases

  • Annotates each column with a descriptive label, data type, cardinality, sample values, and null fraction.
  • Produces a JSON data dictionary named <input_stem>_data_dictionary.json that can be used for data governance, reproducibility, and data sharing.
  • Useful when preparing merged TSV/CSV or Parquet files for neuroimaging, clinical, or analytics projects, or when documenting a dataset for publication.

Quick Start

Provide a merged input file (e.g., merged.tsv) to generate the data dictionary in the same directory.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: gen-data-dict
Download link: https://github.com/bcmcpher/my-skills/archive/main.zip#gen-data-dict

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.