data-profiler

Community

Automate data profiling and dictionary generation.

AuthorTerryFYL
Version1.0.0
Installs0

System Documentation

What problem does it solve?

数据探查与数据字典生成。触发条件: (1) 用户提供新的 CSV/Excel 数据文件, (2) 说"探查数据"/"数据画像"/"data profile"/"生成数据字典", (3) 在分析之前需要全面了解数据结构。 核心能力: 自动生成数据字典、检测变量层级关系、评估缺失模式、 定义分析人群,为后续所有分析结除"盲猜列名"和"NaN崩溃"的风险。

Core Features & Use Cases

  • 自动生成 data_dictionary.md 和 data_profile_report.md,提供完整的数据字典和数据质量概览。
  • 自动化变量层级关系检测,帮助区分细粒度变量与聚合变量,避免混淆。
  • 缺失模式分析与分析人群定义,为后续分析和统计建模提供稳健的输入。
  • 适用于新数据导入、研究设计准备、数据清洗前的探索性分析。

Quick Start

Provide a CSV or Excel file path and command the system to generate the data dictionary and population definitions.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: data-profiler
Download link: https://github.com/TerryFYL/ai-research-army/archive/main.zip#data-profiler

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.