data-loading
Community
Ingest, inspect, and map SDTM data from S3.
Author: siddharthchauhan
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
This Skill automates the tedious, error-prone first steps of SDTM pipeline Phase 1: loading raw EDC data from AWS S3, downloading and extracting ZIP archives, scanning the source files, and analyzing their data structures.
Core Features & Use Cases
- S3 Data Ingestion: Load raw EDC data from S3 buckets and locate domain files for processing.
- ZIP Extraction & File Discovery: Automatically download, unzip, and enumerate extracted CSV/XML files for domain mapping.
- Source File Scanning & Analysis: Detect available SDTM domains, assess file structure, and summarize key metadata to guide transformation.
- Use Case: Prepare MAXIS-08 or similar study data for SDTM mapping by previewing domain files and verifying data completeness before transformation.
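The extraction and discovery features above can be illustrated with a minimal sketch that uses only the Python standard library. It is not the Skill's actual implementation: the function name `extract_and_scan`, the `KNOWN_DOMAINS` set, and the rule that a file's domain is the first two letters of its name are all assumptions made for illustration.

```python
import zipfile
from pathlib import Path

# Assumption: source files are named after their two-letter SDTM domain
# (for example DM.csv or ae_raw.xml). Real studies may need a mapping table.
KNOWN_DOMAINS = {"DM", "AE", "CM", "EX", "LB", "MH", "VS", "DS"}


def extract_and_scan(zip_path, dest_dir):
    """Unzip an archive and group the extracted CSV/XML files by detected domain."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest)

    found = {}
    for path in sorted(dest.rglob("*")):
        if path.is_file() and path.suffix.lower() in {".csv", ".xml"}:
            prefix = path.stem[:2].upper()
            # Files that do not match a known domain are kept for manual review.
            domain = prefix if prefix in KNOWN_DOMAINS else "UNKNOWN"
            found.setdefault(domain, []).append(path)
    return found
```

Keeping unmatched files under an `UNKNOWN` key, rather than dropping them, makes gaps in the naming convention visible before any mapping starts.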
Quick Start
- Load data from S3: specify study_id, s3_bucket, and s3_prefix to fetch RAW_DATA.
- Scan the extracted directory to enumerate files and detected domains.
- Analyze a representative source file to verify row/column counts and data quality before mapping.
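As a rough sketch of the first and last Quick Start steps, the code below downloads every object under a prefix and then summarizes one CSV file. The function names `download_raw_data` and `summarize_csv` are hypothetical and are not taken from the Skill. The S3 client is passed in as a parameter; with boto3 installed it would typically be `boto3.client("s3")`, whose `get_paginator("list_objects_v2")` and `download_file` methods are the only calls used here.

```python
import csv
from pathlib import Path


def download_raw_data(s3_client, study_id, s3_bucket, s3_prefix, dest_root):
    """Download all objects under s3_prefix into dest_root/study_id."""
    dest = Path(dest_root) / study_id
    dest.mkdir(parents=True, exist_ok=True)
    downloaded = []
    paginator = s3_client.get_paginator("list_objects_v2")
    for page in paginator.paginate(Bucket=s3_bucket, Prefix=s3_prefix):
        for obj in page.get("Contents", []):
            key = obj["Key"]
            if key.endswith("/"):  # skip folder placeholder objects
                continue
            # Assumption: base names are unique, so the key path is flattened.
            target = dest / Path(key).name
            s3_client.download_file(s3_bucket, key, str(target))
            downloaded.append(target)
    return downloaded


def summarize_csv(path):
    """Return the row count, column names, and blank-value count per column."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        columns = list(reader.fieldnames or [])
        blank_counts = {name: 0 for name in columns}
        rows = 0
        for row in reader:
            rows += 1
            for name in columns:
                if not (row.get(name) or "").strip():
                    blank_counts[name] += 1
    return {"rows": rows, "columns": columns, "blank_counts": blank_counts}
```

The blank-value count is only a simple stand-in for a data quality check; a real pipeline would likely also validate types, controlled terminology, and required variables.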
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill:
Name: data-loading
Download link: https://github.com/siddharthchauhan/ETL/archive/main.zip#data-loading
Please download this .zip file, extract it, and install it in the .claude/skills/ directory.