transform-pipeline-verification
OfficialVerify transform outputs before you scale.
Data & Analytics#data quality#LLM training#cost estimation#forecasting datasets#transform pipeline#dataset linting#iterative verification
Authorlightning-rod-labs
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This Skill helps prevent wasted cost and bad training data by showing you how to run and inspect Lightning Rod transform pipeline outputs at intermediate and full stages before scaling up.
Core Features & Use Cases
- Iterative pipeline verification: Run a
QuestionPipelinewith the minimum stages you need (including seeds-only) and inspect the resulting dataset immediately. - Quality and distribution spot-checking: Validate dataset fields such as
is_valid, label distribution, and sample-level fields likequestion_text,label,reasoning, andinvalid_reason. - Server-side dataset linting: Run the dataset linter to catch structural issues (e.g., duplicates, missing required fields, label inconsistencies) before splitting or training.
- Use case: A notebook workflow where you generate 10–50 samples, confirm validity and label quality, then use
estimate_costand rerun at largermax_questionsonce the pipeline looks healthy.
Quick Start
Generate a small seeds-or-full transform run, download the produced dataset rows, spot-check validity and label fields, and then lint the dataset before you split or train.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: transform-pipeline-verification Download link: https://github.com/lightning-rod-labs/lightningrod-python-sdk/archive/main.zip#transform-pipeline-verification Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.