transform-pipeline-verification

Official

Verify transform outputs before you scale.

Authorlightning-rod-labs
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This Skill helps prevent wasted cost and bad training data by showing you how to run and inspect Lightning Rod transform pipeline outputs at intermediate and full stages before scaling up.

Core Features & Use Cases

  • Iterative pipeline verification: Run a QuestionPipeline with the minimum stages you need (including seeds-only) and inspect the resulting dataset immediately.
  • Quality and distribution spot-checking: Validate dataset fields such as is_valid, label distribution, and sample-level fields like question_text, label, reasoning, and invalid_reason.
  • Server-side dataset linting: Run the dataset linter to catch structural issues (e.g., duplicates, missing required fields, label inconsistencies) before splitting or training.
  • Use case: A notebook workflow where you generate 10–50 samples, confirm validity and label quality, then use estimate_cost and rerun at larger max_questions once the pipeline looks healthy.

Quick Start

Generate a small seeds-or-full transform run, download the produced dataset rows, spot-check validity and label fields, and then lint the dataset before you split or train.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: transform-pipeline-verification
Download link: https://github.com/lightning-rod-labs/lightningrod-python-sdk/archive/main.zip#transform-pipeline-verification

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.