lora-prepare-dataset
CommunityValidate and ready your LoRA training data
Data & Analytics#image quality#deduplication#computer vision#lora#dataset validation#caption coverage#training readiness
Authorredbananastudios
Version1.0.0
Installs0
System Documentation
What problem does it solve?
It prevents wasted LoRA training runs by checking whether a dataset of training images meets quality, consistency, and suitability requirements before you spend compute.
Core Features & Use Cases
- Dataset gatekeeping: Verifies minimum image counts per LoRA class and produces a structured pass, warning, or fail verdict.
- Per-image quality checks: Screens for decodeable formats, minimum resolution, cropping/subject visibility, exposure/blur, and data consistency heuristics.
- Duplicate and coverage detection: Flags exact and near-duplicates and reports caption file coverage so downstream captioning or training can proceed safely.
Quick Start
Validate a dataset by providing the absolute image folder path, the LoRA class, the intended subject text, and an absolute output folder for the generated prepare_report.json.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: lora-prepare-dataset Download link: https://github.com/redbananastudios/ai-library/archive/main.zip#lora-prepare-dataset Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.