lora-caption-dataset
Community
Generate consistent LoRA captions with a trigger token.
Data & Analytics
#computer vision, #lora, #captioning, #text normalization, #dataset preparation, #trigger token
Author: redbananastudios
Version: 1.0.0
Installs: 0
System Documentation
What problem does it solve?
LoRA training runs are often hard to reproduce when dataset captions are missing or inconsistent, because downstream training depends on a stable trigger-token format across every image.
Core Features & Use Cases
- Caption generation or validation: Creates or checks the <image-basename>.txt caption file for every training image, optionally overwriting as requested.
- Trigger-token enforcement: Guarantees each caption includes the required trigger token in the correct, deterministic position for the training pipeline.
- Class-consistent formatting: Applies character/product/style/pose/environment templates so captions follow uniform structure for downstream training.
- Noise reduction: Strips common caption leakage like model names, watermarks, and generic phrases to keep captions clean and usable.
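The trigger-token enforcement and noise reduction described above can be illustrated with a minimal Python sketch. It is not the skill's actual implementation: the function name normalize_caption, the NOISE_PATTERNS list, and the choice to put the trigger token first in a comma-separated caption are all assumptions made for illustration.

```python
import re

# Hypothetical noise patterns; the skill's real list is not documented here.
NOISE_PATTERNS = [
    r"\bwatermark(ed)?\b",
    r"\bstable diffusion\b",
    r"\bmidjourney\b",
    r"\ban? (image|photo|picture) of\b",
]


def normalize_caption(caption: str, trigger: str) -> str:
    """Strip noise phrases and place the trigger token first, exactly once.

    Assumption: the "deterministic position" is the start of the caption.
    """
    text = caption
    for pattern in NOISE_PATTERNS:
        text = re.sub(pattern, "", text, flags=re.IGNORECASE)
    # Split on commas, collapse inner whitespace, and drop empty fragments.
    parts = [" ".join(p.split()) for p in text.split(",")]
    # Remove any existing copy of the trigger so it is not duplicated.
    parts = [p for p in parts if p and p.lower() != trigger.lower()]
    return ", ".join([trigger] + parts)


print(normalize_caption("a photo of a red mug, watermark, Midjourney", "rbmug"))
# prints: rbmug, a red mug
```

Because the function removes an existing trigger before re-adding it, running it twice on the same caption yields the same result, which is the property a reproducible dataset needs.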
Quick Start
Use the skill to generate standardized captions for all images in your LoRA dataset folder, ensuring every caption includes the derived trigger token and matches the class template you specify.
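As a rough picture of what this step amounts to on disk, the sketch below walks a dataset folder and writes a <image-basename>.txt file next to each image. Everything named here is hypothetical: the write_captions function, the IMAGE_EXTS set, the TEMPLATES strings, and the describe callback (a stand-in for whatever produces the per-image description) are assumptions, not the skill's documented interface.

```python
from pathlib import Path

IMAGE_EXTS = {".jpg", ".jpeg", ".png", ".webp"}  # assumed set of image types

# Hypothetical class templates; the skill's real templates are not shown here.
TEMPLATES = {
    "character": "{trigger}, character, {description}",
    "product": "{trigger}, product photo, {description}",
    "style": "{trigger}, style reference, {description}",
}


def write_captions(folder, trigger, class_name, describe, overwrite=False):
    """Write <image-basename>.txt for each image; return the file names written.

    `describe` is a caller-supplied function mapping an image path to a short
    description string (an assumption standing in for the captioning step).
    """
    template = TEMPLATES[class_name]
    written = []
    for image in sorted(Path(folder).iterdir()):
        if not image.is_file() or image.suffix.lower() not in IMAGE_EXTS:
            continue
        caption_path = image.with_suffix(".txt")
        if caption_path.exists() and not overwrite:
            continue  # keep existing captions unless overwriting was requested
        caption = template.format(trigger=trigger, description=describe(image))
        caption_path.write_text(caption + "\n", encoding="utf-8")
        written.append(caption_path.name)
    return written
```

A call such as write_captions("dataset/", "rbmug", "product", lambda p: "on a white table") would leave existing caption files untouched and fill in only the missing ones; passing overwrite=True regenerates all of them.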
Dependency Matrix
Required Modules
None required
Components
Standard package
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: lora-caption-dataset Download link: https://github.com/redbananastudios/ai-library/archive/main.zip#lora-caption-dataset Please download this .zip file, extract it, and install it in the .claude/skills/ directory.