kailash-align
CommunityFine-tune and align LLMs with eval-gated serving.
Software Engineering#alignment#trl#llm fine-tuning#lora adapters#eval-before-serve#ollama deployment#vllm serving
AuthorFFOO6866
Version1.0.0
Installs0
System Documentation
What problem does it solve?
It prevents unsafe or underperforming fine-tuned LLM deployments by providing an end-to-end training, evaluation, and serving workflow with a hard evaluation-before-serving gate.
Core Features & Use Cases
- End-to-end alignment pipeline: Orchestrates training across multiple TRL-based alignment methods and adapter lifecycles.
- 12 supported alignment methods: Covers offline, unpaired, monolithic, and online training patterns (including LoRA-based adapter management and chaining for multi-stage flows like sft_then_dpo).
- Mandatory eval-before-serve: Requires benchmarking and safety checks (including base-model comparison) before any model can be deployed.
- Deployment options for real runtimes: Exports to GGUF, deploys to Ollama, serves via vLLM, and connects deployed models into Kaizen agents through a bridge.
Quick Start
Ask the AI to train and align a DPO adapter, then evaluate it and deploy it to Ollama or vLLM only if evaluation passes.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: kailash-align Download link: https://github.com/FFOO6866/lead2cash/archive/main.zip#kailash-align Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.