kailash-align

Community

Fine-tune and align LLMs with eval-gated serving.

AuthorFFOO6866
Version1.0.0
Installs0

System Documentation

What problem does it solve?

It prevents unsafe or underperforming fine-tuned LLM deployments by providing an end-to-end training, evaluation, and serving workflow with a hard evaluation-before-serving gate.

Core Features & Use Cases

  • End-to-end alignment pipeline: Orchestrates training across multiple TRL-based alignment methods and adapter lifecycles.
  • 12 supported alignment methods: Covers offline, unpaired, monolithic, and online training patterns (including LoRA-based adapter management and chaining for multi-stage flows like sft_then_dpo).
  • Mandatory eval-before-serve: Requires benchmarking and safety checks (including base-model comparison) before any model can be deployed.
  • Deployment options for real runtimes: Exports to GGUF, deploys to Ollama, serves via vLLM, and connects deployed models into Kaizen agents through a bridge.

Quick Start

Ask the AI to train and align a DPO adapter, then evaluate it and deploy it to Ollama or vLLM only if evaluation passes.

Dependency Matrix

Required Modules

None required

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: kailash-align
Download link: https://github.com/FFOO6866/lead2cash/archive/main.zip#kailash-align

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.