mergekit

Community

Merge specialist LLMs into one model.

Author: qcmuu
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Mergekit combines multiple fine-tuned language model checkpoints into a single, more broadly capable model without any additional training.

Core Features & Use Cases

  • GPU-free model fusion: Many merge methods run entirely on CPU, so you can create a merged model without costly retraining.
  • Multiple merge strategies: Use SLERP, TIES, DARE, Task Arithmetic, Frankenmerge (layer stacking), and Evolutionary merge to control how capabilities combine.
  • Practical outcomes: Combine coding/math/reasoning specialists to reduce catastrophic forgetting and improve breadth compared to selecting a single checkpoint.
  • Use Case: You have separate LoRA/finetune outputs for math and coding on the same base architecture; Mergekit merges them into one checkpoint that balances both abilities.
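To make two of the merge strategies above concrete, here is a minimal sketch of SLERP and Task Arithmetic applied to small lists of floats standing in for flattened weight tensors. It is a toy illustration of the underlying math only, not mergekit's implementation; the function names `slerp` and `task_arithmetic` and the `eps` threshold are assumptions made for this example.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherical linear interpolation between two equal-length vectors.

    t=0 returns v0, t=1 returns v1. Falls back to plain linear
    interpolation when the vectors are (nearly) colinear or zero.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))
    if norm0 < eps or norm1 < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    # Angle between the two vectors, clamped for numerical safety.
    cos_theta = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    cos_theta = max(-1.0, min(1.0, cos_theta))
    theta = math.acos(cos_theta)
    sin_theta = math.sin(theta)
    if abs(sin_theta) < eps:
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


def task_arithmetic(base, finetuned, weights):
    """Add weighted task vectors (finetuned - base) back onto the base."""
    merged = list(base)
    for model, weight in zip(finetuned, weights):
        for i, (ft, b) in enumerate(zip(model, base)):
            merged[i] += weight * (ft - b)
    return merged
```

For example, `slerp(0.5, math_weights, code_weights)` would blend two specialists evenly along the arc between them, while `task_arithmetic(base, [math_weights, code_weights], [1.0, 1.0])` would add both specialists' deltas to the shared base.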

Quick Start

Ask the AI to merge your checkpoints into an output directory by running mergekit on a SLERP YAML config file (e.g., mergekit-yaml slerp_merge.yaml ./merged-model --copy-tokenizer).
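As a rough sketch of what that step might involve, the Python below writes a SLERP config and assembles the command quoted above. The checkpoint paths are hypothetical, the `layer_range` values assume a 32-layer architecture, and the config layout is an assumption to verify against the mergekit version you have installed. The helper names `write_config` and `build_command` are invented for this example.

```python
from pathlib import Path

# Hypothetical checkpoint paths; replace with your own models.
MATH_MODEL = "./checkpoints/math-specialist"
CODE_MODEL = "./checkpoints/code-specialist"

# Assumed SLERP config layout; check it against your installed mergekit.
# layer_range assumes both models have 32 transformer layers.
SLERP_CONFIG = f"""\
slices:
  - sources:
      - model: {MATH_MODEL}
        layer_range: [0, 32]
      - model: {CODE_MODEL}
        layer_range: [0, 32]
merge_method: slerp
base_model: {MATH_MODEL}
parameters:
  t: 0.5
dtype: bfloat16
"""


def write_config(path="slerp_merge.yaml"):
    """Write the SLERP config to disk and return its path."""
    config_path = Path(path)
    config_path.write_text(SLERP_CONFIG, encoding="utf-8")
    return config_path


def build_command(config_path="slerp_merge.yaml", output_dir="./merged-model"):
    """Return the mergekit-yaml invocation as an argument list."""
    return ["mergekit-yaml", str(config_path), output_dir, "--copy-tokenizer"]
```

With mergekit installed, passing `build_command(write_config())` to `subprocess.run(..., check=True)` would start the merge. Here `t: 0.5` weights both models equally; values nearer 0 stay closer to the base model.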

Dependency Matrix

Required Modules

  • mergekit
  • transformers
  • torch
  • pyyaml
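A quick way to confirm these modules are available before merging is to look each one up without importing it. The sketch below uses only the standard library; the helper name `missing_modules` is an assumption for this example. Note that the pyyaml package is imported as `yaml`.

```python
import importlib.util

# Import names for the required packages; pyyaml is imported as "yaml".
REQUIRED_MODULES = ["mergekit", "transformers", "torch", "yaml"]


def missing_modules(names=REQUIRED_MODULES):
    """Return the subset of module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules are installed.")
```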

Components

Standard package

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: mergekit
Download link: https://github.com/qcmuu/AI-Research-Skills/archive/main.zip#mergekit

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
