llamafactory-docs
CommunityFine-tune and deploy LLMs end-to-end.
Software Engineering#multimodal#distributed training#lora#llama-factory#inference api#llm fine-tuning#dataset formatting
Authorwenerme
Version1.0.0
Installs0
System Documentation
What problem does it solve?
LLaMA Factory documentation helps you configure and run fine-tuning, reinforcement learning, and deployment workflows without getting stuck on framework details and dataset formatting requirements.
Core Features & Use Cases
- Training & RLHF/Preference Learning Guidance: Supports SFT, DPO/KTO/PPO/ORPO/SimPO/CPO family, reward modeling, and GRPO-style reinforcement learning workflows.
- Dataset Preparation & Formats: Explains dataset requirements for alpaca/sharegpt-style data, including multimodal fields needed for images/videos/audios.
- End-to-End Deployment Paths: Covers deployment and serving options such as OpenAI-style API and inference backends like vLLM and SGLang, plus the LLaMA Board Web UI.
- Quick CLI Workflows: Provides command patterns for training, web UI, chat, and API serving using
llamafactory-cli.
Quick Start
Ask for a complete setup and training plan by running: llamafactory-cli train examples/train_lora/llama3_lora_sft.yaml.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: llamafactory-docs Download link: https://github.com/wenerme/ai/archive/main.zip#llamafactory-docs Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.