huggingface-local-models

Community

Run local GGUF models with llama.cpp.

Author: domattioli
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

Locate, download, and run GGUF models from Hugging Face locally with llama.cpp, on CPU or on accelerators, so that the model and quantization you deploy match the hardware you have.

Core Features & Use Cases

  • HF local-app guidance: use Hugging Face's local-app hardware compatibility data to pick a quantization that fits your hardware.
  • Model discovery & lookup: search Hugging Face for llama.cpp-compatible GGUFs and verify exact filenames.
  • Local serving: launch with llama-cli or llama-server using the exact GGUF file for low-latency inference.
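
As a rough illustration of the "verify exact filenames" step, the sketch below filters a repository's file listing down to the GGUF files that carry a given quantization tag. It is a minimal sketch, not the skill's own code: `pick_gguf`, `list_repo_files`, and the example filenames are hypothetical names introduced here, and it assumes the quant tag appears in the filename, which is a common convention but not guaranteed. The only library call used is `HfApi().list_repo_files` from `huggingface_hub`, and it is imported lazily so the filter itself runs offline.

```python
from typing import List


def list_repo_files(repo_id: str) -> List[str]:
    """Fetch the file listing of a Hugging Face repo (needs network access)."""
    # Imported lazily so the pure helper below works without huggingface_hub.
    from huggingface_hub import HfApi

    return list(HfApi().list_repo_files(repo_id))


def pick_gguf(files: List[str], quant: str) -> List[str]:
    """Return the GGUF filenames whose name contains the quant tag.

    Matching is case-insensitive; the result is sorted for stable output.
    """
    tag = quant.lower()
    return sorted(
        name
        for name in files
        if name.lower().endswith(".gguf") and tag in name.lower()
    )


# Hypothetical listing, standing in for list_repo_files("<user>/<repo>").
example_files = ["README.md", "model-Q4_K_M.gguf", "model-Q8_0.gguf"]
print(pick_gguf(example_files, "q4_k_m"))
```

If the filter returns more than one name (for example, a model split into several shards), the exact file to pass to llama.cpp still has to be chosen by hand.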

Quick Start

Install the required tools, choose a GGUF model on Hugging Face and note its exact filename, then start llama-server with that GGUF file.
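
The last step of the quick start could look roughly like the sketch below, which assembles a `llama-server` command line for a GGUF file that has already been downloaded. This is an illustration under stated assumptions rather than the skill's actual procedure: `build_server_command` and `launch` are hypothetical helpers, the model path is a placeholder, and the defaults (localhost, port 8080, a 4096-token context, no GPU offload) are arbitrary choices. The flags used (`-m`, `--host`, `--port`, `-c`, `-ngl`) are standard llama.cpp options.

```python
import subprocess
from typing import List


def build_server_command(
    gguf_path: str,
    host: str = "127.0.0.1",
    port: int = 8080,
    ctx_size: int = 4096,
    gpu_layers: int = 0,
) -> List[str]:
    """Build the argument list for llama-server (assumed to be on PATH)."""
    cmd = [
        "llama-server",
        "-m", gguf_path,       # exact GGUF file to load
        "--host", host,
        "--port", str(port),
        "-c", str(ctx_size),   # context size in tokens
    ]
    if gpu_layers > 0:
        # Offload this many layers to the GPU; omitted for CPU-only runs.
        cmd += ["-ngl", str(gpu_layers)]
    return cmd


def launch(cmd: List[str]) -> subprocess.Popen:
    """Start the server as a child process; the caller must stop it."""
    return subprocess.Popen(cmd)


# Placeholder path; substitute the GGUF file you actually downloaded.
print(" ".join(build_server_command("models/model-Q4_K_M.gguf")))
```

Building the command as a list instead of a single string avoids shell quoting problems when the model path contains spaces. Call `launch(...)` only once the printed command looks right.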

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: let Claude install it automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: huggingface-local-models
Download link: https://github.com/domattioli/DomI/archive/main.zip#huggingface-local-models

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
