llm-local-deploy
OfficialDeploy LLMs locally with one-click inference.
Authormahg-es
Version1.0.0
Installs0
System Documentation
What problem does it solve?
Deploy and serve local large language models on-premises to keep data private and reduce API costs.
Core Features & Use Cases
- Deploy and configure inference servers (llama.cpp, Ollama, vLLM) to support local LLM workloads.
- Provide an OpenAI-compatible API endpoint for seamless integration with existing tooling and workflows.
- Benchmark and validate performance (throughput, latency, memory) across CPU/GPU setups for varied workloads.
Quick Start
Use the local LLM deployment workflow to select a model, configure the inference server, and start the service on your machine.
Dependency Matrix
Required Modules
None requiredComponents
Standard package💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: llm-local-deploy Download link: https://github.com/mahg-es/araya/archive/main.zip#llm-local-deploy Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.