vllm-bootstrap
CommunityBoot a production-ready vLLM server
Software Engineering#tool calling#tensor-parallel#vllm#deployment verification#flashinfer#openai-compatible api#gpu oom
Authorsaintgo7
Version1.0.0
Installs0
System Documentation
What problem does it solve?
vLLM servers often fail to start or behave incorrectly due to dependency/version mismatches, wrong tensor-parallel settings, tool-calling parser incompatibilities, and GPU memory/OOM issues during model loading and request handling.
Core Features & Use Cases
- Verified dependency matrix: Provides a tested set of package versions (vLLM, transformers, flashinfer-python/cubin, mistral_common) and warns about known-bad combinations.
- Launch configuration guide: Defines a standard OpenAI-compatible server startup command and explains key flags like tensor parallel size (TP), max context length, and GPU memory utilization.
- Failure-to-fix playbook: Covers common boot/runtime errors (DeepGEMM issues, flashinfer cubin mismatch, ReasoningEffort import errors, tool-call parser mismatches, NCCL TP hangs, OOM, port conflicts) with concrete remediation steps.
- Tool-call parser mapping: Maps tool-call parser choices (hermes/mistral/pythonic/etc.) to model families to reduce “tool calls not parsed” failures.
- Verification & monitoring: Includes curl-based smoke tests for /v1/models, chat completions, and tool calling plus guidance for log tailing and GPU metric checks.
Quick Start
Ask an AI to guide you to install the verified dependencies and launch an OpenAI-compatible vLLM server with a chosen tensor-parallel size and tool-call parser that matches your model, then run the /v1/models and chat/tool-calling smoke tests to confirm it is working.
Dependency Matrix
Required Modules
None requiredComponents
scripts
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: vllm-bootstrap Download link: https://github.com/saintgo7/claude-skills/archive/main.zip#vllm-bootstrap Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.