vllm-chat-templates

Official

Triage and fix vLLM chat templates in production.

Authorair-gapped
Version1.0.0
Installs0

System Documentation

What problem does it solve?

Diagnoses and triages vLLM chat-template problems by separating template-side issues from parser failures, guiding operators to identify whether the symptom originates in the template, the reasoning parser, or the tool parser, and pointing to known fixes and recovery steps.

Core Features & Use Cases

  • Layered triage guidance for the three core shells: chat template (Jinja), reasoning parser, and tool parser, with cross-layer diagnostics.
  • Per-family familiarity with flags, known bugs, workarounds, and up-to-date guidelines for model families like Qwen, DeepSeek, GPT-OSS, Kimi-K2, Llama-4, Mistral, Gemma, Phi-4, and GLM.
  • Production-oriented debugging workflow including enabling prompt logging, bypassing vLLM, capturing raw outputs, pinning versions, and using debugging references to accelerate problem resolution.

Quick Start

Start by logging the actual prompt served by vLLM and compare byte-for-byte against the model card example to determine whether the issue originates from the template, the parser, or forwarded kwargs.

Dependency Matrix

Required Modules

None required

Components

references

đź’» Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: vllm-chat-templates
Download link: https://github.com/air-gapped/skills/archive/main.zip#vllm-chat-templates

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 510,000+ vetted skills library on demand.