Name: vllm-tool-calling
Availability: InStock
Author: saintgo7

System Documentation

What problem does it solve?

vLLM tool calling can silently fail or leak tool-call markup into user-visible content, breaking automated function execution in production.

Core Features & Use Cases

3-stage defenses (server + model + client fallback): Prevents failures when any single layer regresses, including stream boundary issues.
Parser-to-model mapping: Ensures the selected vLLM --tool-call-parser matches the model’s real tokenizer/chat-template output format.
Client-side promotion of leaked patterns: Detects Hermes/Qwen3 XML and bare-JSON cases, then promotes them into OpenAI-standard tool_calls while stripping leaked content.
Smoke test guidance: Validates both non-stream and stream behavior and checks that leaks do not appear in content.

Quick Start

Install and activate this skill guidance by running: ./install.sh vllm-tool-calling.

Please help me install this Skill: Name: vllm-tool-calling Download link: https://github.com/saintgo7/claude-skills/archive/main.zip#vllm-tool-calling Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

vllm-tool-calling

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper