Name: sglang
Availability: InStock
Author: ovachiever

System Documentation

What problem does it solve?

SGLang is a high-performance serving framework for LLMs with RadixAttention prefix caching. It accelerates structured outputs (JSON/regex) and agent workflows by caching shared prefixes across requests.

Core Features & Use Cases

RadixAttention: automatic KV-prefix caching to accelerate multi-turn prompts
Structured outputs: JSON, regex-constrained generation
Agent workflows with function calling and tool integration
Fast decoding and reduced latency for large prompts
Production-grade deployment for multi-tenant LLM serving

Use cases include agent-driven tasks, chatbots with repeated prefixes, and high-throughput inference pipelines.

Quick Start

Run a local server and expose a JSON-structured output endpoint, then integrate with a client to perform batch generations.

Please help me install this Skill: Name: sglang Download link: https://github.com/ovachiever/droid-tings/archive/main.zip#sglang Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

sglang

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper