Name: sglang-llama4-optimization
Availability: InStock
Author: BBuf

System Documentation

What problem does it solve?

PR-backed optimization manual for Llama4 in SGLang to guide evaluation, patching, and documentation of model integration. It consolidates governance through PR histories and diffusion notes to ensure traceable improvements across quantization, routing, and multimodal backends.

Core Features & Use Cases

PR-diff driven review and documentation of Llama4 integration changes.
Quantization, routing decisions, and Eagle backend compatibility guidance.
Use Case: When a PR introduces FP8/Llama4 features, practitioners can follow the manual to reproduce the change, assess impact, and capture the resulting notes for future audits.

Quick Start

Begin by locating the latest Llama4 optimization PR dossier in the history and follow the diff audit steps to reproduce, validate, and document the changes.

Please help me install this Skill: Name: sglang-llama4-optimization Download link: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/archive/main.zip#sglang-llama4-optimization Please download this .zip file, extract it, and install it in the .claude/skills/ directory.

sglang-llama4-optimization

System Documentation

What problem does it solve?

Core Features & Use Cases

Quick Start

Dependency Matrix

Required Modules

Components

💻 Claude Code Installation

Agent Skills Search Helper