sglang-deepseek-v31-optimization

Community

Optimization playbook for DeepSeek V3.1, tracking the current SGLang main branch.

Author: BBuf
Version: 1.0.0
Installs: 0

System Documentation

What problem does it solve?

This playbook guides engineers through auditing, recovering, extending, and debugging DeepSeek V3.1 in SGLang deployments. It covers tool calling, thinking mode, chat templates, streaming parser behavior, loading fixes, and MTP (multi-token prediction) validation.

Core Features & Use Cases

  • PR-backed optimization guidance for DeepSeek V3.1 and V3.1-Terminus across SGLang.
  • Cross-surface guidance covering parser, runtime, and template interactions, including MoE backend configs and backend-specific tests.
  • Use Case: When DeepSeek V3.1 support in SGLang needs to be recovered or audited against a new PR, for example by Codex, this playbook supplies the steps and checks.

Quick Start

Run a focused optimization pass on DeepSeek V3.1 tool calling, thinking mode, chat templates, and streaming parser behavior.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: let Claude install the skill automatically. Copy the text below and paste it into Claude Code.

Please help me install this Skill:
Name: sglang-deepseek-v31-optimization
Download link: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/archive/main.zip#sglang-deepseek-v31-optimization

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
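
For readers who prefer to install by hand, the extract-and-install step above can be sketched in Python. This is a minimal sketch, not the registry's own installer: the function name install_skill is hypothetical, and it assumes the archive has already been downloaded and that it contains a folder named exactly after the skill somewhere in its tree. The real layout of the archive is not documented here, so check it before relying on this.

```python
import shutil
import zipfile
from pathlib import Path, PurePosixPath


def install_skill(archive_path, skill_name, skills_dir):
    """Copy the folder named `skill_name` out of a zip archive into `skills_dir`.

    Hypothetical helper: assumes the skill lives in a directory whose name
    equals `skill_name`, at any depth inside the archive.
    """
    target_root = Path(skills_dir) / skill_name
    copied = 0
    with zipfile.ZipFile(archive_path) as archive:
        for name in archive.namelist():
            if name.endswith("/"):
                continue  # directory entry, nothing to copy
            parts = PurePosixPath(name).parts
            # The skill name must be a directory component, not the file name.
            if skill_name not in parts[:-1]:
                continue
            # Keep only the path below the skill folder.
            relative = parts[parts.index(skill_name) + 1:]
            if ".." in relative:
                continue  # skip entries that try to escape the target folder
            destination = target_root.joinpath(*relative)
            destination.parent.mkdir(parents=True, exist_ok=True)
            with archive.open(name) as src, open(destination, "wb") as dst:
                shutil.copyfileobj(src, dst)
            copied += 1
    if copied == 0:
        raise FileNotFoundError(f"no folder named {skill_name!r} in {archive_path}")
    return target_root
```

With the archive saved locally as main.zip (an assumed file name), calling install_skill("main.zip", "sglang-deepseek-v31-optimization", ".claude/skills") would place the skill under .claude/skills/sglang-deepseek-v31-optimization. Copying only the matching folder, rather than extracting the whole archive, keeps the other skills in the repository out of the skills directory.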
