sglang-deepseek-v31-optimization
CommunityCurrent-main DeepSeek V3.1 optimization playbook.
AuthorBBuf
Version1.0.0
Installs0
System Documentation
What problem does it solve?
The DeepSeek V3.1 optimization manual guides engineers in auditing, recovering, extending, and debugging tool calling, thinking mode, chat templates, streaming parser behavior, loading fixes, and MTP validation for SGLang deployments.
Core Features & Use Cases
- PR-backed optimization guidance for DeepSeek V3.1 and V3.1-Terminus across SGLang.
- Cross-surface guidance covering parser, runtime, and template interactions, including MoE backend configs and backend-specific tests.
- Use Case: When Codex or SGLang needs to recover or audit DeepSeek V3.1 on a new PR, this playbook provides steps and checks.
Quick Start
Run a focused optimization pass on DeepSeek V3.1 tool calling, thinking mode, chat templates, and streaming parser behavior.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: sglang-deepseek-v31-optimization Download link: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/archive/main.zip#sglang-deepseek-v31-optimization Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.