sglang-hunyuan3-preview-optimization
CommunityPR-backed optimization for Hunyuan3 Preview.
AuthorBBuf
Version1.0.0
Installs0
System Documentation
What problem does it solve?
This document consolidates PR-backed optimization guidance for Tencent Hunyuan 3 Preview in SGLang, enabling engineers to audit, extend, and deploy efficient BF16 MoE configurations with precise hardware sizing, parser flags, and attention-backend requirements for production readiness.
Core Features & Use Cases
- PR-dossier guided optimization: links to per-PR evidence and diff-based justification, ensuring changes are traceable.
- Hardware sizing and configuration: BF16 weight and TP planning across H200/H100 and B300/GB300 GPUs with model-specific constraints.
- Safe integration and deployment: includes parser/tool flags, MTP options, EAGLE toggles, and trust-remote-code considerations.
- Use Case: Create and review a deployment recipe for Hy3 Preview with tested GPUs and Blackwell attention backend settings.
Quick Start
Review the latest Hunyuan3 Preview optimization PR diffs and tailor the guidance to your hardware setup.
Dependency Matrix
Required Modules
None requiredComponents
references
💻 Claude Code Installation
Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.
Please help me install this Skill: Name: sglang-hunyuan3-preview-optimization Download link: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/archive/main.zip#sglang-hunyuan3-preview-optimization Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
Agent Skills Search Helper
Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.