sglang-hunyuan3-preview-optimization

Community

PR-backed optimization for Hunyuan3 Preview.

AuthorBBuf
Version1.0.0
Installs0

System Documentation

What problem does it solve?

This document consolidates PR-backed optimization guidance for Tencent Hunyuan 3 Preview in SGLang, enabling engineers to audit, extend, and deploy efficient BF16 MoE configurations with precise hardware sizing, parser flags, and attention-backend requirements for production readiness.

Core Features & Use Cases

  • PR-dossier guided optimization: links to per-PR evidence and diff-based justification, ensuring changes are traceable.
  • Hardware sizing and configuration: BF16 weight and TP planning across H200/H100 and B300/GB300 GPUs with model-specific constraints.
  • Safe integration and deployment: includes parser/tool flags, MTP options, EAGLE toggles, and trust-remote-code considerations.
  • Use Case: Create and review a deployment recipe for Hy3 Preview with tested GPUs and Blackwell attention backend settings.

Quick Start

Review the latest Hunyuan3 Preview optimization PR diffs and tailor the guidance to your hardware setup.

Dependency Matrix

Required Modules

None required

Components

references

💻 Claude Code Installation

Recommended: Let Claude install automatically. Simply copy and paste the text below to Claude Code.

Please help me install this Skill:
Name: sglang-hunyuan3-preview-optimization
Download link: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/archive/main.zip#sglang-hunyuan3-preview-optimization

Please download this .zip file, extract it, and install it in the .claude/skills/ directory.
View Source Repository

Agent Skills Search Helper

Install a tiny helper to your Agent, search and equip skill from 471,000+ vetted skills library on demand.