SGLang-Diffusion：发布两月进展综述

2026-01-16 00:00·168天前

AI 摘要

SGLang-Diffusion 最新版本（lmsysorg/sglang:dev-pr-17247）性能较初始版本提升2.5倍，在NVIDIA GPU上较其他方案快5倍。新增Day-0支持Flux.2、Qwen-Image系列、Z-Image-Turbo等多款模型，完整支持LoRA格式与HTTP API，并推出ComfyUI集成插件。技术层面引入Layerwise Offload机制实现计算与权重加载重叠，支持SP/TP混合并行及SageAttention系列后端，兼容AMD、4090、5090及MUSA硬件。

原文 · 未翻译

Contents

Overview

Performance Benchmark

Key Improvements

Layerwise Offload

Kernel Improvements

Cache-DiT Integration

Few More Things

Roadmap (26Q1)

Acknowledgment

Learn more

SGLang-Diffusion: Two Months In

Since its release in early Nov. 2025, SGLang-Diffusion has gained significant attention and widespread adoption within the community. We are deeply grateful for the extensive feedback and growing number of contributions from open-source developers.

Over the past two months, we've been meticulously optimizing SGLang-Diffusion, now (docker image tag: lmsysorg/sglang:dev-pr-17247) up to 2.5x faster than our initial release.

lmsysorg/sglang:dev-pr-17247

Here is a summary of our progress:

Overview

New Models:

Day-0 support for Flux.2, Qwen-Image-Edit-2511, Qwen-Image-2512, Z-Image-Turbo, Qwen-Image-Layered, TurboWan, GLM-Image and more.

Run SGLang-Diffusion with diffusers backend: compatible with all models in diffusers; more improvements are planned (see Issue #16642).

LoRA Support:

We support almost all LoRA formats for supported models. This section lists some example LoRAs that have been explicitly tested and verified. Base ModelSupported LoRAsWan2.2lightx2v/Wan2.2-Distill-Loras Cseti/wan2.2-14B-Arcane_Jinx-lora-v1Wan2.1lightx2v/Wan2.1-Distill-LorasZ-Image-Turbotarn59/pixel_art_style_lora_z_image_turbo wcde/Z-Image-Turbo-DeJPEG-LoraQwen-Imagelightx2v/Qwen-Image-Lightning flymy-ai/qwen-image-realism-lora prithivMLmods/Qwen-Image-HeadshotX starsfriday/Qwen-Image-EVA-LoRAQwen-Image-Editostris/qwen_image_edit_inpainting lightx2v/Qwen-Image-Edit-2511-LightningFluxdvyio/flux-lora-simple-illustration XLabs-AI/flux-furry-lora XLabs-AI/flux-RealismLora

LMSYS：Blog（Chatbot Arena 团队）

导出 Markdown

SGLang-Diffusion：发布两月进展综述

2026-01-16 00:00·168天前

阅读原文· lmsys.org

AI 摘要

原文 · 保持原样，未翻译

Contents

Overview

Performance Benchmark

Key Improvements

Layerwise Offload

Kernel Improvements

Cache-DiT Integration

Few More Things

Roadmap (26Q1)

Acknowledgment

Learn more

SGLang-Diffusion: Two Months In