chore(deps): bump sglang from 0.5.2 to 0.5.9 by dependabot[bot] · Pull Request #4 · Xnhyacinth/ResAdapt

dependabot · 2026-03-29T04:55:14Z

Bumps sglang from 0.5.2 to 0.5.9.

Release notes

v0.5.9

Highlights

LoRA Weight Loading Overlap with Computation: Overlap LoRA weight loading with computation during inference, reducing TTFT by ~78% and TPOT by ~34.88% on large adaptors: #15512

TRT-LLM NSA Kernel Integration for DeepSeek V3.2: Integrate TRT-LLM DSA kernels for Native Sparse Attention, boosting DeepSeek V3.2 performance by 3x-5x on Blackwell platforms with trtllm for both --nsa-prefill-backend and --nsa-decode-backend (with minor accuracy drop): #16758, #17662, #18389

Flashinfer All-to-All MoE Dispatcher: Add the Flashinfer all-to-all MoE dispatcher for efficient expert parallelism communication, enabling optimized routing in MoE models: #14668

FA4 (FP4 Attention) Support for Multimodal Encoder: Introduce FP4 attention backend and variable-length attention function for multimodal encoders, enabling lower-precision inference for vision-language models: #13539

Anthropic Compatible API Endpoint: Add native Anthropic API compatibility to SGLang, allowing direct integration with tools and clients built for the Anthropic API format: #18630

SGLang-Diffusion Advanced Optimizations: Production-ready improvements including token-level sequence sharding, parallel VAE decoding, fused kernels, Nunchaku and FP8 support, and multiple new models in the ComfyUI plugin: blog

Spec V2 Critical bug fix: Fix out-of-index bug caused by torch garbage collection in speculative decoding v2, improving reliability of speculative verification: #18958

Deploying DeepSeek on GB300 NVL72: Optimization work for long-context inference using prefill-decode disaggregation and other SGLang features on NVIDIA's latest GB300 platform: blog

Bump AITER version to 0.1.10.post3: Support FP8 Prefill/Decode/KV Cache

Commit-to-Version Lookup in docs.sglang.io: Easily find the earliest official version that includes a given PR or commit, streamlining release tracking for users and developers: #18450, link

New Model Support

Kimi-K2.5: #17789, cookbook

GLM-5: cookbook (still requires a custom docker for the transformers upgrade; an rc release will follow, since the transformers upgrade is risky)

Qwen 3.5: #18489, #18926, #18937, cookbook

MiniMax 2.5: cookbook

Ernie4.5-VL: #15679

Step3-VL: #17513

Step-3.5-Flash: #18084, cookbook

LLaDA 2.1: cookbook

Ring 2.5 1T / Ling 2.5 1T: #18598, cookbook, cookbook

MOVA (Diffusion): #17704

GLM-OCR: #17582, cookbook

DeepSeek-OCR-2: #17897

SGLang-Diffusion

Support multiple new models in ComfyUI Plugin

Parallel Folding and Parallel VAE Decoding for faster image/video generation

Nunchaku and FP8 support for diffusion models

Sequence Sharding (token-level) replacing Frame Sharding for improved efficiency

LTX-2 support: #17495, #17496

MOVA model support: #17704

Cache-DiT optimizations and fused kernel improvements

Numerous bug fixes and refactors across the diffusion pipeline

Performance

Integrate TRT-LLM NSA kernels with up to 3-5x speedup on Blackwell: #16758, #17662, #18389

... (truncated)

Commits

bbe9c7e Revert "Refactor graph input buffers (#18991)" (#19173)
901957a [CI] Skip some subtests for tool call parser (#19172)
543c051 Revert "[AMD] support two batch overlapping for mori ep #17953" (#19161)
d16da1b [CI]Extend timeout for test_text_models_perf.py (#19155)
f88e631 [diffusion] CI: relax perf check threshold (#19154)
00c4461 [PD] Change bootstrap_room metadata dtype from int64 to uint64 (#19141)
c188b0a Fix spec v2+dp attention in nsa backend (#19134)
53f096f [Fix] Quick fix for int32 overflow in Mooncakes' send_kvcache_slice (#19076)
19cbc6a Revert "[jit kernel] Support per_token_group_quant_8bit jit kernel" (#19131)
5d67bfa fix KimiK2Detector regex patterns with re.DOTALL (#19120)
Additional commits viewable in compare view
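
Commit 5d67bfa in the list above mentions re.DOTALL. As a rough illustration of what that flag changes in Python's re module, the sketch below uses a hypothetical pattern and sample text; they are assumptions for demonstration only and are not the actual KimiK2Detector patterns, which this page does not show. Without the flag, "." does not match a newline, so a pattern meant to capture a multi-line body finds nothing.

```python
import re

# Hypothetical markers and payload, for illustration only; these are NOT
# the real KimiK2Detector patterns.
PATTERN = r"<call>(.*?)</call>"
text = '<call>{"name": "f",\n "args": {}}</call>'

# Without re.DOTALL, "." does not match "\n", so the multi-line body is missed.
no_flag = re.search(PATTERN, text)

# With re.DOTALL, "." matches any character, including newlines.
with_flag = re.search(PATTERN, text, re.DOTALL)

print(no_flag is None)     # True: no match without the flag
print(with_flag.group(1))  # the captured two-line body
```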

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.

Dependabot commands and options

You can trigger Dependabot actions by commenting on this PR:

@dependabot rebase will rebase this PR
@dependabot recreate will recreate this PR, overwriting any edits that have been made to it
@dependabot show <dependency name> ignore conditions will show all of the ignore conditions of the specified dependency
@dependabot ignore this major version will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this minor version will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
@dependabot ignore this dependency will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)

Bumps [sglang](https://github.com/sgl-project/sglang) from 0.5.2 to 0.5.9.
- [Release notes](https://github.com/sgl-project/sglang/releases)
- [Commits](sgl-project/sglang@v0.5.2...v0.5.9)

---
updated-dependencies:
- dependency-name: sglang
  dependency-version: 0.5.9
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
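
The commit message above labels this change as version-update:semver-patch. A minimal sketch of how a bump from 0.5.2 to 0.5.9 can be classified that way follows. The helper names parse_version and bump_type are hypothetical, and this is not a description of how Dependabot computes the label. It assumes plain MAJOR.MINOR.PATCH strings; versions with suffixes, such as the 0.1.10.post3 mentioned in the release notes, would need a full version parser.

```python
def parse_version(v: str) -> tuple[int, int, int]:
    """Split a plain 'MAJOR.MINOR.PATCH' string into integers (assumed format)."""
    major, minor, patch = (int(part) for part in v.split("."))
    return major, minor, patch


def bump_type(old: str, new: str) -> str:
    """Name the most significant version component that differs."""
    o, n = parse_version(old), parse_version(new)
    if n[0] != o[0]:
        return "major"
    if n[1] != o[1]:
        return "minor"
    if n[2] != o[2]:
        return "patch"
    return "none"


print(bump_type("0.5.2", "0.5.9"))  # patch
```

Note that a patch-level label describes only the version number; as the release notes above show, the range from 0.5.2 to 0.5.9 still spans many features and reverts.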

dependabot · 2026-03-30T14:03:56Z

OK, I won't notify you again about this release, but will get in touch when a new version is available. If you'd rather skip all updates until the next major or minor version, let me know by commenting @dependabot ignore this major version or @dependabot ignore this minor version.

If you change your mind, just re-open this PR and I'll resolve any conflicts on it.

dependabot bot added the dependencies (Pull requests that update a dependency file) and python (Pull requests that update python code) labels Mar 29, 2026

Xnhyacinth closed this Mar 30, 2026

dependabot bot deleted the dependabot/pip/sglang-0.5.9 branch March 30, 2026 14:04

dependabot[bot] wants to merge 1 commit into main from dependabot/pip/sglang-0.5.9
