Skip to content

Pull requests: sgl-project/sglang

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[misc] add env for http keep alive timeout
#19847 opened Mar 4, 2026 by happierpig Loading…
5 tasks
[quant] fix fp32 downcasting
#19844 opened Mar 4, 2026 by zhooooong Loading…
5 tasks
[diffusion] runtime: introduce generation service facade diffusion SGLang Diffusion
#19839 opened Mar 4, 2026 by richl9 Loading…
5 tasks
[diffusion] docs: add diffusion runtime architecture boundary guide documentation Improvements or additions to documentation
#19838 opened Mar 4, 2026 by richl9 Loading…
5 tasks
fix cuda graph capturing error in sm120 mxfp8 triton path
#19835 opened Mar 4, 2026 by wolfcomos Loading…
5 tasks
Fix Qwen3.5 pipeline parallelism crash
#19833 opened Mar 4, 2026 by AjAnubolu Loading…
3 tasks
Add support for InstantTensor documentation Improvements or additions to documentation
#19830 opened Mar 4, 2026 by arlo-aisys Loading…
4 of 5 tasks
[AMD] aiter a8w8 gemm configuration
#19826 opened Mar 4, 2026 by seungrokj Loading…
5 tasks
[NPU][Bug fix] context parallel bug fix deepseek npu
#19820 opened Mar 4, 2026 by liupeng374 Loading…
5 tasks
[diffusion][WIP] support realtime krea diffusion diffusion SGLang Diffusion
#19817 opened Mar 4, 2026 by IPostYellow Draft
5 tasks
[AMD] Add bf16 MoE weights padding quant LLM Quantization
#19814 opened Mar 4, 2026 by Emmanuel0612 Loading…
5 tasks
Fix Qwen3.5/Qwen3Next MTP EPLB compatibility
#19812 opened Mar 4, 2026 by AjAnubolu Loading…
5 tasks
ProTip! Mix and match filters to narrow down what you’re looking for.