sgl-project

Pinned

  1. sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python; 24.1k stars; 4.6k forks

  2. sgl-learning-materials Public

    Materials for learning SGLang

    766 stars; 57 forks

  3. ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go; 383 stars; 61 forks

  4. genai-bench Public

    Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.

    Python; 274 stars; 49 forks

  5. SpecForge Public

    Train speculative decoding models effortlessly and port them smoothly to SGLang serving.

    Python; 717 stars; 167 forks

  6. sglang-jax Public

    JAX backend for SGL

    Python; 244 stars; 73 forks

Repositories

Showing 10 of 23 repositories
  • sgl-project.github.io Public

    This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang

    HTML; 109 stars; 26 forks; 10 issues; 1 pull request; updated Mar 4, 2026
  • sglang Public

    SGLang is a high-performance serving framework for large language models and multimodal models.

    Python; 24,052 stars; Apache-2.0; 4,638 forks; 580 issues (28 issues need help); 1,700 pull requests; updated Mar 4, 2026
  • sglang-jax Public

    JAX backend for SGL

    Python; 244 stars; Apache-2.0; 73 forks; 92 issues (8 issues need help); 30 pull requests; updated Mar 4, 2026
  • sgl-kernel-xpu Public

    SGLang kernel library for Intel XPU

    Python; 18 stars; MIT; 18 forks; 0 issues; 12 pull requests; updated Mar 4, 2026
  • sgl-flash-attn Public Forked from Dao-AILab/flash-attention

    Fast and memory-efficient exact attention

    Python; 18 stars; BSD-3-Clause; 2,468 forks; 0 issues; 1 pull request; updated Mar 4, 2026
  • whl Public

    Kernel Library Wheel for SGLang

    HTML; 16 stars; MIT; 8 forks; 1 issue; 0 pull requests; updated Mar 4, 2026
  • sgl-docs Public
    MDX; 4 stars; Apache-2.0; 14 forks; 0 issues; 1 pull request; updated Mar 4, 2026
  • ome Public

    Open Model Engine (OME) — Kubernetes operator for LLM serving, GPU scheduling, and model lifecycle management. Works with SGLang, vLLM, TensorRT-LLM, and Triton

    Go; 383 stars; Apache-2.0; 61 forks; 33 issues (2 issues need help); 46 pull requests; updated Mar 3, 2026
  • sgl-cookbook Public

    Cookbook of SGLang - Recipe

    JavaScript; 88 stars; Apache-2.0; 38 forks; 4 issues (1 issue needs help); 12 pull requests; updated Mar 3, 2026
  • rbg Public

    A workload for deploying LLM inference services on Kubernetes

    Go; 177 stars; Apache-2.0; 45 forks; 23 issues; 20 pull requests; updated Mar 2, 2026