Skip to content
Change the repository type filter

All

    Repositories list

    • sgl-project.github.io

      Public
      This is the documentation repository for SGLang. It is auto-generated from https://github.com/sgl-project/sglang/tree/main/docs.
      HTML
      249591Updated Dec 19, 2025Dec 19, 2025
    • sglang

      Public
      SGLang is a fast serving framework for large language models and vision language models.
      Python
      3.8k22k6621.1kUpdated Dec 19, 2025Dec 19, 2025
    • sglang-jax

      Public
      JAX backend for SGL
      Python
      472007018Updated Dec 19, 2025Dec 19, 2025
    • sgl-kernel-xpu

      Public
      SGLang kernel library for Intel XPU
      Python
      1315014Updated Dec 19, 2025Dec 19, 2025
    • sgl-kernel-npu

      Public
      SGLang kernel library for NPU
      C++
      61861321Updated Dec 19, 2025Dec 19, 2025
    • ome

      Public
      OME is a Kubernetes operator for enterprise-grade management and serving of Large Language Models (LLMs)
      Go
      503403220Updated Dec 19, 2025Dec 19, 2025
    • whl

      Public
      Kernel Library Wheel for SGLang
      HTML
      41611Updated Dec 19, 2025Dec 19, 2025
    • mini-sglang

      Public
      Python
      951.3k44Updated Dec 18, 2025Dec 18, 2025
    • sgl-cookbook

      Public
      Cookbook of SGLang - Recipe
      JavaScript
      63725Updated Dec 18, 2025Dec 18, 2025
    • SpecForge

      Public
      Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
      Python
      1195574916Updated Dec 18, 2025Dec 18, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      7782101Updated Dec 17, 2025Dec 17, 2025
    • sgl-test-files

      Public
      The test files for SGLang.
      3102Updated Dec 17, 2025Dec 17, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient Multi-head Latent Attention Kernels
      C++
      918000Updated Dec 16, 2025Dec 16, 2025
    • sgl-flash-attn

      Public
      Fast and memory-efficient exact attention
      Python
      2.2k1400Updated Dec 15, 2025Dec 15, 2025
    • sgl-learning-materials

      Public
      Materials for learning SGLang
      5069000Updated Dec 15, 2025Dec 15, 2025
    • rbg

      Public
      A workload for deploying LLM inference services on Kubernetes
      Go
      36140139Updated Dec 12, 2025Dec 12, 2025
    • genai-bench

      Public
      Genai-bench is a powerful benchmark tool designed for comprehensive token-level performance evaluation of large language model (LLM) serving systems.
      Python
      4224949Updated Dec 10, 2025Dec 10, 2025
    • fast-hadamard-transform

      Public
      Fast Hadamard transform in CUDA, with a PyTorch interface
      C
      49100Updated Oct 15, 2025Oct 15, 2025
    • sgl-whl

      Public
      SGLang wheels for multiple platforms
      11110Updated Oct 13, 2025Oct 13, 2025