-
Tencent
- Beijing
-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
-
-
-
kata-containers Public
Forked from kata-containers/kata-containersKata Containers is an open source project and community working to build a standard implementation of lightweight Virtual Machines (VMs) that feel and perform like containers, but provide the workl…
Rust Apache License 2.0 UpdatedSep 30, 2025 -
codex Public
Forked from openai/codexLightweight coding agent that runs in your terminal
Rust Apache License 2.0 UpdatedSep 29, 2025 -
torch_memory_saver Public
Forked from fzyzcjy/torch_memory_saverAllow torch tensor memory to be released and resumed later
Python MIT License UpdatedSep 28, 2025 -
-
-
-
nixl Public
Forked from ai-dynamo/nixlNVIDIA Inference Xfer Library (NIXL)
C++ Apache License 2.0 UpdatedSep 3, 2025 -
-
DeepEP Public
Forked from deepseek-ai/DeepEPDeepEP: an efficient expert-parallel communication library
Cuda MIT License UpdatedApr 22, 2025 -
perftest Public
Forked from linux-rdma/perftestInfiniband Verbs Performance Tests
C Other UpdatedApr 14, 2025 -
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Cuda MIT License UpdatedMar 17, 2025 -
-
-
-
flashinfer Public
Forked from flashinfer-ai/flashinferFlashInfer: Kernel Library for LLM Serving
Cuda Apache License 2.0 UpdatedNov 18, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedNov 8, 2024 -
-
-
triton Public
Forked from triton-lang/tritonDevelopment repository for the Triton language and compiler
C++ MIT License UpdatedMar 9, 2024 -
-
ring-flash-attention Public
Forked from zhuzilin/ring-flash-attentionRing attention implementation with flash attention
Python UpdatedFeb 26, 2024 -
-
-
ssh Public
Forked from gliderlabs/sshEasy SSH servers in Golang
Go BSD 3-Clause "New" or "Revised" License UpdatedFeb 2, 2024 -
-
sentencepiece Public
Forked from google/sentencepieceUnsupervised text tokenizer for Neural Network-based text generation.
C++ Apache License 2.0 UpdatedAug 30, 2023 -