-
Penny Public
Hand-Rolled GPU communications library
-
sglang Public
Forked from sgl-project/sglangSGLang is a fast serving framework for large language models and vision language models.
Python Apache License 2.0 UpdatedOct 9, 2025 -
-
-
-
-
-
Mooncake Public
Forked from kvcache-ai/MooncakeMooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
C++ Apache License 2.0 UpdatedAug 12, 2025 -
DeepGEMM Public
Forked from deepseek-ai/DeepGEMMDeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
C++ MIT License UpdatedJul 30, 2025 -
reasoning-gym Public
Forked from open-thought/reasoning-gymprocedural reasoning datasets
Python Apache License 2.0 UpdatedJul 27, 2025 -
manim Public
Forked from 3b1b/manimAnimation engine for explanatory math videos
Python MIT License UpdatedJul 13, 2025 -
-
-
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
-
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
Python Apache License 2.0 UpdatedFeb 5, 2025 -
FastSoftmax Public
Step by step implementation of a fast softmax kernel in CUDA
-
-
-
-
-
-
tinygrad Public
Forked from tinygrad/tinygradYou like pytorch? You like micrograd? You love tinygrad! ❤️
Python MIT License UpdatedJul 30, 2024 -
diff-pdf Public
Forked from vslavik/diff-pdfA simple tool for visually comparing two PDF files
C++ GNU General Public License v2.0 UpdatedJul 5, 2024 -
-
gpuocelot Public
Forked from gpuocelot/gpuocelotGPUOcelot: A dynamic compilation framework for PTX
C++ BSD 3-Clause "New" or "Revised" License UpdatedJun 7, 2024 -
-
-
-