transformer_nuggets Public
Forked from drisspg/transformer_nuggets
A place to store reusable transformer components of my own creation or found on the interwebs
Python BSD 3-Clause "New" or "Revised" License Updated Dec 13, 2025
llm-scratch Public
Controlling every detail of LLM training, by building from the ground up.
Python Updated Dec 8, 2025
tilus Public
Forked from NVIDIA/tilus
Tilus is a tile-level kernel programming language with explicit control over shared memory and registers.
Python Apache License 2.0 Updated Dec 5, 2025
LeetCUDA Public
Forked from xlite-dev/LeetCUDA
📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉
Cuda GNU General Public License v3.0 Updated Nov 28, 2025
quack Public
Forked from Dao-AILab/quack
A Quirky Assortment of CuTe Kernels
Python Apache License 2.0 Updated Nov 21, 2025
llmq Public
Forked from IST-DASLab/llmq
Quantized LLM training in pure CUDA/C++.
C++ Apache License 2.0 Updated Nov 9, 2025
SkyRL Public
Forked from NovaSky-AI/SkyRL
SkyRL: A Modular Full-stack RL Library for LLMs
Python Apache License 2.0 Updated Nov 9, 2025
vllm Public
Forked from vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 Updated Nov 9, 2025
nixl Public
Forked from ai-dynamo/nixl
NVIDIA Inference Xfer Library (NIXL)
C++ Apache License 2.0 Updated Nov 7, 2025
pplx-garden Public
Forked from perplexityai/pplx-garden
Perplexity open source garden for inference technology
Rust MIT License Updated Nov 7, 2025
pplx-kernels Public
Forked from perplexityai/pplx-kernels
Perplexity GPU Kernels
C++ MIT License Updated Nov 7, 2025
DeepEP Public
Forked from deepseek-ai/DeepEP
DeepEP: an efficient expert-parallel communication library
Cuda MIT License Updated Nov 6, 2025
alphafold3 Public
Forked from google-deepmind/alphafold3
AlphaFold 3 inference pipeline.
Python Other Updated Nov 4, 2025
flash-attention Public
Forked from Dao-AILab/flash-attention
Fast and memory-efficient exact attention
Python BSD 3-Clause "New" or "Revised" License Updated Nov 3, 2025
nano-vllm Public
Forked from GeeeekExplorer/nano-vllm
Nano vLLM
Python MIT License Updated Nov 3, 2025
Liger-Kernel Public
Forked from linkedin/Liger-Kernel
Efficient Triton Kernels for LLM Training
Python BSD 2-Clause "Simplified" License Updated Nov 3, 2025
checkpoint-engine Public
Forked from MoonshotAI/checkpoint-engine
Python MIT License Updated Oct 31, 2025
Sana Public
Forked from NVlabs/Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
Python Apache License 2.0 Updated Oct 30, 2025
modded-nanogpt Public
Forked from KellerJordan/modded-nanogpt
NanoGPT (124M) in 3 minutes
Python MIT License Updated Oct 29, 2025
video-generation Public
Forked from meituan-longcat/LongCat-Video
Python MIT License Updated Oct 27, 2025
distributed-training Public
Forked from LambdaLabsML/distributed-training-guide
Python MIT License Updated Oct 22, 2025
jax-llm-examples Public
Forked from jax-ml/jax-llm-examples
Minimal yet performant LLM examples in pure JAX
Python Apache License 2.0 Updated Sep 23, 2025