Unified multidimensional array model that collects nonrectangular shapes, advanced indexing, views and sparsity into a single set of composable abstractions

Python 11 Updated Mar 26, 2024

IST-DASLab / marlin

FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batch sizes of 16-32 tokens.

Python 962 82 Updated Sep 4, 2024

volcengine / veScale

Byted PyTorch Distributed for Hyperscale Training of LLMs and RLs

Python 910 53 Updated Nov 27, 2025

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 4,865 648 Updated Dec 23, 2025

NVIDIA / cutlass

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,999 1,592 Updated Dec 21, 2025

AnswerDotAI / fsdp_qlora

Training LLMs with QLoRA + FSDP

Jupyter Notebook 1,534 202 Updated Nov 9, 2024

gpu-mode / resource-stream

GPU programming related news and material links

1,876 110 Updated Sep 17, 2025

mistralai / mistral-inference

Official inference library for Mistral models

Jupyter Notebook 10,607 1,002 Updated Nov 21, 2025

terrytangyuan / distributed-ml-patterns

Distributed Machine Learning Patterns from Manning Publications by Yuan Tang https://bit.ly/2RKv8Zo

Python 482 47 Updated Sep 22, 2025

d2l-ai / d2l-en

Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.

Python 27,637 4,875 Updated Aug 18, 2024

apple / axlearn

An Extensible Deep Learning Library

Python 2,303 391 Updated Dec 11, 2025

ROCm / rccl

ROCm Communication Collectives Library (RCCL)

C++ 405 193 Updated Dec 20, 2025

ROCm / ROCm

AMD ROCm™ Software - GitHub Home

Shell 6,001 501 Updated Dec 22, 2025

NVIDIA-NeMo / NeMo

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,343 3,243 Updated Dec 22, 2025

Wei (Will) Feng weifengpy

Stars

facebookresearch / moodist

tile-ai / tilelang

ShawnZhong / compiler-explorer-triton

Dao-AILab / quack

DynamoRIO / dynamorio

flashinfer-ai / flashinfer

PKU-DAIR / Hetu-Galvatron

KellerJordan / modded-nanogpt

yifuwang / symm-mem-recipes

karpathy / llm.c

karpathy / LLM101n

thuml / depyf

HazyResearch / ThunderKittens

gpu-mode / awesomeMLSys

unslothai / unsloth

kaldi-asr / kaldi

bhosmer / fold