From scratch🛠️
Materials for the Hugging Face Diffusion Models Course
Distributed training (multi-node) of a Transformer model
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing otherwise seemingly hard concepts. Currently included IWSLT p…
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…
Yet another PyTorch implementation of Stable Diffusion (probably easy to read)
Creating a diffusion model from scratch in PyTorch to learn exactly how they work.
Attention Is All You Need | a PyTorch Tutorial to Transformers
PyTorch implementation of Google AI's 2018 BERT
A PyTorch-based educational framework for exploring deep learning
🌎 Machine learning tutorials (mainly in Python 3)
Stable Diffusion implemented from scratch in PyTorch
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
Implementation of https://srush.github.io/annotated-s4
A simple and efficient Mamba implementation in pure PyTorch and MLX.
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
Complete implementation of Llama 2, with and without KV cache, plus inference 🚀
This library generates more helpful exception messages for matrix algebra expressions in numpy, pytorch, jax, tensorflow, keras, and fastai.
Meshed-Memory Transformer for Image Captioning. CVPR 2020
A walkthrough of transformer architecture code
An annotated implementation of the Transformer paper.
Autograd to GPT-2 completely from scratch
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization (a toy sketch of the merge loop follows this list).
Repository of Jupyter notebook tutorials for teaching the Deep Learning Course at the University of Amsterdam (MSc AI), Fall 2023
Annotated version of the Mamba paper
An absolutely minimal implementation of a GPT-like transformer using only NumPy (<650 lines).
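For orientation on the BPE entry above: the core of BPE training is repeatedly counting adjacent token pairs and merging the most frequent one. The sketch below is a toy character-level version of that loop, not code from any of the listed repositories; `train_bpe`, its arguments, and the sample text are illustrative only (real tokenizers typically operate on bytes and handle pre-tokenization, special tokens, and ties more carefully).

```python
# Toy sketch of the BPE merge loop (illustrative only, not from the listed repos).
from collections import Counter

def train_bpe(text, num_merges):
    """Learn up to `num_merges` merges from `text`, starting from characters."""
    tokens = list(text)
    merges = []
    for _ in range(num_merges):
        # Count adjacent pairs and pick the most frequent one to merge.
        pairs = Counter(zip(tokens, tokens[1:]))
        if not pairs:
            break
        (a, b), _count = pairs.most_common(1)[0]
        merges.append((a, b))
        # Replace every occurrence of the pair (a, b) with the merged token.
        merged, i = [], 0
        while i < len(tokens):
            if i + 1 < len(tokens) and tokens[i] == a and tokens[i + 1] == b:
                merged.append(a + b)
                i += 2
            else:
                merged.append(tokens[i])
                i += 1
        tokens = merged
    return merges, tokens

merges, tokens = train_bpe("low lower lowest", num_merges=5)
print(merges)  # learned merge rules, e.g. [('l', 'o'), ('lo', 'w'), ...]
print(tokens)  # the text re-segmented with those merges applied
```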