Skip to content
View TheoBoyer's full-sized avatar
👨‍🍳
👨‍🍳

Block or report TheoBoyer

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Muon is an optimizer for hidden layers in neural networks

Python 2,111 99 Updated Nov 23, 2025

NanoGPT-speedrunning for the poor T4 enjoyers

Python 73 11 Updated Apr 22, 2025

easiest way to calculate gns in pytorch

Python 9 Updated Mar 17, 2025

A bidirectional pipeline parallelism algorithm for computation-communication overlap in DeepSeek V3/R1 training.

Python 2,887 310 Updated Mar 10, 2025

nanoGRPO is a lightweight implementation of Group Relative Policy Optimization (GRPO)

Python 136 9 Updated May 8, 2025

A Neural network layer able to express distributions over anything

TeX 1 Updated Jan 29, 2025

Establishing Scaling Laws for Crypto Market Forecasting

Python 1 Updated Mar 19, 2025

CUDA Templates and Python DSLs for High-Performance Linear Algebra

C++ 8,982 1,586 Updated Dec 18, 2025

LLM training in simple, raw C/CUDA

Cuda 28,415 3,332 Updated Jun 26, 2025

Mamba SSM architecture

Python 16,756 1,541 Updated Nov 11, 2025

Conformal prediction for time-series applications.

Jupyter Notebook 130 7 Updated Nov 30, 2023
Python 31 Updated Aug 13, 2023

Inference Llama 2 in one file of pure C

C 19,033 2,428 Updated Aug 6, 2024

[Unmaintained, see README] An ecosystem of Rust libraries for working with large language models

Rust 6,140 370 Updated Jun 24, 2024

RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…

Python 14,228 981 Updated Dec 17, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,984 2,215 Updated Jul 29, 2024

Series of lectures on Scientific Methodology and Performance Evaluation

HTML 71 23 Updated Dec 18, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 30,887 3,778 Updated Dec 18, 2025

Tez is a super-simple and lightweight Trainer for PyTorch. It also comes with many utils that you can use to tackle over 90% of deep learning projects in PyTorch.

Python 1,159 143 Updated Jan 29, 2023

A set of tools to play with deep learning

Python 26 6 Updated Nov 21, 2024

Implementation of flat mnist Generative Adversiarial Neural Network using low level features of tfjs

JavaScript 1 Updated Dec 15, 2018