Starred repositories (Demon-Sheriff)

Tenstorrent MLIR compiler

MLIR 248 107 Updated Feb 10, 2026

a teaching deep learning framework: the bridge from micrograd to tinygrad

Python 54 5 Updated Feb 10, 2026
Python 98 6 Updated Feb 10, 2026

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

Jupyter Notebook 4,681 413 Updated Apr 3, 2024

Introduction to Machine Learning Systems

JavaScript 18,036 2,102 Updated Feb 10, 2026

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,639 463 Updated Oct 27, 2025

MoE training for Me and You and maybe other people

Python 353 30 Updated Feb 7, 2026

Comprehensive guide, algorithms and tools on distributed systems

Go 232 18 Updated Aug 18, 2025

Writing custom Linear Algebra and ML kernels in CUDA to outperform pytorch, cuBLAS, numpy

Cuda 2 Updated Jul 12, 2025

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 854 104 Updated Feb 10, 2026

Hand-Rolled GPU communications library

Cuda 84 6 Updated Nov 25, 2025

Tile primitives for speedy kernels

Cuda 3,133 236 Updated Feb 10, 2026

Continuous Thought Machines, because thought takes time and reasoning is a process.

Python 1,758 273 Updated Dec 29, 2025

Training framework with a goal to explore the frontier of sample efficiency of small language models

Python 98 10 Updated Jan 25, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 23,482 4,390 Updated Feb 10, 2026

biasing the universal tokenizer and an attempt to optimize compression rates in multilingual compression

Python 5 1 Updated Dec 29, 2025

a parallel and minimal implementation of Byte Pair Encoding (BPE) from scratch in less than 200 lines of python.

Jupyter Notebook 3 Updated Aug 30, 2025

custom flash attention kernel in cuda to benchmark it against torch and burn my rtx 3050

Cuda 1 Updated Aug 26, 2025

Async RL Training at Scale

Python 1,058 199 Updated Feb 10, 2026

List of papers related to neural network quantization in recent AI conferences and journals.

797 59 Updated Mar 27, 2025

The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

LLVM 36,861 16,091 Updated Feb 10, 2026

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 52,858 8,951 Updated Nov 12, 2025

You like pytorch? You like micrograd? You love tinygrad! ❤️

Python 31,339 3,882 Updated Feb 10, 2026

Course 18.S191 at MIT, Fall 2022 - Introduction to computational thinking with Julia

Julia 2,784 495 Updated Dec 19, 2025

A Lightweight Recommendation System

Python 9,267 715 Updated Oct 13, 2025

A Complete Resource to Master Graduate-Level GenAI Mathematics

CSS 11 3 Updated Aug 12, 2025

Learn ML engineering for free in 4 months! Register here 👇🏼

Jupyter Notebook 12,618 2,870 Updated Dec 27, 2025

🍕 AI agent that calls to order pizza for you

Python 8 2 Updated Oct 29, 2023

Cleanlab's open-source library is the standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.

Python 11,310 882 Updated Jan 13, 2026