Skip to content
View zetwhite's full-sized avatar
πŸ’ 
πŸ’ 

Organizations

@dead4s

Block or report zetwhite

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Jupyter Notebook 1,576 100 Updated Nov 5, 2025

Minimal yet performant LLM examples in pure JAX

Python 217 29 Updated Dec 4, 2025

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 769 112 Updated Dec 16, 2025

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 24,748 2,205 Updated Dec 19, 2025

AGENTS.md β€” a simple, open format for guiding coding agents

TypeScript 12,587 911 Updated Dec 19, 2025

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,521 432 Updated Dec 18, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 21,723 3,808 Updated Dec 19, 2025

γ€ŠC++ 17 The Complete Guide》- ηΏ»θ―‘δΈ­

512 65 Updated Mar 1, 2023

Nano vLLM

Python 9,777 1,230 Updated Nov 3, 2025

πŸ’₯ Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,321 1,007 Updated Dec 16, 2025

Inference Llama 2 in one file of pure πŸ”₯

Mojo 2,115 136 Updated Nov 30, 2025

Cross-platform, customizable ML solutions for live and streaming media.

C++ 32,529 5,663 Updated Dec 19, 2025

The Modular Platform (includes MAX & Mojo)

Mojo 25,366 2,743 Updated Dec 18, 2025

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,566 926 Updated Jul 8, 2025

A primitive library for neural network

C++ 1,370 223 Updated Nov 24, 2024

A python library for converting Pytorch modules into a circle model that is a lightweight and efficient representation in ONE designed for optimized on-device neural network inference.

Python 16 22 Updated Dec 5, 2025

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,200 746 Updated Dec 12, 2025

Dynamic Memory Management for Serving LLMs without PagedAttention

C 448 34 Updated May 30, 2025

Display and control your Android device

C 132,743 12,394 Updated Dec 17, 2025

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,213 8,579 Updated Nov 12, 2025

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,432 448 Updated Apr 24, 2023

Simple implementation of Speculative Sampling in NumPy for GPT-2.

Python 98 9 Updated Aug 20, 2023

LLM101n: Let's build a Storyteller

35,898 1,961 Updated Aug 1, 2024

γ€Žγ‚Όγƒ­γ‹γ‚‰δ½œγ‚‹ Deep Learning ❺』(O'Reilly Japan, 2024)

Jupyter Notebook 396 105 Updated Sep 30, 2024

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,674 189 Updated Jun 25, 2024

A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

Python 186 43 Updated Dec 8, 2022

Fine-tuning & Reinforcement Learning for LLMs. πŸ¦₯ Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,612 4,090 Updated Dec 18, 2025

Go ahead and axolotl questions

Python 10,964 1,221 Updated Dec 18, 2025

A flexible distributed key-value database that is optimized for caching and other realtime workloads.

C 24,037 970 Updated Dec 18, 2025
Next