zetwhite

Follow

💠

SeungHui Youn zetwhite

💠

Follow

44 followers · 76 following

SeungHui Company

Achievements

Achievements

Organizations

Lists (1)

Sort

🚀 My stack

Stars

google-deepmind / open_x_embodiment

Jupyter Notebook 1,576 100 Updated Nov 5, 2025

jax-ml / jax-llm-examples

Minimal yet performant LLM examples in pure JAX

Python 217 29 Updated Dec 4, 2025

jax-ml / scaling-book

Home for "How To Scale Your Model", a short blog-style textbook about scaling LLMs on TPUs

HTML 769 112 Updated Dec 16, 2025

block / goose

an open source, extensible AI agent that goes beyond code suggestions - install, execute, edit, and test with any LLM

Rust 24,748 2,205 Updated Dec 19, 2025

agentsmd / agents.md

AGENTS.md — a simple, open format for guiding coding agents

TypeScript 12,587 911 Updated Dec 19, 2025

quic / aimet

AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.

Python 2,521 432 Updated Dec 18, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 21,723 3,808 Updated Dec 19, 2025

CnTransGroup / Cpp17TheCompleteGuideChinese

《C++ 17 The Complete Guide》- 翻译中

512 65 Updated Mar 1, 2023

GeeeekExplorer / nano-vllm

Nano vLLM

Python 9,777 1,230 Updated Nov 3, 2025

huggingface / tokenizers

💥 Fast State-of-the-Art Tokenizers optimized for Research and Production

Rust 10,321 1,007 Updated Dec 16, 2025

tairov / llama2.mojo

Inference Llama 2 in one file of pure 🔥

Mojo 2,115 136 Updated Nov 30, 2025

google-ai-edge / mediapipe

Cross-platform, customizable ML solutions for live and streaming media.

C++ 32,529 5,663 Updated Dec 19, 2025

modular / modular

The Modular Platform (includes MAX & Mojo)

Mojo 25,366 2,743 Updated Dec 18, 2025

vwxyzjn / cleanrl

High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)

Python 8,566 926 Updated Jul 8, 2025

OpenPPL / ppl.nn

A primitive library for neural network

C++ 1,370 223 Updated Nov 24, 2024

jeho-lee / Awesome-On-Device-AI-Systems

101 2 Updated Nov 24, 2025

Samsung / TICO

A python library for converting Pytorch modules into a circle model that is a lightweight and efficient representation in ONE designed for optimized on-device neural network inference.

Python 16 22 Updated Dec 5, 2025

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 10,200 746 Updated Dec 12, 2025

microsoft / vattention

Dynamic Memory Management for Serving LLMs without PagedAttention

C 448 34 Updated May 30, 2025

Genymobile / scrcpy

Display and control your Android device

C 132,743 12,394 Updated Dec 17, 2025

karpathy / nanoGPT

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 51,213 8,579 Updated Nov 12, 2025

jaymody / picoGPT

An unnecessarily tiny implementation of GPT-2 in NumPy.

Python 3,432 448 Updated Apr 24, 2023

jaymody / speculative-sampling

Simple implementation of Speculative Sampling in NumPy for GPT-2.

Python 98 9 Updated Aug 20, 2023

karpathy / LLM101n

LLM101n: Let's build a Storyteller

35,898 1,961 Updated Aug 1, 2024

oreilly-japan / deep-learning-from-scratch-5

『ゼロから作る Deep Learning ❺』(O'Reilly Japan, 2024)

Jupyter Notebook 396 105 Updated Sep 30, 2024

FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads

Jupyter Notebook 2,674 189 Updated Jun 25, 2024

awjuliani / pytorch-diffusion

A basic PyTorch implementation of 'Denoising Diffusion Probabilistic Models'

Python 186 43 Updated Dec 8, 2022

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 49,612 4,090 Updated Dec 18, 2025

axolotl-ai-cloud / axolotl

Go ahead and axolotl questions

Python 10,964 1,221 Updated Dec 18, 2025

valkey-io / valkey

A flexible distributed key-value database that is optimized for caching and other realtime workloads.

C 24,037 970 Updated Dec 18, 2025