Stars
Machine Learning Engineering Open Book
FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)
Tensors and Dynamic neural networks in Python with strong GPU acceleration
wentao.site / Hugo Template / A template repository for a Hugo-based blog
A PyTorch native platform for training generative AI models
A framework for efficient model inference with omni-modality models
verl: Volcano Engine Reinforcement Learning for LLMs
TPU inference for vLLM, with unified JAX and PyTorch support.
SkyRL: A Modular Full-stack RL Library for LLMs
Post-training with Tinker
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
A high-performance, lightweight router for large-scale vLLM deployments
Open-source implementation of AlphaEvolve
Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
A Datacenter Scale Distributed Inference Serving Framework
ArcticInference: vLLM plugin for high-throughput, low-latency inference
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
[ACL 2025 Long Main] Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions
NumPy aware dynamic Python compiler using LLVM
[NeurIPS 2025] A simple extension for vLLM that helps you speed up reasoning models without training.
A collection of GPT system prompts and various prompt injection/leaking knowledge.
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.