WoosukKwon

Woosuk Kwon WoosukKwon

@Inferact | @vllm-project

1.3k followers · 234 following

Achievements

x4 x4 x3

Achievements

x4 x4 x3

Highlights

Stars

stas00 / ml-engineering

Machine Learning Engineering Open Book

Python 17,827 1,132 Updated Mar 16, 2026

FlashSampling / FlashSampling

FlashSampling: Fast and Memory-Efficient Exact Sampling (https://huggingface.co/papers/2603.15854)

Python 69 6 Updated Apr 25, 2026

Inferact / vllm-frontend-rs

Early-stage Rust drop-in alternative frontend for vLLM

Rust 26 1 Updated Apr 29, 2026

pytorch / pytorch

Tensors and Dynamic neural networks in Python with strong GPU acceleration

Python 99,529 27,619 Updated Apr 29, 2026

ai-dynamo / nixl

NVIDIA Inference Xfer Library (NIXL)

C++ 1,010 307 Updated Apr 29, 2026

yewentao256 / yewentao256.github.io

wentao.site / Hugo Template / A template repository for Hugo based blog

55 3 Updated Mar 21, 2026

pytorch / torchtitan

A PyTorch native platform for training generative AI models

Python 5,281 801 Updated Apr 29, 2026

vllm-project / vllm-omni

A framework for efficient model inference with omni-modality models

Python 4,556 855 Updated Apr 29, 2026

cornserve-ai / cornserve

Easy, Fast, and Scalable Multimodal AI

Python 124 9 Updated Apr 17, 2026

verl-project / verl

verl/HybridFlow: A Flexible and Efficient RL Post-Training Framework

Python 21,012 3,767 Updated Apr 29, 2026

vllm-project / tpu-inference

TPU inference for vLLM, with unified JAX and PyTorch support.

Python 306 172 Updated Apr 29, 2026

NovaSky-AI / SkyRL

SkyRL: A Modular Full-stack RL Library for LLMs

Python 1,795 311 Updated Apr 29, 2026

thinking-machines-lab / tinker-cookbook

Post-training with Tinker

Python 3,185 403 Updated Apr 29, 2026

hao-ai-lab / LookaheadReasoning

[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning

Python 68 7 Updated Oct 31, 2025

vllm-project / router

A high-performance and light-weight router for vLLM large scale deployment

Rust 212 73 Updated Apr 29, 2026

thinking-machines-lab / batch_invariant_ops

Python 998 76 Updated Nov 4, 2025

vllm-project / recipes

Common recipes to run vLLM

JavaScript 763 246 Updated Apr 29, 2026

algorithmicsuperintelligence / openevolve

Open-source implementation of AlphaEvolve

Python 6,110 978 Updated Mar 18, 2026

GeeeekExplorer / nano-vllm

Nano vLLM

Python 13,181 2,017 Updated Apr 26, 2026

jemalloc / jemalloc

C 10,839 1,614 Updated Apr 27, 2026

llm-d / llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell 3,100 442 Updated Apr 29, 2026

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,697 1,072 Updated Apr 29, 2026

snowflakedb / ArcticInference

ArcticInference: vLLM plugin for high-throughput, low-latency inference

Python 426 60 Updated Apr 23, 2026

XiaomiMiMo / MiMo

MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining

Python 2,082 88 Updated Jun 5, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Python 5,460 547 Updated Apr 28, 2026

JosephJeesungSuh / subpop

[ACL 2025 Long Main] Language Model Fine-Tuning on Scaled Survey Data for Predicting Distributions of Public Opinions

Python 43 7 Updated Apr 21, 2025

numba / numba

NumPy aware dynamic Python compiler using LLVM

Python 10,998 1,256 Updated Apr 28, 2026

hao-ai-lab / Dynasor

[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.

Python 228 31 Updated May 31, 2025

LouisShark / chatgpt_system_prompt

A collection of GPT system prompts and various prompt injection/leaking knowledge.

HTML 10,543 1,468 Updated Apr 23, 2026

facebookresearch / fairseq2

FAIR Sequence Modeling Toolkit 2

Python 1,128 140 Updated Apr 27, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Woosuk Kwon WoosukKwon

Achievements

Achievements

Highlights

Block or report WoosukKwon

Stars

stas00 / ml-engineering

FlashSampling / FlashSampling

Inferact / vllm-frontend-rs

pytorch / pytorch

ai-dynamo / nixl

yewentao256 / yewentao256.github.io

pytorch / torchtitan

vllm-project / vllm-omni

cornserve-ai / cornserve

verl-project / verl

vllm-project / tpu-inference

NovaSky-AI / SkyRL

thinking-machines-lab / tinker-cookbook

hao-ai-lab / LookaheadReasoning

vllm-project / router

thinking-machines-lab / batch_invariant_ops

vllm-project / recipes

algorithmicsuperintelligence / openevolve

GeeeekExplorer / nano-vllm

jemalloc / jemalloc

llm-d / llm-d

ai-dynamo / dynamo

snowflakedb / ArcticInference

XiaomiMiMo / MiMo

rllm-org / rllm

JosephJeesungSuh / subpop

numba / numba

hao-ai-lab / Dynasor

LouisShark / chatgpt_system_prompt

facebookresearch / fairseq2