Stars
[NeurIPS 2025] Scaling Speculative Decoding with Lookahead Reasoning
A multi-turn RL training system with AgentTrainer for reinforcement learning of language models on games
A curated list of recent papers on efficient video attention for video diffusion models, including sparsification, quantization, and caching.
A Datacenter-Scale Distributed Inference Serving Framework
[NeurIPS 2025] A simple extension to vLLM that speeds up reasoning models without training.
A unified inference and post-training framework for accelerated video generation.
[NeurIPS 2024] Efficient LLM Scheduling by Learning to Rank
[ICML 2024] CLLMs: Consistency Large Language Models
[ICML 2024] Break the Sequential Dependency of LLM Inference Using Lookahead Decoding
A high-throughput and memory-efficient inference and serving engine for LLMs
Training and serving large-scale neural networks with auto parallelization.
An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
The source of the LMSYS website and blogs
Swarm training framework using Haiku + JAX + Ray for layer-parallel transformer language models on unreliable, heterogeneous nodes
Running large language models on a single GPU for throughput-oriented scenarios.
[OSDI 2023] AlpaServe: Statistical Multiplexing with Model Parallelism for Deep Learning Serving
[NeurIPS 2022] Automatically finding good model-parallel strategies, especially for complex models and clusters.
Cavs: An Efficient Runtime System for Dynamic Neural Networks
zhisbug / ray
Forked from ray-project/ray. An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
Code for "BayesAdapter: Being Bayesian, Inexpensively and Robustly, via Bayesian Fine-tuning"
Resource-adaptive cluster scheduler for deep learning training.
An end-to-end PyTorch framework for image and video classification