dblate

🐢

AI Infrastructure

yuhui dblate

🐢

AI Infrastructure

若一去不回，便一去不回

21 followers · 108 following

Baidu
Beijing, China

Achievements

Stars

sgl-project / rbg

A workload for deploying LLM inference services on Kubernetes

Go 75 19 Updated Oct 8, 2025

MoonshotAI / checkpoint-engine

Checkpoint-engine is a simple middleware to update model weights in LLM inference engines

Python 758 54 Updated Sep 30, 2025

galeselee / Awesome_LLM_System-PaperList

Since the emergence of chatGPT in 2022, the acceleration of Large Language Model has become increasingly important. Here is a list of papers on accelerating LLMs, currently focusing mainly on infer…

276 13 Updated Mar 6, 2025

James-QiuHaoran / LLM-serving-with-proxy-models

Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction | A tiny BERT model can tell you the verbosity of an LLM (with low latency overhead!)

Jupyter Notebook 44 7 Updated Jun 1, 2024

coin-or / pulp

A python Linear Programming API

Python 2,343 416 Updated Oct 6, 2025

vllm-project / aibrix

Cost-efficient and pluggable Infrastructure components for GenAI inference

Go 4,288 469 Updated Oct 9, 2025

huggingface / nanoVLM

The simplest, fastest repository for training/finetuning small-sized VLMs.

Python 4,099 393 Updated Sep 10, 2025

buger / goreplay

GoReplay is an open-source tool for capturing and replaying live HTTP traffic into a test environment in order to continuously test your system with real data. It can be used to increase confidence…

Go 19,137 76 Updated Apr 5, 2025

andy-yang-1 / DoubleSparse

16-fold memory access reduction with nearly no loss

Python 105 8 Updated Mar 26, 2025

LMCache / LMCache

Supercharge Your LLM with the Fastest KV Cache Layer

Python 5,501 627 Updated Oct 9, 2025

OpenDriveLab / End-to-end-Autonomous-Driving

[IEEE T-PAMI 2024] All you need for End-to-end Autonomous Driving

3,341 306 Updated Jul 2, 2025

mit-han-lab / Block-Sparse-Attention

A sparse attention kernel supporting mix sparse patterns

C++ 313 16 Updated Feb 13, 2025

tensorflow / tensor2tensor

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

Python 16,547 3,672 Updated Jun 2, 2023

masamasa59 / ai-agent-papers

A collection of AI Agents papers (Updated biweekly)

608 37 Updated Sep 28, 2025

facebookresearch / xformers

Hackable and optimized Transformers building blocks, supporting a composable construction.

Python 9,986 722 Updated Oct 6, 2025

bigai-nlco / VideoLLaMB

[ICCV 2025] Official Repository of VideoLLaMB: Long Video Understanding with Recurrent Memory Bridges

Python 77 2 Updated Feb 27, 2025

efeslab / Nanoflow

A throughput-oriented high-performance serving framework for LLMs

Jupyter Notebook 896 43 Updated Sep 17, 2025

LLM-Systems-Research / orca

Our Clone of Orca used for experimentation

Python 9 4 Updated Oct 15, 2024

ML-GSAI / LLaDA

Official PyTorch implementation for "Large Language Diffusion Models"

Python 3,014 201 Updated Sep 30, 2025

jonyzhang2023 / awesome-embodied-vla-va-vln

A curated list of state-of-the-art research in embodied AI, focusing on vision-language-action (VLA) models, vision-language navigation (VLN), and related multimodal learning approaches.

1,716 72 Updated Oct 9, 2025

yueen-ma / Awesome-VLA

321 13 Updated Apr 15, 2025

Kludex / starlette

The little ASGI framework that shines. 🌟

Python 11,529 1,043 Updated Oct 9, 2025

codecaution / Awesome-Mixture-of-Experts-Papers

A curated reading list of research in Mixture-of-Experts(MoE).

646 45 Updated Oct 30, 2024

sgl-project / sgl-learning-materials

Materials for learning SGLang

595 48 Updated Oct 1, 2025

OpenRLHF / OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework based on Ray (PPO & GRPO & REINFORCE++ & vLLM & Ray & Dynamic Sampling & Async Agentic RL)

Python 8,100 790 Updated Oct 9, 2025

lucasjinreal / AI-Infer-Engine-From-Zero

关于自建AI推理引擎的手册，从0开始你需要知道的所有事情

270 21 Updated Sep 8, 2022

triton-lang / triton

Development repository for the Triton language and compiler

MLIR 17,168 2,289 Updated Oct 10, 2025

kvcache-ai / Mooncake

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 4,069 389 Updated Oct 9, 2025

zhaochenyang20 / Awesome-ML-SYS-Tutorial

My learning notes/codes for ML SYS.

Python 3,823 232 Updated Oct 6, 2025

ai-dynamo / dynamo

A Datacenter Scale Distributed Inference Serving Framework

Rust 5,264 633 Updated Oct 10, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

yuhui dblate

Achievements

Achievements

Block or report dblate

Stars

sgl-project / rbg

MoonshotAI / checkpoint-engine

galeselee / Awesome_LLM_System-PaperList

James-QiuHaoran / LLM-serving-with-proxy-models

coin-or / pulp

vllm-project / aibrix

huggingface / nanoVLM

buger / goreplay

andy-yang-1 / DoubleSparse

LMCache / LMCache

OpenDriveLab / End-to-end-Autonomous-Driving

mit-han-lab / Block-Sparse-Attention

tensorflow / tensor2tensor

masamasa59 / ai-agent-papers

facebookresearch / xformers

bigai-nlco / VideoLLaMB

efeslab / Nanoflow

LLM-Systems-Research / orca

ML-GSAI / LLaDA

jonyzhang2023 / awesome-embodied-vla-va-vln

yueen-ma / Awesome-VLA

Kludex / starlette

codecaution / Awesome-Mixture-of-Experts-Papers

sgl-project / sgl-learning-materials

OpenRLHF / OpenRLHF

lucasjinreal / AI-Infer-Engine-From-Zero

triton-lang / triton

kvcache-ai / Mooncake

zhaochenyang20 / Awesome-ML-SYS-Tutorial

ai-dynamo / dynamo