zhuzilin

🛏️

躺平躺平......

Zilin Zhu zhuzilin

🛏️

躺平躺平......

☀️ RL infra @Z.ai, ex WeChat AI

1.9k followers · 165 following

Z.ai
Beijing
21:46 (UTC +08:00)
https://www.zhihu.com/people/zhu-xiao-lin-22-96

Achievements

x4 x2 x3

Achievements

x4 x2 x3

sglang Public
Forked from sgl-project/sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 1 Apache License 2.0 Updated Jun 9, 2026
sgl-router Public

A fork of sgl-model-gateway for slime.

Rust 5 Other Updated Jun 8, 2026
blog Public

my blog~

JavaScript 3 MIT License Updated May 8, 2026
es Public archive

A JavaScript interpreter from scratch, supporting ES5 syntax.

C++ 30 6 GNU Affero General Public License v3.0 Updated Feb 10, 2026
Megatron-Bridge Public
Forked from fzyzcjy/Megatron-Bridge

Training library for Megatron-based models

Python Apache License 2.0 Updated Dec 22, 2025
asystem-amem Public
Forked from inclusionAI/asystem-amem

A NCCL extension library, designed to efficiently offload GPU memory allocated by the NCCL communication library.

C++ 1 Apache License 2.0 Updated Nov 27, 2025
ring-flash-attention Public

Ring attention implementation with flash attention

Python 1,025 99 MIT License Updated Sep 10, 2025
torch_memory_saver Public
Forked from fzyzcjy/torch_memory_saver

Allow torch tensor memory to be released and resumed later

Python 1 MIT License Updated Aug 22, 2025
pytorch-reloadable-pg Public

PyTorch Reloadable Process Group

Python 3 Updated Aug 13, 2025
pytorch-malloc Public

An external memory allocator example for PyTorch.

memory-allocator pytorch

C++ 16 3 Updated Aug 10, 2025
flash-attention-with-sink Public

Python 37 2 Updated Aug 7, 2025
lm-sys.github.io Public
Forked from lm-sys/lm-sys.github.io

JavaScript Other Updated Jul 9, 2025
torch_utils Public
Forked from fzyzcjy/torch_utils

Utility scripts for PyTorch

Python Updated Jul 5, 2025
cumem_allocator Public

Python 5 Updated Jun 23, 2025
mbridge Public
Forked from ISEEKYAN/mbridge

Python Other Updated Jun 17, 2025
zhuzilin Public

2 Updated May 2, 2025
Megatron-LM Public
Forked from NVIDIA/Megatron-LM

Ongoing research training transformer models at scale

Python Other Updated Mar 20, 2025
faster-nougat Public

Implementation of nougat that focuses on processing pdf locally.

Python 85 2 MIT License Updated Jan 15, 2025
OpenRLHF Public
Forked from OpenRLHF/OpenRLHF

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & Mixtral)

Python 3 Apache License 2.0 Updated Dec 20, 2024
pdf-with-its-own-md5 Public

A PDF template that contains its own MD5!

cryptography md5 hashquine md5-collisions

TeX 44 4 Updated Nov 28, 2024
vllm-group Public

Python 12 1 MIT License Updated Nov 5, 2024
unilm Public
Forked from microsoft/unilm

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 1 MIT License Updated Oct 9, 2024
transformers Public
Forked from huggingface/transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python Apache License 2.0 Updated Oct 7, 2024
vllm Public
Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python Apache License 2.0 Updated Sep 30, 2024
flash-attention Public
Forked from Dao-AILab/flash-attention

Fast and memory-efficient exact attention

Python 1 BSD 3-Clause "New" or "Revised" License Updated Sep 5, 2024
megablocks Public
Forked from databricks/megablocks

Python Apache License 2.0 Updated Aug 27, 2024
grouped_gemm Public
Forked from fanshiqing/grouped_gemm

PyTorch bindings for CUTLASS grouped GEMM.

Cuda Apache License 2.0 Updated Jul 18, 2024
aqt-pytorch Public

Python 7 Updated Jun 26, 2024
scattermoe Public
Forked from shawntan/scattermoe

Triton-based implementation of Sparse Mixture of Experts.

Python 1 Apache License 2.0 Updated Jun 12, 2024
instruct-eval Public
Forked from declare-lab/instruct-eval

This repository contains code to quantitatively evaluate instruction-tuned models such as Alpaca and Flan-T5 on held-out tasks.

Python Apache License 2.0 Updated Jan 18, 2024

Zilin Zhu zhuzilin

Achievements

Achievements

sglang Public

Uh oh!

sgl-router Public

Uh oh!

blog Public

Uh oh!

es Public archive

Uh oh!

Megatron-Bridge Public

Uh oh!

asystem-amem Public

Uh oh!

ring-flash-attention Public

Uh oh!

torch_memory_saver Public

Uh oh!

pytorch-reloadable-pg Public

Uh oh!

pytorch-malloc Public

Uh oh!

flash-attention-with-sink Public

Uh oh!

lm-sys.github.io Public

Uh oh!

torch_utils Public

Uh oh!

cumem_allocator Public

Uh oh!

mbridge Public

Uh oh!

zhuzilin Public

Uh oh!

Megatron-LM Public

Uh oh!

faster-nougat Public

Uh oh!

OpenRLHF Public

Uh oh!

pdf-with-its-own-md5 Public

Uh oh!

vllm-group Public

Uh oh!

unilm Public

Uh oh!

transformers Public

Uh oh!

vllm Public

Uh oh!

flash-attention Public

Uh oh!

megablocks Public

Uh oh!

grouped_gemm Public

Uh oh!

aqt-pytorch Public

Uh oh!

scattermoe Public

Uh oh!

instruct-eval Public

Uh oh!