PhD student in Computer Science at TSAIL Group, Tsinghua University, @thu-ml.
Interested in pretraining, optimization, theory for LLMs.
-
@thu-ml, Tsinghua University
- Beijing, China
-
03:20
(UTC +09:00) - https://bingrui-li.github.io/
- @bingruili_
- @bingruil.bsky.social
Stars
3
results
for sponsorable starred repositories
written in Python
Clear filter
A high-throughput and memory-efficient inference and serving engine for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
Train to 94% on CIFAR-10 in <6.3 seconds on a single A100. Or ~95.79% in ~110 seconds (or less!)