Analyze the inference of Large Language Models (LLMs), covering aspects such as computation, storage, transmission, and the hardware roofline model, in a user-friendly interface.

Python 574 70 Updated Sep 11, 2024

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,740 288 Updated Aug 24, 2025

smart-lty / ParallelSpeculativeDecoding

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 128 7 Updated Oct 29, 2025

OpenManus / OpenManus-RL

A live-stream development of RL tuning for LLM agents

Python 3,590 499 Updated Oct 8, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,712 985 Updated Nov 6, 2025

lsdefine / simple_GRPO

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,438 110 Updated Aug 5, 2025

byungsoo-oh / ml-systems-papers

Curated collection of papers in machine learning systems

452 29 Updated Nov 8, 2025

MuLabPKU / TransMLA

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 407 22 Updated Sep 23, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,140 178 Updated Sep 3, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,699 441 Updated Nov 11, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,137 3,945 Updated Nov 10, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,784 279 Updated Aug 3, 2025

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,014 52 Updated Oct 25, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,411 162 Updated Mar 20, 2025

NVIDIA-NeMo / Curator

Scalable data preprocessing and curation toolkit for LLMs

Python 1,209 187 Updated Nov 10, 2025

RUCAIBox / Slow_Thinking_with_LLMs

A series of technical reports on Slow Thinking with LLMs

Python 744 41 Updated Aug 13, 2025

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 677 50 Updated Jan 20, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,268 421 Updated Nov 10, 2025

ShashankSubramanian / transformer-perf-estimates

Performance Estimates for Transformer AI Models in Science

Jupyter Notebook 9 1 Updated Oct 2, 2024

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

Python 536 49 Updated Dec 28, 2024

lil-lab / icrl

Python 29 2 Updated Feb 10, 2025

trotsky1997 / MathBlackBox

Python 1,035 108 Updated Dec 17, 2024

Zhuang Liu zhuango

Stars

microsoft / apex_plus

LMCache / LMCache

mutinifni / splitwise-sim

IBM / 3D-CiM-LLM-Inference-Simulator

microsoft / vidur

thunlp / TritonBench

codefuse-ai / Awesome-Code-LLM

HuangOwen / Awesome-LLM-Compression

hahnyuan / LLM-Viewer