Stars
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Supercharge Your LLM with the Fastest KV Cache Layer
LLM serving cluster simulator
Simulator for LLM inference on an abstract 3D AIMC-based accelerator
A large-scale simulation framework for LLM inference
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton Operators
[TMLR] A curated list of language modeling research for code (and other software engineering activities), plus related datasets.
Awesome LLM compression research papers and tools.
Analyze the inference of Large Language Models (LLMs): computation, storage, transmission, and the hardware roofline model, in a user-friendly interface.
Official Repository of Absolute Zero Reasoner
[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length
Live-streamed development of RL tuning for LLM agents
DeepEP: an efficient expert-parallel communication library
A very simple GRPO implementation for reproducing R1-like LLM thinking.
Curated collection of papers in machine learning systems
TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)
Fully open data curation for reasoning models
Democratizing Reinforcement Learning for LLMs
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.
📰 Must-read papers and blogs on Speculative Decoding ⚡️
Scalable data preprocessing and curation toolkit for LLMs
A series of technical reports on Slow Thinking with LLMs
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)
One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
Performance Estimates for Transformer AI Models in Science
A recipe for online RLHF and online iterative DPO.