Analyze the inference of Large Language Models (LLMs). Analyze aspects like computation, storage, transmission, and hardware roofline model in a user-friendly interface.

Python 573 70 Updated Sep 11, 2024

LeapLabTHU / Absolute-Zero-Reasoner

Official Repository of Absolute Zero Reasoner

Python 1,737 290 Updated Aug 24, 2025

smart-lty / ParallelSpeculativeDecoding

[ICLR 2025] PEARL: Parallel Speculative Decoding with Adaptive Draft Length

Python 125 7 Updated Oct 29, 2025

OpenManus / OpenManus-RL

A live-stream development of RL tuning for LLM agents

Python 3,580 498 Updated Oct 8, 2025

deepseek-ai / DeepEP

DeepEP: an efficient expert-parallel communication library

Cuda 8,697 976 Updated Nov 6, 2025

lsdefine / simple_GRPO

A very simple GRPO implementation for reproducing R1-like LLM thinking.

Python 1,432 109 Updated Aug 5, 2025

byungsoo-oh / ml-systems-papers

Curated collection of papers in machine learning systems

448 29 Updated Oct 4, 2025

MuLabPKU / TransMLA

TransMLA: Multi-Head Latent Attention Is All You Need (NeurIPS 2025 Spotlight)

Python 407 22 Updated Sep 23, 2025

open-thoughts / open-thoughts

Fully open data curation for reasoning models

Python 2,136 177 Updated Sep 3, 2025

rllm-org / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 4,685 440 Updated Nov 4, 2025

unslothai / unsloth

Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek-R1, Qwen3, Gemma 3, TTS 2x faster with 70% less VRAM.

Python 48,021 3,931 Updated Nov 7, 2025

hkust-nlp / simpleRL-reason

Simple RL training for reasoning

Python 3,783 279 Updated Aug 3, 2025

hemingkx / SpeculativeDecodingPapers

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,007 52 Updated Oct 25, 2025

Unakar / Logic-RL

Reproduce R1 Zero on Logic Puzzle

Python 2,410 162 Updated Mar 20, 2025

NVIDIA-NeMo / Curator

Scalable data preprocessing and curation toolkit for LLMs

Python 1,202 187 Updated Nov 7, 2025

RUCAIBox / Slow_Thinking_with_LLMs

A series of technical reports on Slow Thinking with LLMs

Python 743 41 Updated Aug 13, 2025

THUDM / ReST-MCTS

ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search (NeurIPS 2024)

Python 676 50 Updated Jan 20, 2025

EvolvingLMMs-Lab / lmms-eval

One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks

Python 3,263 420 Updated Nov 7, 2025

ShashankSubramanian / transformer-perf-estimates

Performance Estimates for Transformer AI Models in Science

Jupyter Notebook 9 1 Updated Oct 2, 2024

RLHFlow / Online-RLHF

A recipe for online RLHF and online iterative DPO.

Python 536 49 Updated Dec 28, 2024

lil-lab / icrl

Python 29 2 Updated Feb 10, 2025

trotsky1997 / MathBlackBox

Python 1,035 108 Updated Dec 17, 2024

Zhuang Liu zhuango

Stars

microsoft / apex_plus

LMCache / LMCache

mutinifni / splitwise-sim

IBM / 3D-CiM-LLM-Inference-Simulator

microsoft / vidur

thunlp / TritonBench

codefuse-ai / Awesome-Code-LLM

HuangOwen / Awesome-LLM-Compression

hahnyuan / LLM-Viewer