Decoding Tree Sketching migrated to a vLLM backend: retroactive branching with PagedAttention and automatic prefix caching

Python 2 1 Updated Mar 28, 2026

TinyZeroWithSFT

Python 8 Updated Dec 9, 2025
Python 4 1 Updated Oct 31, 2025

RouterArena: An open framework for evaluating LLM routers with standardized datasets, metrics, an automated framework, and a live leaderboard.

Python 94 27 Updated Jun 13, 2026

[ICML 2026] Decoding Tree Sketching (DTS): a training-free, model-agnostic, plug-in framework for LLM parallel reasoning.

Python 70 12 Updated May 12, 2026

[NeurIPS 2025] VeriThinker: Learning to Verify Makes Reasoning Model Efficient

Python 66 1 Updated Sep 27, 2025

[COLM 2025] Official PyTorch implementation of "Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models"

Python 74 7 Updated Jul 8, 2025
Python 22 3 Updated Oct 3, 2024

Summer 2026 software engineering, data science, AI, quant, product management, and hardware internship postings. Updated daily by Simplify and Pitt CSC.

Python 44,903 3,181 Updated Jun 13, 2026