Skip to content
View qts0312's full-sized avatar
  • Peking University
  • Beijing
  • 23:05 (UTC +08:00)

Highlights

  • Pro

Block or report qts0312

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Pretty good call graphs for dynamic languages

Python 4,552 329 Updated Jul 27, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 20,258 3,518 Updated Mar 27, 2026

技术面试最后反问面试官的话

18,394 1,381 Updated Mar 4, 2024

Systems of Reinforcement Learning

7 Updated Nov 21, 2025

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

Python 16,994 3,389 Updated Mar 27, 2026

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 74,518 14,853 Updated Mar 27, 2026

Miles is an enterprise-facing reinforcement learning framework for LLM and VLM post-training, forked from and co-evolving with slime.

Python 1,021 140 Updated Mar 27, 2026

slime is an LLM post-training framework for RL Scaling.

Python 5,001 669 Updated Mar 27, 2026

Repository used to store the GKE AI Labs content for "Tutorials and Examples" section

Python 11 26 Updated Mar 19, 2026

Amazons implementation example

C++ 1 Updated Nov 15, 2025

Amazon Chess Program for Data Structures course design.

C++ 3 Updated Apr 29, 2024

北京大学计算机基础能力手册

TeX 479 35 Updated Mar 21, 2026

Optimized primitives for collective multi-GPU communication

C++ 4,565 1,186 Updated Mar 25, 2026

Verified Rust for low-level systems code

Rust 2,387 156 Updated Mar 27, 2026

For reproducing RPC transient errors ...

Go 1 Updated Nov 7, 2025

Codes & examples for "CUDA - From Correctness to Performance"

C++ 125 23 Updated Oct 24, 2024

Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels

Python 5,433 488 Updated Mar 27, 2026

A curated reading list for machine learning reliability research and practice

29 2 Updated Sep 18, 2025

Nano vLLM

Python 12,467 1,796 Updated Nov 3, 2025

A course of learning LLM inference serving on Apple Silicon for systems engineers: build a tiny vLLM + Qwen.

Python 4,035 296 Updated Mar 26, 2026

Collections of SysY language testcases.

C 13 1 Updated Apr 10, 2024

TiDB - the open-source, cloud-native, distributed SQL database designed for modern applications.

Go 39,931 6,158 Updated Mar 27, 2026

📰 Must-read papers and blogs on Speculative Decoding ⚡️

1,164 69 Updated Mar 9, 2026

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 25,113 5,035 Updated Mar 27, 2026

Disaggregated serving system for Large Language Models (LLMs).

Jupyter Notebook 794 89 Updated Apr 6, 2025

A Datacenter Scale Distributed Inference Serving Framework

Rust 6,412 961 Updated Mar 27, 2026

FlashInfer: Kernel Library for LLM Serving

Python 5,227 830 Updated Mar 27, 2026
Python 16 1 Updated Mar 11, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 17,069 1,367 Updated Mar 25, 2026

PyTorch library for cost-effective, fast and easy serving of MoE models.

Python 289 25 Updated Mar 25, 2026
Next