Starred repositories
UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)
FlashInfer: Kernel Library for LLM Serving
Supercharge Your LLM with the Fastest KV Cache Layer
Quilt is a serverless optimizer that automatically merges workflows consisting of many functions (possibly in different languages) into one process, thereby avoiding high invocation latency, commu…
This is a public version of LASER: A Neuro-Symbolic Framework for Learning Spatial-Temporal Scene Graphs with Weak Supervision
Virtualized Elastic KV Cache for Dynamic GPU Sharing and Beyond
Achieve state-of-the-art inference performance with modern accelerators on Kubernetes
TokenSim is a tool for simulating the behavior of large language models (LLMs) in a distributed environment.
Train speculative decoding models effortlessly and port them smoothly to SGLang serving.
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Code for "LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding", ACL 2024
Serverless LLM Serving for Everyone.
Large Language Model Text Generation Inference
Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023 Long)
Artifact for "Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving" [SOSP '24]
EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).
A large-scale simulation framework for LLM inference
A high-throughput and memory-efficient inference and serving engine for LLMs