FoolPlayer

Bin Jia FoolPlayer

MLSys | Computer Science Master @ NUS

57 followers · 17 following

ByteDance Seed
Singapore
08:26 (UTC +08:00)

Achievements

x3 x2

Achievements

x3 x2

Stars

8 stars written in Python

Clear filter

hpcaitech / ColossalAI

Making large AI models cheaper, faster and more accessible

Python 41,371 4,517 Updated Apr 27, 2026

NVIDIA / TensorRT-LLM

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 13,521 2,334 Updated Apr 30, 2026

microsoft / LLMLingua

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 6,086 369 Updated Apr 8, 2026