Stars
slime is an LLM post-training framework for RL Scaling.
Open-source self-hosted sandboxes for AI agents
Minimal reproduction of DeepSeek R1-Zero
Training Large Language Model to Reason in a Continuous Latent Space
A free and strong UCI chess engine
Entropy Based Sampling and Parallel CoT Decoding
Large Language Model Text Generation Inference
The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"
Reaching LLaMA2 Performance with 0.1M Dollars
YaRN: Efficient Context Window Extension of Large Language Models
Doing simple retrieval from LLM models at various context lengths to measure accuracy
Implementation of paper Data Engineering for Scaling Language Models to 128K Context
[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark
Memory bandwidth efficient sparse tree attention
[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization
Easy control for Key-Value Constrained Generative LLM Inference (https://arxiv.org/abs/2402.06262)
[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning
FlashInfer: Kernel Library for LLM Serving
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Run LLaMA and other large language models on iOS and macOS offline using the GGML library.
Building a quick conversation-based search demo with Lepton AI.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs.
ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation