Skip to content
View leeyeehoo's full-sized avatar
😮
rushing
😮
rushing

Block or report leeyeehoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

slime is an LLM post-training framework for RL Scaling.

Python 2,950 356 Updated Dec 23, 2025

opensource self-hosted sandboxes for ai agent

Rust 4,228 187 Updated Nov 21, 2025

Democratizing Reinforcement Learning for LLMs

Python 4,894 468 Updated Dec 21, 2025

Minimal reproduction of DeepSeek R1-Zero

Python 12,509 1,533 Updated Apr 24, 2025
Python 4,245 459 Updated Jul 31, 2025

Training Large Language Model to Reason in a Continuous Latent Space

Python 1,409 154 Updated Aug 12, 2025

A free and strong UCI chess engine

C++ 14,344 2,712 Updated Dec 21, 2025

Entropy Based Sampling and Parallel CoT Decoding

Python 3,431 325 Updated Nov 13, 2024

Large Language Model Text Generation Inference

Python 10,711 1,247 Updated Dec 19, 2025
Python 296 26 Updated Jul 10, 2025

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

Python 396 39 Updated Apr 20, 2024

Reaching LLaMA2 Performance with 0.1M Dollars

Python 988 77 Updated Jul 23, 2024

YaRN: Efficient Context Window Extension of Large Language Models

Python 1,652 126 Updated Apr 17, 2024
Jupyter Notebook 204 17 Updated Dec 5, 2024

Doing simple retrieval from LLM models at various context lengths to measure accuracy

Jupyter Notebook 2,118 228 Updated Aug 17, 2024

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 481 30 Updated Mar 19, 2024

[ACL'24 Outstanding] Data and code for L-Eval, a comprehensive long context language models evaluation benchmark

Python 391 13 Updated Jul 9, 2024

Memory bandwidth efficient sparse tree attention

Python 2 Updated Feb 26, 2024

[NeurIPS 2024] KVQuant: Towards 10 Million Context Length LLM Inference with KV Cache Quantization

Python 395 38 Updated Aug 13, 2024

An Extensible Deep Learning Library

Python 2,303 391 Updated Dec 11, 2025

Easy control for Key-Value Constrained Generative LLM Inference(https://arxiv.org/abs/2402.06262)

Python 63 5 Updated Feb 13, 2024

[ICML'24 Spotlight] LLM Maybe LongLM: Self-Extend LLM Context Window Without Tuning

Python 663 61 Updated Jun 1, 2024

FlashInfer: Kernel Library for LLM Serving

Cuda 4,334 613 Updated Dec 23, 2025

Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more

Python 34,404 3,318 Updated Dec 23, 2025

llama and other large language models on iOS and MacOS offline using GGML library.

C 1,931 157 Updated Dec 9, 2025

Building a quick conversation-based search demo with Lepton AI.

TypeScript 8,126 1,023 Updated Dec 2, 2025
Jupyter Notebook 583 25 Updated Aug 23, 2024

TensorRT LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and supports state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. Tensor…

Python 12,451 1,973 Updated Dec 23, 2025

ELF: a platform for game research with AlphaGoZero/AlphaZero reimplementation

C++ 3,413 571 Updated Jun 21, 2019
Next