

[ICML 2025] TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation

Python 124 10 Updated May 19, 2025
Python 320 36 Updated Jul 10, 2025
Shell 47 5 Updated Jun 20, 2024

📰 Must-read papers and blogs on LLM-based Long Context Modeling 🔥

2,121 96 Updated Jun 13, 2026

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

HTML 35 3 Updated Feb 26, 2026

Multilingual Corpus of Web Fiction

Ruby 205 9 Updated Jun 28, 2024

Efficient LLM Inference over Long Sequences

Python 395 23 Updated Jun 25, 2025

The HELMET Benchmark

Jupyter Notebook 217 42 Updated Apr 17, 2026

LOFT: A 1 Million+ Token Long-Context Benchmark

Python 233 17 Updated Apr 13, 2026

Awesome List of Attention Modules and Plug&Play Modules in Computer Vision

Python 1,271 172 Updated May 11, 2023

[ICML 2024] Quest: Query-Aware Sparsity for Efficient Long-Context LLM Inference

Cuda 393 49 Updated Jul 10, 2025

This repo contains the source code for RULER: What’s the Real Context Size of Your Long-Context Language Models?

Python 1,561 129 Updated Nov 13, 2025

Unofficial PyTorch/🤗Transformers(Gemma/Llama3) implementation of Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention

Python 376 35 Updated Apr 23, 2024

Acceptance rates for the major AI conferences

Jupyter Notebook 4,754 315 Updated Sep 23, 2025

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 760 52 Updated Sep 27, 2024

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

Python 547 36 Updated May 16, 2025

Extend existing LLMs way beyond the original training length with constant memory usage, without retraining

Python 736 45 Updated Apr 10, 2024

Ring attention implementation with flash attention

Python 1,025 99 Updated Sep 10, 2025

Large Context Attention

Python 773 53 Updated Oct 13, 2025

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,793 1,174 Updated Apr 8, 2026

Code for the paper "Evaluating Large Language Models Trained on Code"

Python 3,259 444 Updated Jan 17, 2025

From Chain-of-Thought prompting to OpenAI o1 and DeepSeek-R1 🍓

3,632 211 Updated Apr 20, 2026

Beyond the Imitation Game collaborative benchmark for measuring and extrapolating the capabilities of language models

Python 3,247 617 Updated Jul 19, 2024

Self-Supervised Speech Pre-training and Representation Learning Toolkit

Python 2,554 536 Updated Mar 12, 2026

This is the official implementation of "Progressive-Hint Prompting Improves Reasoning in Large Language Models"

Python 209 14 Updated Oct 11, 2023

Adaptive Experimentation Platform

Python 2,767 371 Updated Jun 9, 2026

The agent engineering platform.

Python 139,176 23,073 Updated Jun 13, 2026

Reproducible code for the paper "qEUBO: A Decision-Theoretic Acquisition Function for Preferential Bayesian Optimization" from AISTATS 2023

Python 22 4 Updated Mar 24, 2023