long-context

Here are 164 public repositories matching this topic...

lucidrains / perceiver-ar-pytorch

Implementation of Perceiver AR, DeepMind's new long-context attention network based on the Perceiver architecture, in PyTorch

deep-learning transformer artficial-intelligence attention-mechanism long-context

Updated Apr 10, 2023
Python

lucidrains / flash-genomics-model

My own attempt at a long context genomics model, leveraging recent advances in long context attention modeling (Flash Attention + other hierarchical methods)

deep-learning genomics transformers artificial-intelligence attention-mechanisms long-context

Updated Jul 2, 2023
Python

4AI / RAN

RAN: Recurrent Attention Networks for Long-text Modeling | Findings of ACL23

acl recurrent-networks long-context long-context-attention acl2023 long-context-transformers long-document-modeling recurrent-attention-networks

Updated Aug 12, 2023
Python

yangjianxin1 / LongQLoRA

LongQLoRA: Extend Context Length of LLMs Efficiently

lora llm long-context qlora longlora

Updated Nov 12, 2023
Python

asigalov61 / Heptabit-Music-Transformer

[DEPRECATED] Very fast, large music transformer with 8k sequence length, efficient heptabit MIDI note encoding, true full MIDI instrument range, chord counters, and outro tokens

midi artificial-intelligence heptagram heptagon music-transformer music-ai sota-model long-context heptabit

Updated Nov 23, 2023
Python

lucaslingle / e-lra

Streamlined variant of Long-Range Arena with pinned dependencies, automated data downloads, and deterministic shuffling.

transformers long-context long-range-arena

Updated Jan 9, 2024
Python

nopperl / corporate_emission_reports

Finetuning and evaluating LLMs to extract GHG emissions from PDF reports using RAG and grammar-based decoding.

evaluation information-extraction data-extraction lora llm long-context

Updated Mar 22, 2024
TeX

davendw49 / Awesome-Long-Context-Language-Modeling

Papers on Long-Context Language Modeling

nlp awesome-list llm long-context

Updated Mar 28, 2024

melvinebenezer / Liah-Lie_in_a_haystack

needle in a haystack for LLMs

needle-in-haystack llm long-context llm-inference llms-benchmarking

Updated Apr 15, 2024
Python

thunlp / InfLLM

The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

large-language-models llm long-context training-free

Updated Apr 20, 2024
Python

dingo-actual / infini-transformer

PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

deep-learning transformers pytorch attention-mechanism long-context infini-attention mixture-of-depths

Updated May 4, 2024
Python

"Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" by Zhenyu Zhang, Runjin Chen, Shiwei Liu, Zhewei Yao, Olatunji Ruwase, Beidi Chen, Xiaoxia Wu, and Zhangyang Wang.

positional-encoding large-language-models long-context lost-in-the-middle