# Causal Attention with Lookahead Keys
## Install

```bash
$ pip install lookahead-keys-attention
```

## Usage

```python
import torch
from lookahead_keys_attention import Castle
# lookahead keys attention
model = Castle(
    dim = 512,         # input dimension
    heads = 8,         # number of attention heads
    dim_head = 64,     # dimension per head
    use_triton = None  # automatically set to True if CUDA and Triton are available, but can be forced
).cuda()
seq = torch.randn(2, 128, 512).cuda()
# parallel
parallel_output = model(seq) # (batch_size, seq_len, dim)
# sequential
cache = None
outputs = []
for token in seq.unbind(dim = 1):
    output, cache = model(token, cache = cache, return_next_cache = True)
    outputs.append(output)
seq_output = torch.cat(outputs, dim = 1)
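# decoding token by token with the cache should reproduce the parallel forward pass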
assert torch.allclose(parallel_output, seq_output, atol = 1e-3)
```
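Since `Castle` maps `(batch_size, seq_len, dim)` to `(batch_size, seq_len, dim)`, it can stand in for a regular causal self-attention layer. Below is a minimal sketch of a pre-norm transformer block built around it; the `CastleBlock` class, the feedforward sub-module, and all hyperparameters are illustrative assumptions and not part of this package.

```python
import torch
from torch import nn
from lookahead_keys_attention import Castle

# hypothetical block: pre-norm Castle attention followed by a feedforward, each with a residual
class CastleBlock(nn.Module):
    def __init__(self, dim = 512, heads = 8, dim_head = 64, ff_mult = 4):
        super().__init__()
        self.attn_norm = nn.LayerNorm(dim)
        self.attn = Castle(dim = dim, heads = heads, dim_head = dim_head)

        self.ff_norm = nn.LayerNorm(dim)
        self.ff = nn.Sequential(
            nn.Linear(dim, dim * ff_mult),
            nn.GELU(),
            nn.Linear(dim * ff_mult, dim)
        )

    def forward(self, x):
        x = self.attn(self.attn_norm(x)) + x
        x = self.ff(self.ff_norm(x)) + x
        return x

block = CastleBlock().cuda()

out = block(torch.randn(2, 128, 512).cuda()) # (2, 128, 512)
```

A full language model would stack several such blocks; for autoregressive decoding, the cached sequential call shown above would need to be threaded through each block's attention, which this sketch omits.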
## Training

Make sure `uv` is installed (`pip install uv`)

Then

```bash
$ uv run train_triton.py
```

## Citations

```bibtex
@inproceedings{Song2025CausalAW,
    title  = {Causal Attention with Lookahead Keys},
    author = {Zhuoqing Song and Peng Sun and Huizhuo Yuan and Quanquan Gu},
    year   = {2025},
    url    = {https://api.semanticscholar.org/CorpusID:281218151}
}
```