🔍 Visualize attention patterns in transformer models to better understand how LLMs process text inputs with interactive heatmaps and comparisons.
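A minimal sketch of the general workflow such a tool builds on, not this project's own code: pull per-layer attention weights from a Hugging Face encoder via `output_attentions=True` and render one head as a matplotlib heatmap (the model choice here is arbitrary).

```python
# Minimal sketch (not this repo's code): extract attention weights from a
# Hugging Face model and render one head as a heatmap with matplotlib.
import torch
import matplotlib.pyplot as plt
from transformers import AutoModel, AutoTokenizer

model_name = "distilbert-base-uncased"  # any encoder that returns attentions
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name, output_attentions=True)

text = "Attention heatmaps reveal which tokens the model attends to."
inputs = tokenizer(text, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)

# outputs.attentions: tuple of (batch, heads, seq, seq), one entry per layer
attn = outputs.attentions[-1][0, 0]            # last layer, first head
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

plt.imshow(attn, cmap="viridis")
plt.xticks(range(len(tokens)), tokens, rotation=90)
plt.yticks(range(len(tokens)), tokens)
plt.colorbar(label="attention weight")
plt.tight_layout()
plt.show()
```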
💥 Optimize linear attention models with efficient Triton-based implementations in PyTorch, compatible across NVIDIA, AMD, and Intel platforms.
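For context, a plain-PyTorch sketch of the linear-attention identity that kernels like these accelerate, using the φ(x) = elu(x) + 1 feature map from Katharopoulos et al. (2020); the actual Triton kernels are far more involved (chunking, causal variants, fused I/O).

```python
# Plain-PyTorch sketch of the linear attention identity such kernels speed up;
# phi(x) = elu(x) + 1 follows Katharopoulos et al. (2020). Non-causal
# (full-sequence) form; causal variants keep running sums instead.
import torch
import torch.nn.functional as F

def linear_attention(q, k, v, eps=1e-6):
    # q, k, v: (batch, heads, seq_len, head_dim)
    q, k = F.elu(q) + 1, F.elu(k) + 1
    kv = torch.einsum("bhnd,bhne->bhde", k, v)            # sum_n phi(k_n) v_n^T
    z = 1.0 / (torch.einsum("bhnd,bhd->bhn", q, k.sum(dim=2)) + eps)
    return torch.einsum("bhnd,bhde,bhn->bhne", q, kv, z)

q = k = v = torch.randn(1, 4, 128, 64)
out = linear_attention(q, k, v)   # O(n * d^2) instead of O(n^2 * d)
print(out.shape)                  # torch.Size([1, 4, 128, 64])
```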
🐙 Implements Flash Attention with an attention sink for gpt-oss-20b; includes test.py. Work in progress: backward pass, varlen support, and a community sync to return only softmax_lse.
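A rough, non-Flash sketch of the sink idea as it is usually described for gpt-oss-style attention: a per-head sink logit joins the softmax normalization but contributes no value vector. The real fused kernel (and this repo's exact handling) will differ; causal masking is omitted here.

```python
# Rough sketch of softmax attention with a per-head "sink" logit: the sink
# participates in the softmax normalization but contributes no value vector.
# Illustrative only; the fused Flash Attention kernel works very differently.
import torch

def attention_with_sink(q, k, v, sink_logit, scale):
    # q, k, v: (heads, seq, head_dim); sink_logit: (heads,)
    scores = torch.einsum("hqd,hkd->hqk", q, k) * scale
    sink = sink_logit[:, None, None].expand(-1, scores.shape[1], 1)
    logits = torch.cat([sink, scores], dim=-1)        # prepend sink column
    probs = torch.softmax(logits, dim=-1)[..., 1:]    # drop the sink's share
    return torch.einsum("hqk,hkd->hqd", probs, v)

h, n, d = 4, 16, 32
out = attention_with_sink(torch.randn(h, n, d), torch.randn(h, n, d),
                          torch.randn(h, n, d), torch.zeros(h), d ** -0.5)
print(out.shape)  # torch.Size([4, 16, 32])
```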
Advanced sparse modern Hopfield models delivering fast associative memory with energy-efficient inference.
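The dense modern Hopfield retrieval step these models build on is ξ_new = X softmax(β Xᵀξ) (Ramsauer et al., "Hopfield Networks is All You Need"); the sketch below shows that dense reference behavior, with the understanding that the sparse variants swap softmax for a sparse map such as sparsemax or entmax.

```python
# Dense modern Hopfield retrieval step: xi_new = X @ softmax(beta * X^T @ xi).
# The sparse models described above replace softmax with a sparse map
# (e.g. sparsemax / entmax); this shows only the dense reference behavior.
import torch

def hopfield_retrieve(X, xi, beta=8.0):
    # X: (d, N) stored patterns as columns; xi: (d,) query pattern
    p = torch.softmax(beta * (X.T @ xi), dim=0)   # association weights over patterns
    return X @ p                                  # retrieved (denoised) pattern

d, N = 64, 10
X = torch.randn(d, N)
noisy = X[:, 0] + 0.1 * torch.randn(d)
retrieved = hopfield_retrieve(X, noisy)
# High cosine similarity to the stored pattern indicates successful retrieval.
print(torch.nn.functional.cosine_similarity(retrieved, X[:, 0], dim=0))
```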
Keras beit, caformer, CMT, CoAtNet, convnext, davit, dino, efficientdet, edgenext, efficientformer, efficientnet, eva, fasternet, fastervit, fastvit, flexivit, gcvit, ghostnet, gpvit, hornet, hiera, iformer, inceptionnext, lcnet, levit, maxvit, mobilevit, moganet, nat, nfnets, pvt, swin, tinynet, tinyvit, uniformer, volo, vanillanet, yolor, yolov7, yolov8, yolox, gpt2, llama2; alias kecam
Implementation of Danijar's latest iteration of his Dreamer line of work
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
This repository contains the code to reproduce the experiments from the article "Dynamical Mean-Field Theory of Self-Attention Neural Networks".
Train a GPT from scratch on your laptop
A code deep-dive on one of the key innovations from Deepseek - Multihead Latent Attention (MLA)
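A hedged, conceptual sketch of MLA's core trick: keys and values are down-projected to a small per-token latent (which is what gets cached) and up-projected per head at attention time. RoPE decoupling, KV-cache handling, and the other DeepSeek details covered in the deep-dive are omitted, and all module names below are made up for illustration.

```python
# Conceptual sketch of Multi-Head Latent Attention: K/V are compressed to a
# small per-token latent and expanded per head at attention time, shrinking
# the KV cache. RoPE decoupling and causal masking are deliberately omitted.
import torch
import torch.nn as nn

class TinyMLA(nn.Module):
    def __init__(self, d_model=256, n_heads=4, d_latent=32):
        super().__init__()
        self.h, self.dh = n_heads, d_model // n_heads
        self.w_q = nn.Linear(d_model, d_model)
        self.w_kv_down = nn.Linear(d_model, d_latent)   # the latent that would be cached
        self.w_k_up = nn.Linear(d_latent, d_model)
        self.w_v_up = nn.Linear(d_latent, d_model)
        self.w_o = nn.Linear(d_model, d_model)

    def forward(self, x):
        b, n, _ = x.shape
        split = lambda t: t.view(b, n, self.h, self.dh).transpose(1, 2)
        c = self.w_kv_down(x)                           # (b, n, d_latent)
        q, k, v = split(self.w_q(x)), split(self.w_k_up(c)), split(self.w_v_up(c))
        attn = torch.softmax(q @ k.transpose(-2, -1) * self.dh ** -0.5, dim=-1)
        return self.w_o((attn @ v).transpose(1, 2).reshape(b, n, -1))

x = torch.randn(2, 16, 256)
print(TinyMLA()(x).shape)  # torch.Size([2, 16, 256])
```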
A complete implementation of the "Attention Is All You Need" Transformer model from scratch using PyTorch. This project focuses on building and training a Transformer for neural machine translation (English-to-Italian) on the OpusBooks dataset.
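The model's central building block, scaled dot-product attention, reduces to a few lines; the sketch below follows the paper's formula Attention(Q, K, V) = softmax(QKᵀ/√d_k)V, while the repository's full implementation adds multi-head projections, masking, and dropout around it.

```python
# Scaled dot-product attention as defined in "Attention Is All You Need":
#   Attention(Q, K, V) = softmax(Q K^T / sqrt(d_k)) V
import math
import torch

def scaled_dot_product_attention(q, k, v, mask=None):
    d_k = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)
    if mask is not None:
        scores = scores.masked_fill(mask == 0, float("-inf"))
    weights = torch.softmax(scores, dim=-1)
    return weights @ v, weights

q = k = v = torch.randn(2, 8, 10, 64)   # (batch, heads, seq, d_k)
out, w = scaled_dot_product_attention(q, k, v)
print(out.shape, w.shape)               # (2, 8, 10, 64) (2, 8, 10, 10)
```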
Scenic: A Jax Library for Computer Vision Research and Beyond
Linear-time sequence modeling that replaces attention's O(n²d) complexity with O(nd) summation-based aggregation. Demonstrates constraint-driven emergence: how functional representations can develop from optimization pressure and architectural constraints alone, without explicit pairwise interactions.
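Illustrative only, since the project's exact mechanism isn't spelled out here: one simple O(nd) summation-based aggregator is a causal prefix mean computed with a single cumulative sum, replacing any pairwise token-to-token interaction.

```python
# Illustrative only: one way to replace pairwise attention with O(n*d)
# summation-based aggregation is a causal prefix mean of token features.
# This sketches the general idea, not this project's specific mechanism.
import torch

def causal_prefix_mean(x):
    # x: (batch, seq_len, d) -> each position sees the mean of itself
    # and all earlier positions; cost is one cumulative sum, O(n*d).
    csum = x.cumsum(dim=1)
    counts = torch.arange(1, x.size(1) + 1, device=x.device).view(1, -1, 1)
    return csum / counts

x = torch.randn(2, 16, 64)
print(causal_prefix_mean(x).shape)  # torch.Size([2, 16, 64])
```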
Vision Transformers for image classification, image segmentation, and object detection.
A simple and minimal open-source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid AI, in PyTorch
Implementation of the ScreenAI model from the paper: "A Vision-Language Model for UI and Infographics Understanding"
We introduce VLM-Mamba, the first Vision-Language Model built entirely on State Space Models (SSMs), specifically leveraging the Mamba architecture.
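For orientation, a toy diagonal state-space recurrence h_t = a·h_{t−1} + b·x_t, y_t = c·h_t, which is the primitive SSM layers discretize; Mamba additionally makes the parameters input-dependent ("selective") and runs a hardware-aware parallel scan, none of which this sequential loop shows.

```python
# Toy diagonal state-space recurrence underlying SSM layers:
#   h_t = a * h_{t-1} + b * x_t,   y_t = c * h_t
# Mamba makes a, b, c input-dependent ("selective") and uses a hardware-aware
# scan; this sequential loop only illustrates the recurrence itself.
import torch

def diagonal_ssm(x, a, b, c):
    # x: (seq_len, d); a, b, c: (d,) per-channel parameters
    h = torch.zeros_like(x[0])
    ys = []
    for x_t in x:
        h = a * h + b * x_t
        ys.append(c * h)
    return torch.stack(ys)

x = torch.randn(32, 16)
a, b, c = torch.rand(16) * 0.9, torch.ones(16), torch.ones(16)
print(diagonal_ssm(x, a, b, c).shape)  # torch.Size([32, 16])
```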
Helps preserve your attention when watching long videos by skipping humming ("hum"s), using the HUMAwareVAD2025 model