TensorFlow implementation of 'Robust Image Watermarking based on Cross-Attention and Invariant Domain Learning'
A complete implementation of the "Attention Is All You Need" Transformer model from scratch using PyTorch. This project focuses on building and training a Transformer for neural machine translation (English-to-Italian) on the OpusBooks dataset.
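For readers new to the topic, the sketch below shows the encoder-decoder (cross-) attention at the heart of the "Attention Is All You Need" Transformer: decoder states form the queries, encoder states supply keys and values. It is a minimal, self-contained illustration with assumed dimensions, not code taken from the repository above.

```python
# Minimal cross-attention sketch (decoder queries attend over encoder outputs).
# Illustrative only; not taken from any repository listed on this page.
import math
import torch
import torch.nn as nn

class CrossAttention(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        assert d_model % n_heads == 0
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        self.w_q = nn.Linear(d_model, d_model)
        self.w_k = nn.Linear(d_model, d_model)
        self.w_v = nn.Linear(d_model, d_model)
        self.w_o = nn.Linear(d_model, d_model)

    def forward(self, decoder_states, encoder_states):
        # decoder_states: (batch, tgt_len, d_model) -> queries
        # encoder_states: (batch, src_len, d_model) -> keys and values
        b, tgt_len, _ = decoder_states.shape
        src_len = encoder_states.shape[1]
        q = self.w_q(decoder_states).view(b, tgt_len, self.n_heads, self.d_head).transpose(1, 2)
        k = self.w_k(encoder_states).view(b, src_len, self.n_heads, self.d_head).transpose(1, 2)
        v = self.w_v(encoder_states).view(b, src_len, self.n_heads, self.d_head).transpose(1, 2)
        scores = q @ k.transpose(-2, -1) / math.sqrt(self.d_head)  # (b, heads, tgt, src)
        attn = scores.softmax(dim=-1)
        out = (attn @ v).transpose(1, 2).reshape(b, tgt_len, -1)
        return self.w_o(out)

# Example: 2 sequences, 5 target tokens attending over 7 source tokens.
layer = CrossAttention(d_model=64, n_heads=8)
out = layer(torch.randn(2, 5, 64), torch.randn(2, 7, 64))
print(out.shape)  # torch.Size([2, 5, 64])
```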
[ICIP 2025] Official implementation of RT-X Net: RGB-Thermal cross attention network for Low-Light Image Enhancement
TCR Epitope Generation Model with Top-K Prediction
Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
Multimodal transformer for financial time-series prediction with dual configuration systems (YAML/programmatic), sophisticated data processing pipelines, file caching, and advanced numerical data augmentation
Official repository for "The Strawberry Problem 🍓: Emergence of Character-level Understanding in Tokenized Language Models"
[IV 2025, Oral] Official code of "6Img-to-3D: Few-Image Large-Scale Outdoor Novel View Synthesis"
🔥 [TAI 2025] Exploring Mutual Cross-Modal Attention for Context-Aware Human Affordance Generation (official code).
Investigating how text-to-image diffusion models internally represent artistic concepts like content and style when generating artworks.
PyTorch Implementation of SD-VSum and S-VideoXum Dataset Distribution from "SD-VSum: A Method and Dataset for Script-Driven Video Summarization" (ACM Multimedia 2025)
This is the project for the paper "Low-Light Video Enhancement via Spatial-Temporal Consistent Decomposition" (IJCAI 2025)
Detecting word-level stress in English speech using wav2vec 2.0, with extensions to multimodal speech+text models via cross-attention fusion with BERT.
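A hedged sketch of what such speech+text cross-attention fusion could look like, with acoustic frame features as queries over BERT-style token embeddings. The module name, 768-dimensional features, and two-class stress head are assumptions for illustration, not this repository's actual code.

```python
# Sketch of speech-text fusion via cross-attention: wav2vec 2.0-style frame
# features query BERT-style token embeddings. Illustrative only; the fusion
# in the repository above may be structured differently.
import torch
import torch.nn as nn

class SpeechTextFusion(nn.Module):
    def __init__(self, d_model: int = 768, n_heads: int = 8):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(d_model)
        self.classifier = nn.Linear(d_model, 2)  # stressed vs. unstressed (assumed labels)

    def forward(self, speech_feats, text_feats, text_padding_mask=None):
        # speech_feats: (batch, n_frames, d_model)  e.g. wav2vec 2.0 hidden states
        # text_feats:   (batch, n_tokens, d_model)  e.g. BERT hidden states
        fused, _ = self.cross_attn(query=speech_feats, key=text_feats, value=text_feats,
                                   key_padding_mask=text_padding_mask)
        fused = self.norm(speech_feats + fused)   # residual connection
        return self.classifier(fused)             # per-frame stress logits

fusion = SpeechTextFusion()
logits = fusion(torch.randn(2, 200, 768), torch.randn(2, 16, 768))
print(logits.shape)  # torch.Size([2, 200, 2])
```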
This is the implementation of the paper "Enhanced Photovoltaic Power Forecasting: An iTransformer and LSTM-Based Model Integrating Temporal and Covariate Interactions"
SOVL System (Self-Organizing Virtual Lifeform): A complex, purpose-agnostic autonomous agent with continuous, asynchronous learning capabilities via a dynamic scaffolded LLM and a frozen base LLM
Conditional Diffuser from scratch, applied to CelebA-HQ, CIFAR-10, and MNIST.
Photometry Guided Cross Attention Transformers for Astronomical Image Processing
3D Human-Object Interaction in Video: A New Approach to Object Tracking via Cross-Modal Attention
Anime sketch colorization using diffusion models and photo-sketch correspondence — a lightweight architecture combining semantic feature extraction, deformation flow, and cross-attention guidance.
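As a generic illustration of cross-attention guidance in a diffusion denoiser, the sketch below flattens a spatial feature map into tokens that query reference embeddings and injects the result residually. The layer name, channel count, and token count are assumptions; the cited repository's architecture may differ.

```python
# Sketch of cross-attention guidance in a diffusion denoiser block: spatial
# features of the noisy latent act as queries over reference-image embeddings.
# Generic conditioning pattern only; not the cited repository's code.
import torch
import torch.nn as nn

class CrossAttnGuidance(nn.Module):
    def __init__(self, channels: int = 256, n_heads: int = 4):
        super().__init__()
        self.attn = nn.MultiheadAttention(channels, n_heads, batch_first=True)
        self.norm = nn.LayerNorm(channels)

    def forward(self, feat_map, ref_tokens):
        # feat_map:   (batch, channels, H, W)   noisy-latent features from the denoiser
        # ref_tokens: (batch, n_ref, channels)  reference/correspondence embeddings
        b, c, h, w = feat_map.shape
        tokens = feat_map.flatten(2).transpose(1, 2)               # (b, H*W, c)
        guided, _ = self.attn(query=self.norm(tokens), key=ref_tokens, value=ref_tokens)
        tokens = tokens + guided                                   # residual guidance injection
        return tokens.transpose(1, 2).reshape(b, c, h, w)

block = CrossAttnGuidance()
out = block(torch.randn(2, 256, 16, 16), torch.randn(2, 77, 256))
print(out.shape)  # torch.Size([2, 256, 16, 16])
```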