SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation

Introduction

SceneGlue is a scene-aware feature matching framework that overcomes the locality limitation of traditional descriptors by integrating parallel attention for implicit global context modeling and a Visibility Transformer for explicit cross-view visibility estimation. By jointly leveraging implicit and explicit scene-level awareness without requiring scene-level annotations, it significantly improves matching accuracy, robustness, and interpretability across multiple vision tasks. SceneGlue was published in IEEE Transactions on Circuits and Systems for Video Technology, 2026.

https://ieeexplore.ieee.org/document/11483154
https://arxiv.org/abs/2604.13941

What makes SceneGlue special?

SceneGlue goes beyond traditional point-level matching by introducing scene-aware feature matching, integrating global scene information with local correspondences.
It combines implicit scene modeling (via parallel self- and cross-attention) with explicit reasoning (via a Visibility Transformer for cross-view visibility estimation).
Parallel attention mechanism enables more efficient and richer interactions than the sequential attention used in prior methods.
SceneGlue improves feature representation through multi-scale descriptors and wave-based position encoding, enhancing robustness to scale and geometry variations.

Installation

conda env create -f environment.yaml
conda activate sceneglue

We provide the download link to

the scannet-1500-testset (~1GB).
the megadepth-1500-testset (~600MB).

Test

You need to setup the testing subsets of ScanNet and MegaDepth first. We create symlinks from the previously downloaded datasets to data/{{dataset}}/test.

# set up symlinks
ln -s /path/to/scannet-1500-testset/* /path/to/LoFTR/data/scannet/test
ln -s /path/to/megadepth-1500-testset/* /path/to/LoFTR/data/megadepth/test

MegaDepth

conda activate sceneglue
bash ./scripts/reproduce_test/outdoor.sh

ScanNet

conda activate sceneglue
bash ./scripts/reproduce_test/indoor.sh

Citation

@ARTICLE{SceneGlue,
  author={Du, Songlin and Lu, Xiaoyong and Yan, Yaping and Xiao, Guobao and Lu, Xiaobo and Ikenaga, Takeshi},
  journal={IEEE Transactions on Circuits and Systems for Video Technology}, 
  title={SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation}, 
  year={2026},
  doi={10.1109/TCSVT.2026.3684799}
}

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.idea		.idea
assets		assets
sceneglue_models		sceneglue_models
LICENSE		LICENSE
README.md		README.md
environment.yaml		environment.yaml
framework.jpg		framework.jpg
requirements.txt		requirements.txt
test.py		test.py
train.py		train.py
visualiaztion.jpg		visualiaztion.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation

Introduction

What makes SceneGlue special?

Installation

Test

MegaDepth

ScanNet

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

SceneGlue: Scene-Aware Transformer for Feature Matching without Scene-Level Annotation

Introduction

What makes SceneGlue special?

Installation

Test

MegaDepth

ScanNet

Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages