Stars
Official Repo for ICCV25-Video2BEV: Transforming Drone Videos to BEVs for Video-based Geo-localization
Code release for Revisit Anything: Visual Place Recognition via Image Segment Retrieval (ECCV 2024)
Mapillary Street-level Sequences Dataset
[ICCV 2025] Where am I? Cross-View Geo-localization with Natural Language Descriptions.
[NeurIPS 2024] Official Implementation of Hawk: Learning to Understand Open-World Video Anomalies
CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image
Stable Diffusion web UI
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.