ga1i13o

Volume Transformer: Revisiting Vanilla Transformers for 3D Scene Understanding

Python 168 9 Updated Jun 14, 2026

[CVPR 2026 Oral] "MARCO: Navigating the Unseen Space of Semantic Correspondence"

Python 138 6 Updated Apr 21, 2026

This repository is a curated collection of the most exciting and influential CVPR 2026 papers. 🔥 [Paper + Code + Demo]

Python 465 26 Updated Jun 6, 2026

MoralStack is a governance and safety layer for LLM applications. It analyzes user requests before generation, evaluates risk and intent, and decides whether the AI should answer normally, answer s…

Python 8 Updated Jun 11, 2026

[CVPR 2026 Workshop] Official code and models for Plain Mask Transformer (PMT).

Jupyter Notebook 45 3 Updated Jun 11, 2026

[CVPR 2026 Oral] "INSID3: Training-Free In-Context Segmentation with DINOv3"

Python 532 44 Updated May 29, 2026
5 Updated Mar 20, 2026

Code, models, data for the NeurIPS'25 paper, Jamais Vu: Exposing the Generalization Gap in Supervised Semantic Correspondence

Python 7 Updated Feb 13, 2026

[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).

Python 238 23 Updated Jun 8, 2026

Code for the paper "Attention Meets Post-hoc Interpretability: A Mathematical Perspective", ICML 2024

Jupyter Notebook 22 3 Updated Nov 10, 2025
Python 20 4 Updated May 16, 2024

Official Repository for "Communication Efficient Federated Learning with Generalized Heavy-Ball Momentum", accepted at TMLR 2025

Python 28 Updated Jul 14, 2025

Official code for "To Match or Not to Match: Revisiting Image Matching for Reliable Visual Place Recognition" CVPR IMW 2025

Python 39 2 Updated Oct 4, 2025
Python 7 1 Updated Mar 17, 2022

This repo aims to include materials (papers, codes, slides) about SAM2 (segment anything in images and videos). We are continuously improving the project. Welcome to PR the works (papers, repos) th…

151 6 Updated Oct 1, 2025

Official implementation of "HiERO: understanding the hierarchy of human behavior enhances reasoning on egocentric videos", accepted at ICCV 2025.

Python 17 4 Updated May 22, 2026

Official implementation of "Hier-EgoPack: Hierarchical Egocentric Video Understanding with Diverse Task Perspectives" https://arxiv.org/abs/2502.02487

Python 12 Updated Feb 9, 2025

Official implementation of "A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives", accepted at CVPR 2024.

Python 24 Updated Jun 13, 2024

[NeurIPS 2025 Spotlight] "SANSA: Unleashing the Hidden Semantics in SAM2 for Few-Shot Segmentation."

Jupyter Notebook 200 7 Updated Dec 17, 2025

Code for EarthMatch (CVPR 2024 IMW), an iterative coregistration pipeline to localize astronaut photos of Earth

Python 38 2 Updated Mar 15, 2026

[CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation

Python 129 9 Updated Mar 10, 2025

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 599 58 Updated May 25, 2026

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 350 3 Updated Dec 11, 2025
Python 32 1 Updated May 31, 2024

Wrapper of 50+ image matching models with a unified interface

Python 875 77 Updated May 25, 2026

[CVPR 2025 Highlight] "SAMWISE: Infusing Wisdom in SAM2 for Text-Driven Video Segmentation"

Python 380 31 Updated Sep 25, 2025

Code for the paper "AMEGO: Active Memory from long EGOcentric videos" published at ECCV 2024

Python 45 4 Updated Dec 7, 2024

Official repository of the CVPR24 paper "The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement"

Python 58 2 Updated Aug 10, 2024

A bunch of scripts helping with daily tasks in 3D vision research.

Python 5 Updated Nov 14, 2025

Official code for ICCV 2023 paper "EigenPlaces: Training Viewpoint Robust Models for Visual Place Recognition"

Python 156 12 Updated Mar 15, 2026