Skip to content
View DdeGeus's full-sized avatar

Block or report DdeGeus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026] Official repository for the paper: "INSID3: Training-Free In-Context Segmentation with DINOv3"

Python 46 1 Updated Mar 31, 2026

[CVPR 2026 Workshop] Official code and models for Plain Mask Transformer (PMT).

Jupyter Notebook 19 Updated Mar 27, 2026

[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).

Python 188 16 Updated Mar 4, 2026

Fast and memory-efficient exact attention

Python 23,065 2,568 Updated Mar 31, 2026

Sa2VA-i is an improved version of the popular Sa2VA model

Python 16 1 Updated Nov 25, 2025

VisualOverload (CVPR 2026) is a VQA benchmark for image understanding in dense, high-resolution scenes.

Python 16 1 Updated Mar 29, 2026

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 219 6 Updated Nov 28, 2025

Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Python 267 15 Updated Sep 24, 2025

[ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction

Python 41 5 Updated Mar 23, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,741 1,402 Updated Mar 3, 2026

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 349 3 Updated Dec 11, 2025

Code for on-the-fly creation of pseudo video datasets as described in "How Important are Videos for Training Video LLMs?"

2 Updated Jun 11, 2025

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 564 55 Updated Feb 25, 2026

3DV 2026 | CVPRW 2025 (T4V)

Python 97 4 Updated Mar 20, 2026

[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation

Python 97 7 Updated Feb 10, 2026

[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Python 313 12 Updated Dec 21, 2025

[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Python 506 22 Updated Jan 26, 2026

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Python 1,480 151 Updated Jun 3, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,729 65 Updated Mar 30, 2026
Python 13 Updated Oct 14, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,826 2,406 Updated Mar 20, 2026

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 210 8 Updated Aug 5, 2024

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 2,090 132 Updated Mar 11, 2026

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Python 341 24 Updated Jan 21, 2025

[CVPR 2024] Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations

Python 24 5 Updated Jan 20, 2025

[CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation

Python 130 10 Updated Mar 10, 2025

ALGM applied to Segmenter

Python 30 2 Updated May 27, 2024
Python 35 4 Updated Jul 18, 2025
Python 45 1 Updated Dec 4, 2023
Next