Skip to content
View DdeGeus's full-sized avatar

Block or report DdeGeus

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[CVPR 2026] Official repository for the paper: "INSID3: Training-Free In-Context Segmentation with DINOv3"

Python 104 4 Updated Apr 1, 2026

[CVPR 2026 Workshop] Official code and models for Plain Mask Transformer (PMT).

Jupyter Notebook 19 Updated Mar 27, 2026

[CVPR 2026] Official code and models for Video Encoder-only Mask Transformer (VidEoMT).

Python 190 16 Updated Mar 4, 2026

Fast and memory-efficient exact attention

Python 23,109 2,576 Updated Apr 2, 2026

Sa2VA-i is an improved version of the popular Sa2VA model

Python 16 1 Updated Nov 25, 2025

VisualOverload (CVPR 2026) is a VQA benchmark for image understanding in dense, high-resolution scenes.

Python 16 1 Updated Mar 29, 2026

The code for paper 'Learning from Videos for 3D World: Enhancing MLLMs with 3D Vision Geometry Priors'

Jupyter Notebook 220 6 Updated Nov 28, 2025

Official code of Franca: Nested Matryoshka Clustering for Scalable Visual Representation Learning

Python 267 15 Updated Sep 24, 2025

[ICCV 2025] DONUT: A Decoder-Only Model for Trajectory Prediction

Python 41 5 Updated Mar 23, 2026

[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer

Python 12,768 1,406 Updated Mar 3, 2026

🚀 Lightning-fast computer vision models. Fine-tune SOTA models with just a few lines of code. Ready for cloud ☁️ and edge 📱 deployment.

Python 349 3 Updated Dec 11, 2025

Code for on-the-fly creation of pseudo video datasets as described in "How Important are Videos for Training Video LLMs?"

Python 2 Updated Apr 2, 2026

[CVPR 2025 Highlight] Official code and models for Encoder-only Mask Transformer (EoMT).

Jupyter Notebook 565 55 Updated Feb 25, 2026

3DV 2026 | CVPRW 2025 (T4V)

Python 97 4 Updated Mar 20, 2026

[ICRA 2025] Interactive4D: Interactive 4D LiDAR Segmentation

Python 97 7 Updated Feb 10, 2026

[ECCV 2024] Improving 2D Feature Representations by 3D-Aware Fine-Tuning

Python 314 13 Updated Dec 21, 2025

[WACV'25 Oral] Fine-Tuning Image-Conditional Diffusion Models is Easier than You Think

Python 506 22 Updated Jan 26, 2026

[ICLR 2023 Spotlight] Vision Transformer Adapter for Dense Predictions

Python 1,480 153 Updated Jun 3, 2025

Official repository for "AM-RADIO: Reduce All Domains Into One"

Python 1,736 65 Updated Mar 30, 2026
Python 13 Updated Oct 14, 2025

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 18,840 2,410 Updated Mar 20, 2026

[ECCV24] VISA: Reasoning Video Object Segmentation via Large Language Model

Python 210 8 Updated Aug 5, 2024

[CVPR 2025] Official PyTorch Implementation of MambaVision: A Hybrid Mamba-Transformer Vision Backbone

Python 2,097 134 Updated Mar 11, 2026

Official implementation of the CVPR 2024 paper ViT-CoMer: Vision Transformer with Convolutional Multi-scale Feature Interaction for Dense Predictions.

Python 341 24 Updated Jan 21, 2025

[CVPR 2024] Task-aligned Part-aware Panoptic Segmentation through Joint Object-Part Representations

Python 24 5 Updated Jan 20, 2025

[CVPR 2024] PEM: Prototype-based Efficient MaskFormer for Image Segmentation

Python 130 9 Updated Mar 10, 2025

ALGM applied to Segmenter

Python 30 2 Updated May 27, 2024
Python 35 4 Updated Jul 18, 2025
Python 45 1 Updated Dec 4, 2023
Next