- London, UK
-
08:45
(UTC +01:00) - https://kerolex.github.io/
- https://orcid.org/0000-0002-8227-8529
- in/alessioxompero
Stars
[ICCV 2025] Official implementation of the paper: "Dynamic-DINO: Fine-Grained Mixture of Experts Tuning for Real-time Open-Vocabulary Object Detection"
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
[IJCAI 2024] Papers about graph reduction including graph coarsening, graph condensation, graph sparsification, graph summarization, etc.
Papers about explainability of GNNs
✨ The all-contributors bot website and documentation. Recognize all contributors, not just the ones who push code ✨
My implementation of the original GAT paper (Veličković et al.). I've additionally included the playground.py file for visualizing the Cora dataset, GAT embeddings, an attention mechanism, and entr…
Awesome work on object 6 DoF pose estimation
Recent papers about generalizable 6DoF object pose estimation.
Repository for the paper: Generating gender-ambiguous voices for privacy-preserving speech recognition
Official repository of the paper Towards safe human-to-robot handovers of unknown containers, presented at the IEEE International Conference on Robot and Human Interactive Communication (RO-MAN), 2021
Efficient adaptive non-maximal suppression algorithms for homogeneous spatial keypoint distribution
[CVPR'22] NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
Code for the CVPR2021 paper "Patch-NetVLAD: Multi-Scale Fusion of Locally-Global Descriptors for Place Recognition"
SuperGlue: Learning Feature Matching with Graph Neural Networks (CVPR 2020, Oral)
Appearance-based Loop Closure Detection using Incremental Bags of Binary Words
A General Simultaneous Localization and Mapping Framework which supports feature based or direct method and different sensors including monocular camera, RGB-D sensors or any other input types can …
CCM-SLAM: Robust and Efficient Centralized Collaborative Monocular SLAM for Robotic Teams
Code of single-view depth prediction algorithm on Internet Photos described in "MegaDepth: Learning Single-View Depth Prediction from Internet Photos, Z. Li and N. Snavely, CVPR 2018".
FBOW (Fast Bag of Words) is an extremmely optimized version of the DBow2/DBow3 libraries.
WiSE-MNet++: Wireless Simulation Environment for Multimedia Networks
Cloud framework for Cooperative Tracking And Mapping
Public code for "Data-Efficient Decentralized Visual SLAM"