Stars
gpt-oss-120b and gpt-oss-20b are two open-weight language models by OpenAI
Janus-Series: Unified Multimodal Understanding and Generation Models
🍀 PyTorch implementations of various attention mechanisms, MLPs, re-parameterization, and convolution modules, helpful for understanding the papers. ⭐⭐⭐
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Hackable and optimized Transformers building blocks, supporting a composable construction.
Matplotlib styles for scientific plotting
An efficient pure-PyTorch implementation of Kolmogorov-Arnold Network (KAN).
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
Python package for the evaluation of odometry and SLAM
[NeurIPS 2025] "AI-Researcher: Autonomous Scientific Innovation" -- A production-ready version: https://novix.science/chat
[NeurIPS 2025 D&B] Open-source Multi-agent Poster Generation from Papers
[CVPR 2025] MASt3R-SLAM: Real-Time Dense SLAM with 3D Reconstruction Priors
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM (CVPR 2024)
Atlas: End-to-End 3D Scene Reconstruction from Posed Images
[CVPR'24 Oral] Official repository of Point Transformer V3 (PTv3)
Unofficial implementation of Titans, SOTA memory for transformers, in PyTorch
Toolbox for quantitative trajectory evaluation of VO/VIO
[ICRA 2025 Best Paper] MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry
VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold
[ICLR 2025, Oral] EmbodiedSAM: Online Segment Any 3D Thing in Real Time
Open-source implementation of **MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds** from Meta Reality Labs. Project page: https://mv-dust3rp.github.io/
[CVPR 2025] MINIMA: Modality Invariant Image Matching
UrbanNav: An Open-sourced Multisensory Dataset for Benchmarking Positioning Algorithms Designed for Urban Areas
🌟 SHINE-Mapping: Large-Scale 3D Mapping Using Sparse Hierarchical Implicit Neural Representations (ICRA 2023)
[ECCV 2020] In-Domain GAN Inversion for Real Image Editing