Stars
An efficient single/multi-agent trajectory planner for multicopters.
Official implementation of the paper: "StreamVLN: Streaming Vision-and-Language Navigation via SlowFast Context Modeling"
Project AirSim is Microsoft's evolution of AirSim, an advanced simulation platform for building, training, and testing autonomous systems in high-fidelity virtual environments
MobileVLA-R1: Reinforcing Vision-Language-Action for Mobile Robots
OpenEQA: Embodied Question Answering in the Era of Foundation Models
[Embodied-AI-Survey-2025] Paper List and Resource Repository for Embodied AI
Scaling Spatial Intelligence with Multimodal Foundation Models
Code for paper "MapTracker: Tracking with Strided Memory Fusion for Consistent Vector HD Mapping", ECCV 2024 (Oral)
Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research
Fast and memory-efficient exact attention
[CVPR'25 Highlight] Official repository of Sonata: Self-Supervised Learning of Reliable Point Representations
RoboChallenge inference example code
GaussianPretrain for Visual Pre-training in Autonomous Driving, showcasing significant improvements across various 3D perception tasks, including 3D object detection, HD-map construction, and Occupa…
📖 This is a repository for organizing papers, code, and other resources related to Visual Reinforcement Learning.
[SIGGRAPH'24] 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
A differentiable rasterizer used in the project "2D Gaussian Splatting"
[Lumina Embodied AI] Embodied Intelligence Technical Guide (Embodied-AI-Guide)
[ECCV 2024] This is the official implementation of MapQR, an end-to-end method with an emphasis on enhancing query capabilities for constructing online vectorized maps.
[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion
High-Resolution Image Synthesis with Latent Diffusion Models
Elucidating the Design Space of Diffusion-Based Generative Models (EDM)
Implementation of Denoising Diffusion Probabilistic Model in PyTorch
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Code for "PolyDiffuse: Polygonal Shape Reconstruction via Guided Set Diffusion Models", NeurIPS 2023
[ICCV2023 Best Paper Finalist] PyTorch implementation of DiffusionDet (https://arxiv.org/abs/2211.09788)