Skip to content
View LuoXubo's full-sized avatar

Highlights

  • Pro

Block or report LuoXubo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Jupyter Notebook 18 Updated Dec 19, 2025

Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).

Python 58 4 Updated Dec 8, 2025

Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"

Python 261 30 Updated Oct 29, 2025

A list of works on video generation towards world model

279 5 Updated Dec 19, 2025

A Collection of Diffusion for Path Planning Papers, Toolboxes and Notes.

7 Updated Jul 28, 2025

WorldPlay: Interactive World Modeling with Real-Time Latency and Geometric Consistency

Python 570 28 Updated Dec 19, 2025

Offical code release for DynoSAM: Dynamic Object Smoothing And Mapping. Accepted Transactions on Robotics (Visual SLAM SI). A visual SLAM framework and pipeline for Dynamic environements, estimatin…

C++ 222 19 Updated Dec 17, 2025

[TRO 2025] NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning.

Python 772 71 Updated Dec 12, 2025

[CVPR 2025 Highlight] MATCHA: Towards Matching Anything

Python 5 Updated Dec 11, 2025
Python 443 59 Updated Dec 16, 2024

Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934

Python 167 7 Updated Oct 28, 2025

Vision-Language Navigation Benchmark in Isaac Lab

Python 283 24 Updated Aug 28, 2025

OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer

Python 220 11 Updated Dec 3, 2025

Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.

1,673 68 Updated Dec 12, 2025

[IROS 2025 oral] Official implementation of NOLO: Navigate Only Look Once

Python 17 1 Updated Nov 13, 2025

A most Frontend Collection and survey of vision-language model papers, and models GitHub repository. Continuous updates.

477 26 Updated Dec 15, 2025

[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images, AAAI, 2024

Python 63 2 Updated Apr 9, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 9,465 980 Updated Aug 12, 2024

The version 2 of Mono 3D visiual grounding

Python 2 Updated Mar 21, 2025

Official code release for CoRL'25 paper: VT-Refine: Learning Bimanual Assembly with Visuo-Tactile Feedback via Simulation Fine-Tuning

Dockerfile 83 5 Updated Oct 18, 2025

Repository of the paper "AnyUp: Universal Feature Upsampling".

Jupyter Notebook 427 26 Updated Dec 19, 2025

Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"

Python 1,641 53 Updated Nov 15, 2025

Reading list for research topics in embodied vision

689 78 Updated Jun 13, 2025

This is the code for the IROS2025 RoboSense challenge track1: LLM for Driving

Python 4 Updated Oct 29, 2025

Robot Learning Beyond Earth

Python 98 12 Updated Dec 1, 2025
Python 119 5 Updated Sep 8, 2025

[ACMMM 2025] Official implementation of SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero Shot 3D Visual Grounding

Python 16 1 Updated Nov 25, 2025

Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 17,278 1,447 Updated Nov 28, 2025

SLAM-Former: Putting SLAM into One Transformer

403 5 Updated Sep 26, 2025

[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding

Python 197 4 Updated Apr 21, 2025
Next