University of Chinese Academy of Sciences
Beijing (UTC +08:00)
https://luoxubo.github.io/
@xubo_luo
Lists (24)
Attention mechanism
Autonomous driving
clip
Efficiency
ekf
Event Camera
Facial expression recognition
flow matching
Homography Estimation
IELTS
Image fusion
Image matching
Some nice image matching related works
Image retrieval
Lab homepage
Some nice templates for lab homepages
Learning
Multi-sensor localization
NeRF
Paper codes
Pose estimation
Segmentation
SLAM with deep learning
Tracking
Object tracking repos.
Visual localization
world model
Starred repositories
Official implementation of NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous Environments (ICCV'25).
Official implementation of paper "Unified World Models: Memory-Augmented Planning and Foresight for Visual Navigation"
A list of works on video generation towards world model
A Collection of Diffusion for Path Planning Papers, Toolboxes and Notes.
WorldPlay: Interactive World Modeling with Real-Time Latency and Geometric Consistency
Official code release for DynoSAM: Dynamic Object Smoothing And Mapping. Accepted to Transactions on Robotics (Visual SLAM SI). A visual SLAM framework and pipeline for dynamic environments, estimatin…
[TRO 2025] NeuPAN: Direct Point Robot Navigation with End-to-End Model-based Learning.
[CVPR 2025 Highlight] MATCHA: Towards Matching Anything
Official repository for "RLVR-World: Training World Models with Reinforcement Learning" (NeurIPS 2025), https://arxiv.org/abs/2505.13934
Vision-Language Navigation Benchmark in Isaac Lab
OmniVGGT: Omni-Modality Driven Visual Geometry Grounded Transformer
Collect some World Models for Autonomous Driving (and Robotic, etc.) papers.
[IROS 2025 oral] Official implementation of NOLO: Navigate Only Look Once
A frontier collection and survey of vision-language model papers and models, maintained as a GitHub repository. Continuously updated.
[AAAI 2024] Mono3DVG: 3D Visual Grounding in Monocular Images
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Official code release for CoRL'25 paper: VT-Refine: Learning Bimanual Assembly with Visuo-Tactile Feedback via Simulation Fine-Tuning
Repository of the paper "AnyUp: Universal Feature Upsampling".
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
Reading list for research topics in embodied vision
This is the code for the IROS2025 RoboSense challenge track1: LLM for Driving
[ACMMM 2025] Official implementation of SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero Shot 3D Visual Grounding
Qwen3-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
SLAM-Former: Putting SLAM into One Transformer
[CVPR'25] SeeGround: See and Ground for Zero-Shot Open-Vocabulary 3D Visual Grounding