Skip to content
View Rayn-Wu's full-sized avatar
😀
😀

Block or report Rayn-Wu

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 225 11 Updated Dec 9, 2025

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

36 1 Updated Dec 16, 2025

🌐 Forging Spatial Intelligence: A Survey on Multi-Modal Pre-Training for Autonomous Systems

15 1 Updated Dec 21, 2025

[ICCV 2025] AGO: Adaptive Grounding for Open World 3D Occupancy Prediction

12 Updated Jul 29, 2025

Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Python 304 16 Updated Mar 26, 2025

SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model

14 Updated Dec 17, 2025

OccSTeP: Benchmarking 4D Occupancy Spatio-Temporal Persistence

8 Updated Dec 18, 2025

GaussianFormer with Semantic Render & Multi-Frame Surpervice

Python 3 Updated Apr 8, 2025

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Python 222 18 Updated Jun 18, 2024

Code and Data for "Depth Based Semantic Scene Completion with Position Importance Aware Loss", ICRA2020 and RAL

Python 46 5 Updated Feb 5, 2020

Not All Pixels Are Equal: Learning Hardness Probability for Semantic Segmentation.

Python 37 Updated Oct 14, 2023

[NeurIPS2025 Spotlight] Implementation of "GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous Driving"

Python 58 1 Updated Oct 28, 2025

Collects papers on autonomous driving E2E learning and VLM/VLA, with organized research branches and trends in these fields.

65 8 Updated Dec 13, 2025

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 149 10 Updated Dec 19, 2025

FB-OCC & FlashOcc with Spatial Retrieval Enchanced

Python 5 Updated Dec 9, 2025

Devkit, Dataset Curation Code, and Dataset (nuScenes-Geography) for Spatial Retrieval Augmented Autonomous Driving

Python 21 1 Updated Dec 9, 2025

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 303 10 Updated Dec 1, 2025

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 230 4 Updated Nov 27, 2025

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Jupyter Notebook 304 12 Updated Sep 28, 2025

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Python 27 3 Updated Oct 30, 2024

This is the official repository for the AAAI 2026 paper "DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving"

Python 3 1 Updated Dec 17, 2025

Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"

Python 112 13 Updated Aug 21, 2023

Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors, CVPR 2024

Python 22 2 Updated Oct 12, 2024

[AAAI'26] BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection

Python 21 1 Updated Dec 3, 2025

Official implementation of Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

Python 203 10 Updated Dec 8, 2025

[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think

Python 230 17 Updated Oct 4, 2025

Official implementation of SRCN3D: Sparse R-CNN 3D Surround-View Cameras 3D Object Detection and Tracking for Autonomous Driving

Python 56 9 Updated Oct 20, 2022

A unified framework for 3D Occupancy Prediction

Python 15 Updated Dec 3, 2025

Learning to Drive via Real-World Simulation at Scale

102 4 Updated Dec 8, 2025
Next