Rayn-Wu

😀

Yuan Wu Rayn-Wu

😀

PhD student at PCALab, NJUST, China

35 followers · 139 following

Nanjing University of Science and Technology
Nanjing, China
rayn-wu.github.io/

Achievements

Lists (12)

Sort

Stars

Wakals / CoVT

Official repo of "Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens"

Python 225 11 Updated Dec 9, 2025

happinesslz / DrivePI

DrivePI: Spatial-aware 4D MLLM for Unified Autonomous Driving Understanding, Perception, Prediction and Planning

36 1 Updated Dec 16, 2025

worldbench / awesome-spatial-intelligence

🌐 Forging Spatial Intelligence: A Survey on Multi-Modal Pre-Training for Autonomous Systems

15 1 Updated Dec 21, 2025

EdwardLeeLPZ / AGO

[ICCV 2025] AGO: Adaptive Grounding for Open World 3D Occupancy Prediction

12 Updated Jul 29, 2025

zhenghao2519 / SpaceDrive

19 2 Updated Dec 12, 2025

hustvl / AlphaDrive

Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning

Python 304 16 Updated Mar 26, 2025

MrPicklesGG / SparseWorld-TC

SparseWorld-TC: Trajectory-Conditioned Sparse Occupancy World Model

14 Updated Dec 17, 2025

InSAI-Lab / OccSTeP

OccSTeP: Benchmarking 4D Occupancy Spatio-Temporal Persistence

8 Updated Dec 18, 2025

HCaelrs / Squeeze-Out-GaussianFormer

GaussianFormer with Semantic Render & Multi-Frame Surpervice

Python 3 Updated Apr 8, 2025

ispc-lab / LiDAR4D

💫 [CVPR 2024] LiDAR4D: Dynamic Neural Fields for Novel Space-time View LiDAR Synthesis

Python 222 18 Updated Jun 18, 2024

UniLauX / PALNet

Code and Data for "Depth Based Semantic Scene Completion with Position Importance Aware Loss", ICRA2020 and RAL

Python 46 5 Updated Feb 5, 2020

Menoly-xin / Hardness-Level-Learning

Not All Pixels Are Equal: Learning Hardness Probability for Semantic Segmentation.

Python 37 Updated Oct 14, 2023

Say2L / GaussianFusion

[NeurIPS2025 Spotlight] Implementation of "GaussianFusion: Gaussian-Based Multi-Sensor Fusion for End-to-End Autonomous Driving"

Python 58 1 Updated Oct 28, 2025

AutoLab-SAI-SJTU / GE2EAD

Collects papers on autonomous driving E2E learning and VLM/VLA, with organized research branches and trends in these fields.

65 8 Updated Dec 13, 2025

worldbench / WorldLens

🌐 WorldLens: Full-Spectrum Evaluations of Driving World Models in Real World

Python 149 10 Updated Dec 19, 2025

SpatialRetrievalAD / Occupancy-Prediction

FB-OCC & FlashOcc with Spatial Retrieval Enchanced

Python 5 Updated Dec 9, 2025

SpatialRetrievalAD / SpatialRetrievalAD-Dataset-Devkit

Devkit, Dataset Curation Code, and Dataset (nuScenes-Geography) for Spatial Retrieval Augmented Autonomous Driving

Python 21 1 Updated Dec 9, 2025

lifuguan / IGGT_official

IGGT: Instance-Grounded Geometry Transformer for Semantic 3D Reconstruction

Python 303 10 Updated Dec 1, 2025

InternRobotics / G2VLM

G2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning

Python 230 4 Updated Nov 27, 2025

Perceive-Anything / PAM

Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos

Jupyter Notebook 304 12 Updated Sep 28, 2025

showlab / VisInContext

Official implementation of Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning

Python 27 3 Updated Oct 30, 2024

Hongbin98 / DriveFlow

This is the official repository for the AAAI 2026 paper "DriveFlow: Rectified Flow Adaptation for Robust 3D Object Detection in Autonomous Driving"

Python 3 1 Updated Dec 17, 2025

tusen-ai / MV2D

Code for "Object as Query: Lifting any 2D Object Detector to 3D Detection"

Python 112 13 Updated Aug 21, 2023

nullmax-vision / QAF2D

Enhancing 3D Object Detection with 2D Detection-Guided Query Anchors, CVPR 2024

Python 22 2 Updated Oct 12, 2024

gwenzhang / BEVDilation

[AAAI'26] BEVDilation: LiDAR-Centric Multi-Modal Fusion for 3D Object Detection

Python 21 1 Updated Dec 3, 2025

EnVision-Research / Lotus-2

Official implementation of Lotus-2: Advancing Geometric Dense Prediction with Powerful Image Generative Model

Python 203 10 Updated Dec 8, 2025

Martinser / REG

[NeurIPS 2025 Oral] Representation Entanglement for Generation: Training Diffusion Transformers Is Much Easier Than You Think

Python 230 17 Updated Oct 4, 2025

synsin0 / SRCN3D

Official implementation of SRCN3D: Sparse R-CNN 3D Surround-View Cameras 3D Object Detection and Tracking for Autonomous Driving

Python 56 9 Updated Oct 20, 2022

cdb342 / OccStudio

A unified framework for 3D Occupancy Prediction

Python 15 Updated Dec 3, 2025

OpenDriveLab / SimScale

Learning to Drive via Real-World Simulation at Scale

102 4 Updated Dec 8, 2025

Yuan Wu Rayn-Wu

Lists (12)

3D scene generation

AD

attention

End2End

gaussian

Height

ideas

🚀 My stack

occ

unsupervise

VLM

world model

Stars