Skip to content

Iron-LYK/ReCamDriving

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

21 Commits
 
 
 
 

Repository files navigation

ReCamDriving Logo

ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation

Yaokun Li1, Shuaixian Wang1,3, Mantang Guo2, Jiehui Huang4, Taojun Ding2
Mu Hu4, Kaixuan Wang2, Shaojie Shen4, Guang Tan1

1 Sun Yat-sen University    2 ZYT    3 Shenzhen Polytechnic University   
4 The Hong Kong University of Science and Technology

   

📷 Abstract

We propose ReCamDriving, a purely vision-based, camera-controlled novel-trajectory video generation framework. While repair-based methods fail to restore complex artifacts and LiDAR-based approaches rely on sparse and incomplete cues, ReCamDriving leverages dense and scene-complete 3DGS renderings for explicit geometric guidance, achieving precise camera-controllable generation. To mitigate overfitting to restoration behaviors when conditioned on 3DGS renderings, ReCamDriving adopts a two-stage training paradigm: the first stage uses camera poses for coarse control, while the second stage incorporates 3DGS renderings for fine-grained viewpoint and geometric guidance. Furthermore, we present a 3DGS-based cross-trajectory data curation strategy to eliminate the train–test gap in camera transformation patterns, enabling scalable multi-trajectory supervision from monocular videos. Based on this strategy, we construct the ParaDrive dataset, containing over 110K parallel-trajectory video pairs. Extensive experiments demonstrate that ReCamDriving achieves state-of-the-art camera controllability and structural consistency.


ReCamDriving Teaser Image: Comparison of novel-trajectory generation methods

Comparison of novel-trajectory generation. Repair-based methods (e.g., Difix3D+) suffer from severe artifacts under novel viewpoints, while LiDAR-based camera-controlled methods (e.g., StreetCrafter) show geometric inconsistencies in occluded or distant regions due to incomplete cues. In contrast, ReCamDriving employs a coarse-to-fine two-stage training strategy that leverages dense scene-structure information from novel-trajectory 3DGS renderings for precise camera control and structurally consistent generation.

🌐 ParaDrive Dataset

Based on our data curation strategy, we constructed the ParaDrive dataset, which contains over 110K parallel-trajectory video pairs, enabling scalable multi-trajectory supervision.

✅ TODO List

We are finalizing the release of the code and data and aim to complete it as soon as possible. Please stay tuned!

  • Paper released on arXiv.
  • Release training and inference code.
  • Release model weights.
  • Release ParaDrive dataset.

🔗 Citation

If you find our work helpful, please consider citing:

@misc{li2025recamdrivinglidarfreecameracontrollednovel,
      title={ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation}, 
      author={Yaokun Li and Shuaixian Wang and Mantang Guo and Jiehui Huang and Taojun Ding and Mu Hu and Kaixuan Wang and Shaojie Shen and Guang Tan},
      year={2025},
      eprint={2512.03621},
      archivePrefix={arXiv},
      primaryClass={cs.CV},
      url={https://arxiv.org/abs/2512.03621}, 
}

About

Official implementation of "ReCamDriving: LiDAR-Free Camera-Controlled Novel Trajectory Video Generation"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors