@hustvl

HUST Vision Lab

Vision Lab of the School of EIC at HUST. Lab lead: @xinggangw

Welcome to the Vision Lab @ HUST!

🙋‍♀️ Introduction

Hello! This is the GitHub space for the Vision Lab led by Professor Xinggang Wang. We are based at the Artificial Intelligence Institute, School of Electronic Information and Communications, Huazhong University of Science and Technology (HUST).

Our research focuses on computer vision and deep learning. We are particularly interested in:

  • Multimodal Foundation Models
  • Visual Representation Learning
  • Object Detection, Segmentation, and Tracking
  • End-to-end Autonomous Driving
  • Novel Neural Architectures

Our group strives to push the boundaries of visual intelligence and has produced highly influential works in the field, including CCNet, Mask Scoring R-CNN, FairMOT, ByteTrack, EVA, MapTR, Vectorized Autonomous Driving (VAD), DiffusionDrive, Vision Mamba (Vim), 4D Gaussian Splatting (4DGS), YOLOS, YOLO-World, and LightningDiT & VA-VAE.

🌈 Contribution Guidelines & Collaboration

We actively contribute to the research community through publications and open-source projects.

  • Research Collaboration: We are open to collaborations in our areas of interest. Please feel free to reach out to Prof. Xinggang Wang (xgwang # hust.edu.cn).
  • Prospective Students: Our group has a strong track record of mentoring Ph.D. and Master's students who lead impactful publications. Interested students can find more information on Prof. Wang's faculty page.
  • Using Our Code: You are welcome to explore and use the code in our repositories. Please ensure you cite the corresponding publications appropriately. Specific details can usually be found in the README files of individual repositories.
  • Contributing to Projects: For guidelines on contributing to specific projects (e.g., bug reports, pull requests), please check the individual repositories.

👩‍💻 Useful Resources

Pinned

  1. Vim Public

    [ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

    Python 3.7k 266

  2. LightningDiT Public

    [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

    Python 1.3k 48

  3. 4DGaussians Public

    [CVPR 2024] 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering

    Jupyter Notebook 3.2k 301

  4. VAD Public

    [ICCV 2023] VAD: Vectorized Scene Representation for Efficient Autonomous Driving

    Python 1.2k 139

  5. MapTR Public

    [ICLR'23 Spotlight & ECCV'24 & IJCV'24] MapTR: Structured Modeling and Learning for Online Vectorized HD Map Construction

    Python 1.4k 229

  6. DiffusionDrive Public

    [CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

    Python 1.2k 105

Repositories

Showing 10 of 119 repositories
  • DiffusionVL Public

    [ArXiv 2025] DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models

    30 Apache-2.0 3 0 0 Updated Dec 18, 2025
  • MobileI2V Public

    [ArXiv 2025] MobileI2V: Fast and High-Resolution Image-to-Video on Mobile Devices

    Python 52 2 2 0 Updated Dec 17, 2025
  • SuperCLIP Public
    Python 63 Apache-2.0 4 2 0 Updated Dec 17, 2025
  • TBCM Public

    Image-Free Timestep Distillation via Continuous-Time Consistency with Trajectory-Sampled Pairs

    Python 20 0 1 0 Updated Dec 16, 2025
  • InfiniteVL Public

    This is the official repository of InfiniteVL

    Python 53 Apache-2.0 2 0 0 Updated Dec 16, 2025
  • LightningDiT Public

    [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models

    Python 1,325 MIT 48 18 1 Updated Dec 16, 2025
  • DiffusionDriveV2 Public

    DiffusionDriveV2: Reinforcement Learning-Constrained Truncated Diffusion Modeling in End-to-End Autonomous Driving

    Python 137 MIT 11 3 0 Updated Dec 15, 2025
  • 4DLangVGGT Public

    Official implementation of “4D LangVGGT: 4D Language-Visual Geometry Grounded Transformer”

    Python 63 MIT 2 3 0 Updated Dec 10, 2025
  • DiffusionDrive Public

    [CVPR 2025 Highlight] Truncated Diffusion Model for Real-Time End-to-End Autonomous Driving

    Python 1,168 MIT 105 22 1 Updated Dec 8, 2025
  • EVA-X Public

    [npjDigitalMed (Nature Portfolio)] EVA-X: A foundation model for general chest X-ray analysis with self-supervised learning

    Python 91 10 6 0 Updated Dec 6, 2025