This project introduces a novel object detection model, FGFA-YOLOX, specifically designed to enhance the detection of micromobility vehicles (MMVs) such as e-scooters, bikes, and skateboards in urban traffic videos. By integrating the accuracy and speed of single-frame detection with the enriched temporal features of video-based frameworks, FGFA-YOLOX significantly advances the state-of-the-art in urban mobility analytics.
- Introduction of the FGFA-YOLOX model, offering a significant improvement on video object detection models.
- Construction of a comprehensive dataset focused on urban micromobility, aiming to propel further research by making it publicly available.
Below is the architecture of our FGFA-YOLOX model, illustrating the innovative integration of spatio-temporal features for enhanced object detection in complex urban environments.
The PolyMMV dataset is crafted for detecting Micromobility Vehicles in urban environments, comprising:
- Video footage capturing diverse urban traffic scenarios.
- Annotations in three formats: YOLO, Pascal VOC, and COCO.
- Pre-trained model checkpoints, allowing users to benchmark or further fine-tune on the PolyMMV dataset.
- A results folder that contains the results for each model trained.
For accessing these resources, visit our dataset Drive.
For detailed instructions on setting up the project environment and running the FGFA-YOLOX model, please refer to the installation guide.
We extend our gratitude to the developers of mmtracking for their comprehensive platform, which played an important role in the implementation of our proposed FGFA-YOLOX architecture. Their work provided the necessary infrastructure that enabled us to seamlessly integrate our model. For their invaluable contribution to our project and the broader research community, we sincerely thank them.
Learn more about their work at mmtracking GitHub.
For any questions or issues faced during the execution of code, feel free to reach out to me in: khalil.sabri@polymtl.ca .
If you find our work useful in your research, please use the following BibTeX entry:
@inproceedings{sabri2024detection,
title = {Detection of Micromobility Vehicles in Urban Traffic Videos},
author = {Sabri, Khalil and Djilali, Célia and Bilodeau, Guillaume-Alexandre and Saunier, Nicolas and Bouachir, Wassim},
booktitle = {Proceedings of the 21st Conference on Robots and Vision (CRV)},
year = {2024},
url = {https://arxiv.org/abs/2402.18503}
}