Stars
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
A project demonstrating Lidar related AI solutions, including three GPU accelerated Lidar/camera DL networks (PointPillars, CenterPoint, BEVFusion) and the related libs (cuPCL, 3D SparseConvolution…
[ICLR 2023] Official implementation of the paper "DINO: DETR with Improved DeNoising Anchor Boxes for End-to-End Object Detection"
Real Time Detection Transformer and Tracking
RF-DETR Object Detection with DeepSORT Tracking
OpenPCDet Toolbox for LiDAR-based 3D Object Detection.
BoT-SORT: Robust Associations Multi-Pedestrian Tracking
[ECCV 2022] ByteTrack: Multi-Object Tracking by Associating Every Detection Box
LSD (LiDAR SLAM & Detection) is an open source perception architecture for autonomous vehicle/robotic
CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection
A Simple PointPillars PyTorch Implementation for 3D LiDAR(KITTI) Detection.
(IROS 2020, ECCVW 2020) Official Python Implementation for "3D Multi-Object Tracking: A Baseline and New Evaluation Metrics"
[ICCV 2023] StreamPETR: Exploring Object-Centric Temporal Modeling for Efficient Multi-View 3D Object Detection
Implementation of "TrackFormer: Multi-Object Tracking with Transformers”. [Conference on Computer Vision and Pattern Recognition (CVPR), 2022]
[ECCV2022] MOTR: End-to-End Multiple-Object Tracking with TRansformer
[CVPR2023] The official repo for OC-SORT: Observation-Centric SORT on video Multi-Object Tracking. OC-SORT is simple, online and robust to occlusion/non-linear motion.
Simple Online Realtime Tracking with a Deep Association Metric
🏆 北京理工大学相关的精选列表。(又名 bytes)A ranked list of awesome BITs — projects related to Beijing Institute of Technology. (aka. bytes)
End-to-End Object Detection with Transformers
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
FAST-LIVO2: Fast, Direct LiDAR-Inertial-Visual Odometry
[ECCV2024] Grounded Multimodal Large Language Model with Localized Visual Tokenization