Stars
Official implementation of RT-DETRv4: Painlessly Furthering Real-Time Object Detection with Vision Foundation Models
All-in-one training for vision models (YOLO, ViTs, RT-DETR, DINOv3): pretraining, fine-tuning, distillation.
[DEIMv2] Real Time Object Detection Meets DINOv3
D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]
[CVPR 2025] DEIM: DETR with Improved Matching for Fast Convergence
Keep track of big models in audio domain, including speech, singing, music etc.
Official implementation of "Unseen Visual Anomaly Generation" (CVPR 2025)
Fast and memory-efficient exact attention
High-quality Bayer interpolation and scaler in raw domain
Official implementation of the WACV 2025 (Oral) paper. RT-DETRv3: Real-time End-to-End Object Detection with Hierarchical Dense Positive Supervision.
Skeleton Recall Loss for Connectivity Conserving and Resource Efficient Segmentation of Thin Tubular Structures
Paper list and datasets for industrial image anomaly/defect detection (continuously updated).
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
PyTorch implementation of over 30 real-time semantic segmentation models, e.g. BiSeNetv1, BiSeNetv2, CGNet, ContextNet, DABNet, DDRNet, EDANet, ENet, ERFNet, ESPNet, ESPNetv2, FastSCNN, ICNet, LEDN…
A yolov8-pytorch repository that can be used to train on your own datasets.
[ECCV2024 Oral] Official implementation of the paper "Relation DETR: Exploring Explicit Position Relation Prior for Object Detection"
CnOCR: Awesome Chinese/English OCR Python toolkits based on PyTorch. It comes with 20+ well-trained models for different application scenarios and can be used directly after installation. [Based on PyTor…
[CVPR 2024] Official RT-DETR (RTDETR paddle pytorch), Real-Time DEtection TRansformer, DETRs Beat YOLOs on Real-time Object Detection. 🔥 🔥 🔥
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
OCR, layout analysis, reading order, table recognition in 90+ languages