YOLO-DTAD: Dynamic Task Alignment Detection Model for Multi-Category Power Defects Image

Introduction

This is our PyTorch implementation of the paper "YOLO-DTAD: Dynamic Task Alignment Detection Model for Multi-Category Power Defects Image" published in IEEE Transactions on Instrumentation and Measurement.

Quick Start Examples

Install

First, clone the project and configure the environment.

# 一台 24G, NVIDIA GeForce RTX 3090  
git clone https://github.com/LiuJiaji1999/yolo-dtad.git 
ultralytics版本为8.1.9, ultralytics/__init__.py/__version__             
pip install -r
    # ObjectDetection/DA
    python: 3.8.18 / [3.8.16]
    torch:  1.12.0+cu113 / [1.13.1+cu117]
    torchvision: 0.13.0+cu113 / [0.14.1+cu117] 
    numpy: 1.22.3
    timm: 0.9.8                 
    mmcv: 2.1.0                
    mmengine: 0.9.0  / 0.10.3

Train

python train.py

Test

python val.py

DTADH

To improve the detection performance for similar categories, the DTAD head (DTADH) is designed based on the decoupled head structure of the single-stage model.

ESLoss

To balance positive and negative samples, the EMASlideLoss (ESLoss) function is proposed, which dynamically adjusts sample loss weights by adaptively learning the intersection over union (IoU) threshold of the bounding box. In addition, the normalized Wasserstein distance (NWD) is introduced to reduce IoU regression errors of the multiscale bounding box.

$$ f\left ( IoU \right )=\left{\begin{matrix} 1,IoU\leq \mu -0.1 \\ e^{1-\mu },\mu -0.1< IoU < \mu \\ e^{1-IoU},IoU \geq \mu \end{matrix}\right.,\\ where,\mu =\beta \left ( \mu \right )\times \mu ^{'}+\left [ 1-\beta \left ( \mu \right ) \right ]\times 0.5 $$ $$ \beta(u)=\beta\times\left(1-e^{-\frac{u}{\tau}}\right) $$ $$ {ES\mathcal{L}}_{cls}(x,y)=f(IoU)\times\mathcal{L}_{cls}(x,y) $$

Experimental flow chart

Detection result

Citation

If you use this code or article in your research, please cite it using the following BibTeX entry:

@ARTICLE{10884832,
  author={Jiao, Runhai and Liu, Jiaji and Li, Kaihang and Qiao, Ruojiao and Liu, Yanzhi and Zhang, Wenbiao},
  journal={IEEE Transactions on Instrumentation and Measurement}, 
  title={YOLO–DTAD: Dynamic Task Alignment Detection Model for Multicategory Power Defects Image}, 
  year={2025},
  volume={74},
  number={},
  pages={1-14},
  keywords={Feature extraction;Head;YOLO;Insulators;Autonomous aerial vehicles;Adaptation models;Accuracy;Location awareness;Inspection;Defect detection;Exponential moving average (EMA);multicategory power defects;single-stage object detection;task interaction},
  doi={10.1109/TIM.2025.3541692}}

Explanation of the file

1. train.py ：训练模型的脚本
2. main_profile.py ：输出模型和模型每一层的参数,计算量的脚本
3. val.py ：使用训练好的模型计算指标的脚本
4. detect.py ： 推理的脚本
5. track.py：跟踪推理的脚本
6. test_yaml.py：用来测试所有yaml是否能正常运行的脚本
7. heatmap.py ：生成热力图的脚本
8. get_FPS.py ：计算模型储存大小、模型推理时间、FPS的脚本
    FPS最严谨来说就是1000(1s)/(preprocess+inference+postprocess),
    ✅没那么严谨的话就是只除以inference的时间
9. get_COCO_metrice.py：计算COCO指标的脚本
10. plot_result.py：绘制曲线对比图的脚本
11. transform_PGI.py去掉PGI模块.
12. export.py：  导出onnx脚本.
13. get_model_erf.py ： 绘制模型的有效感受野.

Attention

1. 执行pip uninstall ultralytics把安装在环境里面的ultralytics库卸载干净.<避免环境冲突导致一些奇怪的问题>
2. 卸载完成后同样再执行一次,如果出现WARNING: Skipping ultralytics as it is not installed.证明已经卸载干净.
3. 额外需要的包安装命令:
    numpy==1.23.5 albumentations==1.4.2
    pip install timm==0.9.8 thop efficientnet_pytorch==0.7.1 einops grad-cam==1.4.8 dill==0.3.6 albumentations==1.3.1 pytorch_wavelets==1.3.0 -i https://pypi.tuna.tsinghua.edu.cn/simple
    以下主要是使用dyhead必定需要安装的包,如果安装不成功dyhead没办法正常使用!如果执行了还是不成功,可看最下方mmcv安装问题.
        pip install -U openmim
        mim install mmengine -i https://pypi.tuna.tsinghua.edu.cn/simple
        mim install "mmcv>=2.0.0" -i https://pypi.tuna.tsinghua.edu.cn/simple
4.需要编译才能运行的一些模块:dcnv3、dcnv4
- 成功编译DCNv3 和 DCNv4：
    Installed /home/lenovo/anaconda3/envs/ObjectDetection/lib/python3.8/site-packages/DCNv3-1.1-py3.8-linux-x86_64.egg
    Processing dependencies for DCNv3==1.1
    Finished processing dependencies for DCNv3==1.1

    Installed /home/lenovo/anaconda3/envs/ObjectDetection/lib/python3.8/site-packages/DCNv4-1.0.0-py3.8-linux-x86_64.egg
    Processing dependencies for DCNv4==1.0.0
    Finished processing dependencies for DCNv4==1.0.0

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
COCO-result		COCO-result
dataset		dataset
docker		docker
docs		docs
examples		examples
img		img
result		result
runs		runs
ultralytics		ultralytics
README.md		README.md
detect.py		detect.py
get_COCO_metrice.py		get_COCO_metrice.py
get_FPS.py		get_FPS.py
get_model_erf.py		get_model_erf.py
heatmap.py		heatmap.py
loss_curve.png		loss_curve.png
main_profile.py		main_profile.py
metrice_curve.png		metrice_curve.png
plot_PR.py		plot_PR.py
plot_result.py		plot_result.py
pyproject.toml		pyproject.toml
splitImg.py		splitImg.py
test_env.py		test_env.py
test_yaml.py		test_yaml.py
track.py		track.py
train.py		train.py
transform_PGI.py		transform_PGI.py
tsne-mean.py		tsne-mean.py
update.md		update.md
val.py		val.py
video.mp4		video.mp4
vis.ipynb		vis.ipynb
yolov8m.pt		yolov8m.pt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

YOLO-DTAD: Dynamic Task Alignment Detection Model for Multi-Category Power Defects Image

Introduction

Quick Start Examples

DTADH

ESLoss

Experimental flow chart

Detection result

Citation

Explanation of the file

Attention

About

Uh oh!

Releases

Packages

Languages

LiuJiaji1999/yolo-dtad

Folders and files

Latest commit

History

Repository files navigation

YOLO-DTAD: Dynamic Task Alignment Detection Model for Multi-Category Power Defects Image

Introduction

Quick Start Examples

DTADH

ESLoss

Experimental flow chart

Detection result

Citation

Explanation of the file

Attention

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages