GLMD-CA

This repo contains the official PyTorch implementation of our paper "Infrared Few-Shot Object Detection via Global-Local Mutual Distillation Network and Confusion-Aware Loss".

Abstract

Infrared few-shot object detection (IFSOD) tackles the crucial problem of detecting novel objects with limited annotated samples in the field of thermal imaging. However, beyond the fundamental challenges in few-shot object detection, we also have to overcome the degradation in feature discriminability caused by inherent limitations of infrared imagery. To this end, we propose an IFSOD method based on global-local mutual distillation network and confusion-aware loss (GLMD-CA). First, a novel neck network, termed global-local mutual distillation (GLMD), is designed with four key components: 1) a context modeling module for global dependency capture, 2) a local refinement module for detail enhancement, 3) a mutual distillation fusion module for global-local information integration, and 4) a classification-regression decoupling module for task-specific feature learning, which together improve the overall feature representation. In addition, to mitigate the classification confusion induced by sparse annotations, we introduce the confusion-aware (CA) loss to replace the conventional cross-entropy loss for classifier optimization, effectively suppressing misleading gradients from confusion samples. Finally, we construct a dataset, named IFSODD, for performance evaluation, which comprises 11 categories and over 1,200 images. Experimental results on IFSODD demonstrate that GLMD-CA outperforms other state-of-the-art detectors, achieving an nAP of 47.5 (+2.4), nAP50 of 69.3 (+0.9), and nAP75 of 54 (+4.2) under the 10-shot setting.

Updates!!

【2025/08/22】 We release the official PyTorch implementation of GLMD-CA.

Quick Start

1. Check Requirements

- Linux with Python == 3.7
- torch == 1.7.1 and a torchvision version that matches it
- CUDA 11.0
- GCC >= 4.9
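
The requirements above can be checked before building. The following is a minimal sketch, not part of the repo: the function names `check_environment`, `base_version`, and `installed_torch_version` are hypothetical, and only the operating system, Python, and torch pins are verified (CUDA and GCC are left out).

```python
import platform
import sys

# Pins taken from the requirements list above.
REQUIRED_PYTHON = (3, 7)
REQUIRED_TORCH = "1.7.1"


def base_version(version):
    """Strip a local build tag such as '+cu110' from a version string."""
    return version.split("+", 1)[0]


def check_environment(python_version, torch_version, system):
    """Return a list of human-readable problems; an empty list means OK."""
    problems = []
    if system != "Linux":
        problems.append(f"expected Linux, found {system}")
    if tuple(python_version[:2]) != REQUIRED_PYTHON:
        found = f"{python_version[0]}.{python_version[1]}"
        problems.append(f"expected Python 3.7, found {found}")
    if torch_version is None:
        problems.append("torch is not installed")
    elif base_version(torch_version) != REQUIRED_TORCH:
        problems.append(f"expected torch {REQUIRED_TORCH}, found {torch_version}")
    return problems


def installed_torch_version():
    """Return torch.__version__, or None when torch cannot be imported."""
    try:
        import torch
    except ImportError:
        return None
    return torch.__version__


if __name__ == "__main__":
    issues = check_environment(
        sys.version_info, installed_torch_version(), platform.system()
    )
    print("OK" if not issues else "\n".join(issues))
```

Running the file directly prints `OK` or one line per mismatch, which makes it easy to spot a wrong interpreter before installing Detectron2.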

2. Build GLMD-CA

Clone Code

git clone https://github.com/MinjieWan/GLMD-CA.git
cd GLMD-CA

Create a virtual environment (optional)
```
conda create -n GLMD-CA python=3.7
conda activate GLMD-CA
```

Install PyTorch 1.7.1 with CUDA 11.0

pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html

Install Detectron2
```
python3 -m pip install detectron2==0.3 -f https://dl.fbaipublicfiles.com/detectron2/wheels/cu110/torch1.7.1/index.html
```
- If you use a different version of PyTorch or CUDA, look up the matching Detectron2 build on this page: Detectron2.

Install other requirements.

python3 -m pip install -r requirements.txt

3. Prepare Data and Weights

Data Preparation

We train models on GCOCO and fine-tune them on IFSOD. IFSOD is also used for evaluation.

Download the COCO2017 dataset and convert it into GCOCO by running:

  python ./datasets/gcoco/coco_to_gray.py
  python ./datasets/gcoco/build_gcoco.py
  # Modify the data folder paths in these scripts to match your setup

Put the GCOCO dataset in the following directory:

  ...
  datasets
    | -- gcoco
           | -- train
           | -- annotations
  ...

Download the IFSOD dataset from BaiDuYun (code: 0000) and put it in the following directory:

 ...
 datasets
   | -- ifsod
          | -- images
          | -- annotations
 ...
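
Before training, it can help to confirm that both dataset folders match the trees shown above. This is a minimal sketch under that assumption; `missing_dataset_dirs` and `EXPECTED_LAYOUT` are hypothetical names and are not provided by the repo.

```python
from pathlib import Path

# Sub-folders named in the directory trees above.
EXPECTED_LAYOUT = {
    "gcoco": ("train", "annotations"),
    "ifsod": ("images", "annotations"),
}


def missing_dataset_dirs(datasets_root):
    """Return the expected dataset folders that do not exist yet."""
    root = Path(datasets_root)
    missing = []
    for dataset, subdirs in EXPECTED_LAYOUT.items():
        for subdir in subdirs:
            path = root / dataset / subdir
            if not path.is_dir():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # Run from the repository root so that "datasets" resolves correctly.
    for path in missing_dataset_dirs("datasets"):
        print(f"missing: {path}")
```

An empty result means every folder listed in the two trees is present; the check does not inspect the files inside them.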

Generate the few-shot dataset for fine-tuning from IFSOD by running:

  python ./datasets/ifsod/prepare_ifsod_few_shot.py
  # Modify the data folder paths in this script to match your setup
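
To illustrate what a K-shot split looks like, here is a minimal sketch of per-category sampling. It is not the logic of `prepare_ifsod_few_shot.py`, which is not reproduced here: it assumes COCO-style annotation records with `category_id` and `image_id` keys, the name `sample_k_shot` is hypothetical, and it samples instances independently per class, which is a simplification.

```python
import random
from collections import defaultdict


def sample_k_shot(annotations, k, seed=0):
    """Pick up to k annotations per category from COCO-style records.

    Each annotation is a dict with at least "category_id" and "image_id".
    Classes with fewer than k instances keep all of their instances.
    """
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    by_category = defaultdict(list)
    for ann in annotations:
        by_category[ann["category_id"]].append(ann)

    few_shot = {}
    for category_id in sorted(by_category):
        candidates = by_category[category_id]
        few_shot[category_id] = rng.sample(candidates, min(k, len(candidates)))
    return few_shot


if __name__ == "__main__":
    # Toy annotations: 30 instances spread evenly over 3 categories.
    toy = [{"id": i, "image_id": i, "category_id": i % 3} for i in range(30)]
    for category_id, shots in sample_k_shot(toy, k=5).items():
        print(category_id, [ann["id"] for ann in shots])
```

Fixing the seed matters because few-shot results vary noticeably with the sampled instances, so a reproducible split makes runs comparable.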

Weights Preparation
- ResNet-101 pretrained model
- DeFRCN PCB model

4. Training and Evaluation

Train

# single GPU
bash train.sh
# multiple GPUs
bash mgpu_train.sh

Evaluation

# single GPU
bash finetune.sh
# multiple GPUs
bash mgpu_finetune.sh
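
The four scripts above differ only by stage and GPU count, so the choice can be automated. The following is a minimal sketch of such a wrapper; `pick_script` and `run_stage` are hypothetical helpers that are not part of the repo, and they assume the scripts are run from the repository root without extra arguments.

```python
import subprocess

# Script names as listed in the Train and Evaluation steps above.
SCRIPTS = {
    "train": ("train.sh", "mgpu_train.sh"),
    "finetune": ("finetune.sh", "mgpu_finetune.sh"),
}


def pick_script(stage, num_gpus):
    """Return the single-GPU or multi-GPU script for the given stage."""
    if stage not in SCRIPTS:
        raise ValueError(f"unknown stage: {stage!r}")
    if num_gpus < 1:
        raise ValueError("at least one GPU is required")
    single, multi = SCRIPTS[stage]
    return single if num_gpus == 1 else multi


def run_stage(stage, num_gpus):
    """Run the chosen script with bash and raise if it exits non-zero."""
    subprocess.run(["bash", pick_script(stage, num_gpus)], check=True)
```

For example, `run_stage("train", 1)` would invoke `bash train.sh`, while `run_stage("finetune", 4)` would invoke `bash mgpu_finetune.sh`.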

Acknowledgement

This repo is developed based on DeFRCN and Detectron2. Please check them for more details and features.
