MQSPN

Code for the paper accepted by AAAI 2025: "Multi-Grained Query-Guided Set Prediction Network for Grounded Multimodal Named Entity Recognition." (Paper Link).

Requirements

python 3.8
pytorch 1.9.1
transformers 4.11.3

We recommend using a conda environment. Make sure your GPU has at least 24 GB of memory.

conda create -n mqspn python=3.8
conda activate mqspn
pip install -r requirements.txt
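After installation, a quick sanity check can confirm that the pinned versions listed above are the ones actually installed. The following is a minimal sketch, not part of this repo: the helper name find_mismatches is hypothetical, and it assumes PyTorch is installed under its usual pip name "torch".

```python
from importlib import metadata

# Pins taken from the Requirements list above; "torch" is the pip name for PyTorch.
EXPECTED = {"torch": "1.9.1", "transformers": "4.11.3"}


def find_mismatches(expected):
    """Return {package: installed_version_or_None} for every pin that is not met."""
    mismatches = {}
    for name, wanted in expected.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            installed = None  # package is not installed at all
        # Local build tags such as "1.9.1+cu111" still count as a match.
        if installed is None or installed.split("+")[0] != wanted:
            mismatches[name] = installed
    return mismatches


if __name__ == "__main__":
    for name, installed in find_mismatches(EXPECTED).items():
        print(f"{name}: expected {EXPECTED[name]}, found {installed}")
```

If the script prints nothing, both pins are satisfied. It only compares version strings and does not check GPU memory.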

Datasets

We train and evaluate the proposed model on two publicly released datasets, Twitter-GMNER and Twitter-FMNERG. Detailed information about each dataset is available in its original paper.

The preprocessed CoNLL-format files are provided in this repo's data directory. For each tweet, the first line is its image id, and the following lines are its textual content.
Download the images associated with each tweet via this link.
Following H-Index, we use the open-source VinVL as a class-agnostic RPN to identify all candidate regions. To keep MQSPN training efficient, we run VinVL as a separate process and store the pre-detection results in a folder named "Vinvl_detection_path". The features and regions extracted by VinVL can be found at this link.
Place the corresponding files in the data directory using the following structure:

|----datasets\
    |----images_annotation # annotation xml file here
    |----Vinvl_detection_path # candidate regions for twitter15 and twitter17 here
    |----twitter # original raw text with BIO annotations
    |    |----twitter10k_types.json
    |    |----dev.txt
    |    |----samples.txt
    |    |----train.txt
    |    |----test.txt
    |----raw_images # all twitter15 and twitter17 raw images here
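The text files under twitter (train.txt, dev.txt, test.txt) follow the layout described above: an image id line, then the tweet's text lines. A reader for that layout might look like the sketch below. Several details are assumptions rather than facts from this repo: that tweets are separated by blank lines, that each text line holds a whitespace-separated token with its BIO tag in the last column, and the function name read_samples and the example ids are hypothetical. Check them against the actual files before relying on this.

```python
def read_samples(lines):
    """Group lines into samples: first line is the image id, the rest are tokens."""
    samples, block = [], []
    for raw in list(lines) + [""]:  # trailing "" flushes the last sample
        line = raw.strip()
        if line:
            block.append(line)
            continue
        if block:
            # Assumption: first line of a block is the image id line.
            image_id, rows = block[0], [row.split() for row in block[1:]]
            samples.append({
                "image_id": image_id,
                "tokens": [row[0] for row in rows],
                "tags": [row[-1] for row in rows],  # assumed: tag is the last column
            })
            block = []
    return samples


example = [
    "IMG_0001",       # hypothetical image id line
    "Kevin B-PER",
    "Durant I-PER",
    "smiles O",
    "",
    "IMG_0002",
    "Hello O",
]
samples = read_samples(example)
print(samples[0]["image_id"], samples[0]["tokens"], samples[0]["tags"])
```

For a real file, pass an open file handle instead of the list, for example read_samples(open(path, encoding="utf-8")).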

Usage

Train for MQSPN

python run.py train --config configs/twitter.conf

Evaluation

python run.py eval --config configs/twitter_eval.conf
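To run both steps back to back, the two commands above can be wrapped in a small driver. This is only a sketch: build_command and train_then_eval are hypothetical helpers, not part of this repo, and it assumes the script is launched from the repository root. Whether configs/twitter_eval.conf must be edited to point at the newly trained checkpoint is not stated here, so verify that before using it.

```python
import subprocess
import sys


def build_command(mode, config):
    """Build the argument list for one `run.py` invocation."""
    return [sys.executable, "run.py", mode, "--config", config]


def train_then_eval(train_config="configs/twitter.conf",
                    eval_config="configs/twitter_eval.conf"):
    # check=True raises CalledProcessError if a step exits with a non-zero
    # status, so evaluation is skipped when training fails.
    subprocess.run(build_command("train", train_config), check=True)
    subprocess.run(build_command("eval", eval_config), check=True)
```

Calling train_then_eval() runs training first and evaluation second, using the same interpreter that runs the driver.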

Acknowledgements

The datasets we use are from Yu et al. and Wang et al. Parts of the code are based on the open-source implementations of PIQN, DQSetGen and MKGFormer. Thanks for their great work!

Citation

@inproceedings{tang2025multi,
  title={Multi-grained query-guided set prediction network for grounded multimodal named entity recognition},
  author={Tang, Jielong and Wang, Zhenxing and Gong, Ziyang and Yu, Jianxing and Zhu, Xiangwei and Yin, Jian},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={39},
  number={24},
  pages={25246--25254},
  year={2025}
}
