Skip to content

haopr/MMFL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 

Repository files navigation

A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

This repository provides a comprehensive collection of papers focused on Multimodal Federated Learning (MMFL), with the primary researches already discussed in our latest review. This repository will continue to be updated, and we welcome you to give it a STAR⭐️.

Multimodal Federated Learning is a collaborative training process involving multiple clients, each with diverse modality settings and data, conducting learning tasks without disclosing their local raw data.

Unimodal vs Multimodal

Table of Contents

Survey

Title Authors Materials
A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives H Pan, XL Zhao, LP He, YC Shi, XG Lin (2024) PUB
Multimodal Federated Learning: A Survey L Che, J Wang, Y Zhou, F Ma (2023) PUB
Federated Learning on Multimodal Data: A Comprehensive Survey YM Lin, Y Gao, MG Gong, SJ Zhang, YQ Zhang, ZY Li (2023) PUB
Multimodal Federated Learning in Healthcare: a Review J Thrasher, A Devkota, P Siwakotai, R Chivukula, P Poudel, C Hu, B Bhattarai, P Gyawali (2023) arXiv
A Survey of Advances in Multimodal Federated Learning with Applications G Barry, E Konyar, B Harvill, C Johnstone (2024) PUB

Unifying Achitectures

Title Authors Materials
FedMSplit: Correlation-Adaptive Federated Multi-Task Learning across Multimodal Split Networks J Chen, A Zhang (KDD 2022) PUB
FedMultimodal: A Benchmark For Multimodal Federated Learning T Feng, D Bose, T Zhang, R Hebbar, A Ramakrishna, R Gupta, M Zhang, S Avestimehr, S Narayanan (2023) PUB
A Multi-Modal Vertical Federated Learning Framework Based on Homomorphic Encryption M Gong, Y Zhang, Y Gao, AK Qin, Y Wu, S Wang, Y Zhang (2023) PUB
FedSea: Federated Learning via Selective Feature Alignment for Non-IID Multimodal Data M Tan, Y Feng, L Chu, J Shi, R Xiao, H Tang, J Yu (2023) PUB
On Disentanglement of Asymmetrical Knowledge Transfer for Modality-Task Agnostic Federated Learning J Chen, A Zhang (AAAI 2024) PUB
Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning F Qi, S Li (CVPR 2024) PUB
Robust multimodal federated learning for incomplete modalities S Yu, J Wang, W Hussein, PCK Hung (2024) PUB
Communication-Efficient Multimodal Federated Learning: Joint Modality and Client Selection L Yuan, DJ Han, S Wang, D Upadhyay, C G. Brinton (2024) arXiv

Applications

🏃‍♂️Human Activity Recognition

A unified framework for multi-modal federated learning

B Xiong, X Yang, F Qi, C Xu

Neurocomputing, 202204 PUB

Multimodal federated learning on iot data

Y Zhao, P Barnaghi, H Haddadi

IoTDI, 202205 PUB

Cross-modal federated human activity recognition via modality-agnostic and modality-specific representation learning

X Yang, B Xiong, Y Huang, C Xu

AAAI, 2022 PUB

Towards optimal multi-modal federated learning on non-iid data with hierarchical gradient blending

S Chen, B Li

INFOCOM, 2022 PUB

FL-FD: Federated learning-based fall detection with multimodal data fusion

P Qi, D Chiaro, F Piccialli

Information Fusion, 202311 PUB

Cross-Modal Federated Human Activity Recognition

X Yang, B Xiong, Y Huang, C Xu

IEEE Transactions on Pattern Analysis and Machine Intelligence, 202402 PUB

FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal Federated Learning

HQ Le, MNH Nguyen, CM Thwal, Y Qiao, C Zhang, CS Hong

arXiv, 2023 arXiv

👩🏿‍⚕Medical Diagnosis

Multimodal melanoma detection with federated learning

BLY Agbley, J Li, AU Haq, EK Bankas, S Ahmad, IO Agyemang, D Kulevome, WD Ndiaye, B Cobbinah, S Latipova

ICCWAMTIP, 202112 PUB

Harmony: Heterogeneous multi-modal federated learning through disentangled model training

X Ouyang, Z Xie, H Fu, S Cheng, L Pan, N Ling, G Xing, J Zhou, J Huang

MobiSys, 2023 PUB

A federated learning system with data fusion for healthcare using multi-party computation and additive secret sharing

T Muazu, Y Mao, AU Muhammad, M Ibrahim, UMM Kumshe, O Samuel

Computer Communications, 202402 PUB

Medical report generation based on multimodal federated learning

J Chen, R Pan

Computerized Medical Imaging and Graphics, 202404 PUB

Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation

Q Dai, D Wei, H Liu, J Sun, L Wang, Y Zheng

AAAI, 2024 PUB

🔍️Cross-modal Retrieval

FedCMR: Federated cross-modal retrieval

L Zong, Q Xie, J Zhou, P Wu, X Zhang, B Xu

SIGIR, 2021 PUB

Multimodal Federated Learning via Contrastive Representation Ensemble

Q Yu, Y Liu, Y Wang, K Xu, J Liu

ICLR, 2023 PUB

💬Visual Question Answer

Think locally, act globally: Federated learning with local and global representations

PP Liang, T Liu, L Ziyin, NB Allen, RP Auerbach, D Brent, R Salakhutdinov, LP Morency

NeurIPS, 2019 arXiv

Federated learning for vision-and-language grounding problems

F Liu, X Wu, S Ge, W Fan, Y Zou

AAAI, 2020 PUB

Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast

G Bao, Q Zhang, D Miao, Z Gong, L Hu

arXiv, 2024 arXiv

🤣Emotion Recognition

Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification

A Singh, S Chandrasekar, S Saha, T Sen

EMNLP, 2023 PUB

Enhancing Emotion Recognition through Federated Learning: A Multimodal Approach with Convolutional Neural Networks

N Simić, S Suzić, N Milošević, V Stanojev, T Nosek, B Popović, D Bajović

Applied Sciences, 202402 PUB

FedCMD: A Federated Cross-Modal Knowledge Distillation for Drivers Emotion Recognition

S Bano, N Tonellotto, P Cassarà, A Gotta

ACM Transactions on Intelligent Systems and Technology, 202405 PUB

🧙‍♂️Prompt Learning and Model Finetuning

Pfedprompt: Learning personalized prompt for vision-language models in federated learning

T Guo, S Guo, J Wang

WWW, 2023 PUB

Global and Local Prompts Cooperation via Optimal Transport for Federated Learning

H Li, W Huang, J Wang, Y Shi

CVPR, 2024 arXiv

Feddat: An approach for foundation model finetuning in multi-modal heterogeneous federated learning

H Chen, Y Zhang, D Krompass, J Gu, V Tresp

AAAI, 2024 PUB

📡Internet of Things

Fedfusion: Manifold driven federated learning for multi-satellite and multi-modality fusion

DX Li, W Xie, Y Li, L Fang

Geoscience and Remote Sensing, 2023 PUB

Autofed: Heterogeneity-aware federated multimodal learning for robust autonomous driving

T Zheng, A Li, Z Chen, H Wang, J Luo

ACM MobiCom, 2023 PUB

FedUSL: A Federated Annotation Method for Driving Fatigue Detection based on Multimodal Sensing Data

S Yu, Q Yang, J Wang, C Wu

ACM Transactions on Sensor Networks, 202403 PUB

🏷️Image Classification

Fedclip: Fast generalization and personalization for clip in federated learning

W Lu, X Hu, J Wang, X Xie

Data Engineering Bulletin, 2023 arXiv

Multimodal Datasets

Datasets Paper Materials
VQA Vqa: Visual question answering PUB
MS COCO Microsoft coco: Common objects in context PUB
Flickr30k Flickr30k entities: Collecting region-to-phrase correspondences for richer image-to-sentence models PUB
IEMOCAP IEMOCAP: interactive emotional dyadic motion capture database PUB
MELD Meld: A multimodal multi-party dataset for emotion recognition in conversations arXiv
CMU-MOSEI Multimodal language analysis in the wild: Cmu-mosei dataset and interpretable dynamic fusion graph PUB
Kineics-400 The Kinetics Human Action Video Dataset arXiv
UCF101 UCF101: A dataset of 101 human actions classes from videos in the wild arXiv HOME
UR fall detection Human fall detection on embedded platform using depth maps and wireless accelerometer PUB HOME
Hateful Memes The hateful memes challenge: Detecting hate speech in multimodal memes PUB
UR-FUNNY UR-FUNNY: A multimodal language dataset for understanding humor PUB
CrisisMMD CrisisMMD: Multimodal Twitter Datasets from Natural Disasters PUB
Vehicle sensor Vehicle classification in distributed sensor networks PUB
MHealth mHealthDroid: a novel framework for agile development of mobile health applications PUB
PTB-XL PTB-XL, a large publicly available electrocardiography dataset HOME
ModelNet40 3D ShapeNets: A Deep Representation for Volumetric Shapes PUB

Citation

If you find the listing and survey useful for your work, please cite the paper:

@article{pan2024survey,
  title={A survey of multimodal federated learning: background, applications, and perspectives},
  author={Pan, Hao and Zhao, Xiaoli and He, Lipeng and Shi, Yicong and Lin, Xiaogang},
  journal={Multimedia Systems},
  volume={30},
  number={4},
  pages={222},
  year={2024},
  publisher={Springer}
}

About

This repository provides a comprehensive collection of papers focused on Multimodal Federated Learning (MMFL).

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors