A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

This repository provides a comprehensive collection of papers focused on Multimodal Federated Learning (MMFL), with the primary researches already discussed in our latest review. This repository will continue to be updated, and we welcome you to give it a STAR⭐️.

Multimodal Federated Learning is a collaborative training process involving multiple clients, each with diverse modality settings and data, conducting learning tasks without disclosing their local raw data.

Table of Contents

Survey

Title	Authors	Materials
A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives	H Pan, XL Zhao, LP He, YC Shi, XG Lin (2024)	PUB
Multimodal Federated Learning: A Survey	L Che, J Wang, Y Zhou, F Ma (2023)	PUB
Federated Learning on Multimodal Data: A Comprehensive Survey	YM Lin, Y Gao, MG Gong, SJ Zhang, YQ Zhang, ZY Li (2023)	PUB
Multimodal Federated Learning in Healthcare: a Review	J Thrasher, A Devkota, P Siwakotai, R Chivukula, P Poudel, C Hu, B Bhattarai, P Gyawali (2023)	arXiv
A Survey of Advances in Multimodal Federated Learning with Applications	G Barry, E Konyar, B Harvill, C Johnstone (2024)	PUB

Unifying Achitectures

Title	Authors	Materials
FedMSplit: Correlation-Adaptive Federated Multi-Task Learning across Multimodal Split Networks	J Chen, A Zhang (KDD 2022)	PUB
FedMultimodal: A Benchmark For Multimodal Federated Learning	T Feng, D Bose, T Zhang, R Hebbar, A Ramakrishna, R Gupta, M Zhang, S Avestimehr, S Narayanan (2023)	PUB
A Multi-Modal Vertical Federated Learning Framework Based on Homomorphic Encryption	M Gong, Y Zhang, Y Gao, AK Qin, Y Wu, S Wang, Y Zhang (2023)	PUB
FedSea: Federated Learning via Selective Feature Alignment for Non-IID Multimodal Data	M Tan, Y Feng, L Chu, J Shi, R Xiao, H Tang, J Yu (2023)	PUB
On Disentanglement of Asymmetrical Knowledge Transfer for Modality-Task Agnostic Federated Learning	J Chen, A Zhang (AAAI 2024)	PUB
Adaptive Hyper-graph Aggregation for Modality-Agnostic Federated Learning	F Qi, S Li (CVPR 2024)	PUB
Robust multimodal federated learning for incomplete modalities	S Yu, J Wang, W Hussein, PCK Hung (2024)	PUB
Communication-Efficient Multimodal Federated Learning: Joint Modality and Client Selection	L Yuan, DJ Han, S Wang, D Upadhyay, C G. Brinton (2024)	arXiv

Applications

🏃‍♂️Human Activity Recognition

A unified framework for multi-modal federated learning

B Xiong, X Yang, F Qi, C Xu

Neurocomputing, 202204 PUB

Multimodal federated learning on iot data

Y Zhao, P Barnaghi, H Haddadi

IoTDI, 202205 PUB

Cross-modal federated human activity recognition via modality-agnostic and modality-specific representation learning

X Yang, B Xiong, Y Huang, C Xu

AAAI, 2022 PUB

Towards optimal multi-modal federated learning on non-iid data with hierarchical gradient blending

S Chen, B Li

INFOCOM, 2022 PUB

FL-FD: Federated learning-based fall detection with multimodal data fusion

P Qi, D Chiaro, F Piccialli

Information Fusion, 202311 PUB

Cross-Modal Federated Human Activity Recognition

X Yang, B Xiong, Y Huang, C Xu

IEEE Transactions on Pattern Analysis and Machine Intelligence, 202402 PUB

FedMEKT: Distillation-based Embedding Knowledge Transfer for Multimodal Federated Learning

HQ Le, MNH Nguyen, CM Thwal, Y Qiao, C Zhang, CS Hong

arXiv, 2023 arXiv

👩🏿‍⚕Medical Diagnosis

Multimodal melanoma detection with federated learning

BLY Agbley, J Li, AU Haq, EK Bankas, S Ahmad, IO Agyemang, D Kulevome, WD Ndiaye, B Cobbinah, S Latipova

ICCWAMTIP, 202112 PUB

Harmony: Heterogeneous multi-modal federated learning through disentangled model training

X Ouyang, Z Xie, H Fu, S Cheng, L Pan, N Ling, G Xing, J Zhou, J Huang

MobiSys, 2023 PUB

A federated learning system with data fusion for healthcare using multi-party computation and additive secret sharing

T Muazu, Y Mao, AU Muhammad, M Ibrahim, UMM Kumshe, O Samuel

Computer Communications, 202402 PUB

Medical report generation based on multimodal federated learning

J Chen, R Pan

Computerized Medical Imaging and Graphics, 202404 PUB

Federated Modality-Specific Encoders and Multimodal Anchors for Personalized Brain Tumor Segmentation

Q Dai, D Wei, H Liu, J Sun, L Wang, Y Zheng

AAAI, 2024 PUB

🔍️Cross-modal Retrieval

FedCMR: Federated cross-modal retrieval

L Zong, Q Xie, J Zhou, P Wu, X Zhang, B Xu

SIGIR, 2021 PUB

Multimodal Federated Learning via Contrastive Representation Ensemble

Q Yu, Y Liu, Y Wang, K Xu, J Liu

ICLR, 2023 PUB

💬Visual Question Answer

Think locally, act globally: Federated learning with local and global representations

PP Liang, T Liu, L Ziyin, NB Allen, RP Auerbach, D Brent, R Salakhutdinov, LP Morency

NeurIPS, 2019 arXiv

Federated learning for vision-and-language grounding problems

F Liu, X Wu, S Ge, W Fan, Y Zou

AAAI, 2020 PUB

Multimodal Federated Learning with Missing Modality via Prototype Mask and Contrast

G Bao, Q Zhang, D Miao, Z Gong, L Hu

arXiv, 2024 arXiv

🤣Emotion Recognition

Federated Meta-Learning for Emotion and Sentiment Aware Multi-modal Complaint Identification

A Singh, S Chandrasekar, S Saha, T Sen

EMNLP, 2023 PUB

Enhancing Emotion Recognition through Federated Learning: A Multimodal Approach with Convolutional Neural Networks

N Simić, S Suzić, N Milošević, V Stanojev, T Nosek, B Popović, D Bajović

Applied Sciences, 202402 PUB

FedCMD: A Federated Cross-Modal Knowledge Distillation for Drivers Emotion Recognition

S Bano, N Tonellotto, P Cassarà, A Gotta

ACM Transactions on Intelligent Systems and Technology, 202405 PUB

🧙‍♂️Prompt Learning and Model Finetuning

Pfedprompt: Learning personalized prompt for vision-language models in federated learning

T Guo, S Guo, J Wang

WWW, 2023 PUB

Global and Local Prompts Cooperation via Optimal Transport for Federated Learning

H Li, W Huang, J Wang, Y Shi

CVPR, 2024 arXiv

Feddat: An approach for foundation model finetuning in multi-modal heterogeneous federated learning

H Chen, Y Zhang, D Krompass, J Gu, V Tresp

AAAI, 2024 PUB

📡Internet of Things

Fedfusion: Manifold driven federated learning for multi-satellite and multi-modality fusion

DX Li, W Xie, Y Li, L Fang

Geoscience and Remote Sensing, 2023 PUB

Autofed: Heterogeneity-aware federated multimodal learning for robust autonomous driving

T Zheng, A Li, Z Chen, H Wang, J Luo

ACM MobiCom, 2023 PUB

FedUSL: A Federated Annotation Method for Driving Fatigue Detection based on Multimodal Sensing Data

S Yu, Q Yang, J Wang, C Wu

ACM Transactions on Sensor Networks, 202403 PUB

🏷️Image Classification

Fedclip: Fast generalization and personalization for clip in federated learning

W Lu, X Hu, J Wang, X Xie

Data Engineering Bulletin, 2023 arXiv

Multimodal Datasets

Datasets	Paper	Materials
VQA	Vqa: Visual question answering	PUB
MS COCO	Microsoft coco: Common objects in context	PUB
Flickr30k	Flickr30k entities: Collecting region-to-phrase correspondences for richer image-to-sentence models	PUB
IEMOCAP	IEMOCAP: interactive emotional dyadic motion capture database	PUB
MELD	Meld: A multimodal multi-party dataset for emotion recognition in conversations	arXiv
CMU-MOSEI	Multimodal language analysis in the wild: Cmu-mosei dataset and interpretable dynamic fusion graph	PUB
Kineics-400	The Kinetics Human Action Video Dataset	arXiv
UCF101	UCF101: A dataset of 101 human actions classes from videos in the wild	arXiv HOME
UR fall detection	Human fall detection on embedded platform using depth maps and wireless accelerometer	PUB HOME
Hateful Memes	The hateful memes challenge: Detecting hate speech in multimodal memes	PUB
UR-FUNNY	UR-FUNNY: A multimodal language dataset for understanding humor	PUB
CrisisMMD	CrisisMMD: Multimodal Twitter Datasets from Natural Disasters	PUB
Vehicle sensor	Vehicle classification in distributed sensor networks	PUB
MHealth	mHealthDroid: a novel framework for agile development of mobile health applications	PUB
PTB-XL	PTB-XL, a large publicly available electrocardiography dataset	HOME
ModelNet40	3D ShapeNets: A Deep Representation for Volumetric Shapes	PUB

Citation

If you find the listing and survey useful for your work, please cite the paper:

@article{pan2024survey,
  title={A survey of multimodal federated learning: background, applications, and perspectives},
  author={Pan, Hao and Zhao, Xiaoli and He, Lipeng and Shi, Yicong and Lin, Xiaogang},
  journal={Multimedia Systems},
  volume={30},
  number={4},
  pages={222},
  year={2024},
  publisher={Springer}
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
Fig1.jpg		Fig1.jpg
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

Survey

Unifying Achitectures

Applications

🏃‍♂️Human Activity Recognition

👩🏿‍⚕Medical Diagnosis

🔍️Cross-modal Retrieval

💬Visual Question Answer

🤣Emotion Recognition

🧙‍♂️Prompt Learning and Model Finetuning

📡Internet of Things

🏷️Image Classification

Multimodal Datasets

Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

A Survey of Multimodal Federated Learning: Background, Applications, and Perspectives

Survey

Unifying Achitectures

Applications

🏃‍♂️Human Activity Recognition

👩🏿‍⚕Medical Diagnosis

🔍️Cross-modal Retrieval

💬Visual Question Answer

🤣Emotion Recognition

🧙‍♂️Prompt Learning and Model Finetuning

📡Internet of Things

🏷️Image Classification

Multimodal Datasets

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Packages