Federated modality-specific encoders and partially personalized fusion decoder for multimodal brain tumor segmentation

📅 2025-08-18
🏛️ Medical Image Anal.
📈 Citations: 4
Influential: 0
🤖 AI Summary
This work addresses two concurrent challenges in federated multimodal medical image segmentation: clients with missing modalities and the demand for personalized models. The authors propose FedMEPD, a framework that assigns a dedicated, fully federated encoder to each modality to accommodate heterogeneous modality availability across clients, and introduces a partially personalized fusion decoder. Clients with incomplete modalities use cross-attention against global multimodal representation anchors to compensate for the missing modality information. Presented as the first approach to jointly tackle modality heterogeneity and personalization under the federated learning paradigm, FedMEPD outperforms existing multimodal and personalized federated learning methods on the BraTS 2018 and 2020 datasets.

📝 Abstract
Most existing federated learning (FL) methods for medical image analysis only considered intramodal heterogeneity, limiting their applicability to multimodal imaging applications. In practice, some FL participants may possess only a subset of the complete imaging modalities, posing intermodal heterogeneity as a challenge to effectively training a global model on all participants' data. Meanwhile, each participant expects a personalized model tailored to its local data characteristics in FL. This work proposes a new FL framework with federated modality-specific encoders and partially personalized multimodal fusion decoders (FedMEPD) to address the two concurrent issues. Specifically, FedMEPD employs an exclusive encoder for each modality to account for the intermodal heterogeneity. While these encoders are fully federated, the decoders are partially personalized to meet individual needs, using the discrepancy between global and local parameter updates to dynamically determine which decoder filters are personalized. Implementation-wise, a server with full-modal data employs a fusion decoder to fuse representations from all modality-specific encoders, thus bridging the modalities to optimize the encoders via backpropagation. Moreover, multiple anchors are extracted from the fused multimodal representations and distributed to the clients in addition to the model parameters. Conversely, the clients with incomplete modalities calibrate their missing-modal representations toward the global full-modal anchors via scaled dot-product cross-attention, making up for the information loss due to absent modalities. FedMEPD is validated on the BraTS 2018 and 2020 multimodal brain tumor segmentation benchmarks. Results show that it outperforms various up-to-date methods for multimodal and personalized FL, and its novel designs are effective.
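As a rough illustration of the calibration step described in the abstract, the following pure-Python sketch applies scaled dot-product cross-attention with a client's features as queries and the server's anchors as both keys and values. The function names `softmax` and `calibrate`, the absence of learned projection matrices, and the toy vectors are assumptions made for illustration; the paper's actual layer design may differ.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def calibrate(local_feats, anchors):
    """Scaled dot-product cross-attention (hypothetical helper).

    local_feats: list of d-dimensional query vectors from a client
                 that lacks some modalities.
    anchors:     list of d-dimensional global full-modal anchors,
                 used here as both keys and values (an assumption).
    Returns one calibrated d-dimensional vector per query.
    """
    d = len(anchors[0])
    scale = math.sqrt(d)
    calibrated = []
    for q in local_feats:
        # Similarity of this query to every anchor, scaled by sqrt(d).
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / scale for k in anchors]
        weights = softmax(scores)
        # Attention-weighted average of the anchors.
        out = [sum(w * k[j] for w, k in zip(weights, anchors)) for j in range(d)]
        calibrated.append(out)
    return calibrated


anchors = [[1.0, 0.0], [0.0, 1.0]]
result = calibrate([[10.0, 0.0], [0.0, 0.0]], anchors)
```

In this toy setup, a query aligned with the first anchor is pulled almost entirely toward it, while an all-zero query attends to both anchors equally and lands at their midpoint `[0.5, 0.5]`.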
Problem

Research questions and friction points this paper is trying to address.

federated learning
multimodal
intermodal heterogeneity
personalized model
brain tumor segmentation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning
Multimodal Fusion
Partial Personalization
Modality-specific Encoders
Cross-attention Calibration
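The partial personalization idea can be sketched in a similar spirit. The abstract states only that the discrepancy between global and local parameter updates decides which decoder filters are personalized; it does not give the measure. The sketch below assumes, purely for illustration, that a filter is personalized when the cosine similarity between its local and global updates falls below a threshold. The names `cosine`, `merge_decoder`, and `threshold` are hypothetical, and each filter is flattened to a plain list of floats.

```python
import math


def cosine(u, v):
    """Cosine similarity; returns 0.0 if either vector has zero norm."""
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    if nu == 0.0 or nv == 0.0:
        return 0.0
    return sum(a * b for a, b in zip(u, v)) / (nu * nv)


def merge_decoder(prev, local_new, global_new, threshold=0.0):
    """Choose local or global weights per filter (illustrative rule only).

    prev:       decoder filters at the start of the round.
    local_new:  filters after local training on the client.
    global_new: filters received from the server after aggregation.
    A filter whose local and global updates disagree (cosine similarity
    below `threshold`) keeps its local weights; others take the global ones.
    """
    merged, mask = [], []
    for p, l, g in zip(prev, local_new, global_new):
        local_update = [a - b for a, b in zip(l, p)]
        global_update = [a - b for a, b in zip(g, p)]
        personalize = cosine(local_update, global_update) < threshold
        merged.append(list(l) if personalize else list(g))
        mask.append(personalize)
    return merged, mask


prev = [[0.0, 0.0], [0.0, 0.0]]
local_new = [[1.0, 0.0], [1.0, 0.0]]
global_new = [[1.0, 0.0], [-1.0, 0.0]]
merged, mask = merge_decoder(prev, local_new, global_new)
```

Here the first filter's updates agree, so it adopts the global weights, while the second filter's updates point in opposite directions, so it stays local. Because the mask is recomputed every round, the set of personalized filters can change over training, which matches the "dynamically determine" wording of the abstract.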
Hong Liu
The University of Osaka & Xiamen University
Trustworthy AI, Computer Vision, Machine Learning
Dong Wei
Jarvis Research Center, Tencent YouTu Lab, Shenzhen, 518075, Guangdong, China
Qian Dai
Department of Computer Science at School of Informatics, Xiamen University, Xiamen, 361005, Fujian, China
Xian Wu
Director of Tencent Jarvis Lab
large language model, data mining, machine learning
Yefeng Zheng
Professor, Westlake University, Hangzhou, China, IEEE Fellow, AIMBE Fellow
AI in Health, Medical Imaging, Computer Vision, Natural Language Processing, Large Language Model
Liansheng Wang
National Institute for Data Science in Health and Medicine, Xiamen University, Xiamen, 361005, Fujian, China; Department of Computer Science at School of Informatics, Xiamen University, Xiamen, 361005, Fujian, China