Personalized federated prototype learning in mixed heterogeneous data scenarios

📅 2025-10-04
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address performance degradation in federated learning caused by mixed heterogeneity—simultaneous feature and label distribution shifts across clients—this paper proposes Personalized Federated Prototype Learning (PFPL). The method’s core contributions are: (i) the first construction of personalized, unbiased class prototypes per client under mixed heterogeneity, augmented with domain knowledge; and (ii) a consistency regularization mechanism that aligns local instance representations with prototypes to enhance cross-client semantic consistency. PFPL integrates prototype-based learning, consistency constraints, and distributed optimization, achieving faster convergence and improved generalization while maintaining low communication overhead. Experiments on Digits and Office-Caltech benchmark datasets demonstrate that PFPL significantly outperforms state-of-the-art baselines: it reduces average communication rounds by 32% and improves classification accuracy by 2.8–5.4 percentage points.

Technology Category

Application Category

📝 Abstract
Federated learning has received significant attention for its ability to simultaneously protect customer privacy and leverage distributed data from multiple devices for model training. However, conventional approaches often focus on isolated heterogeneous scenarios, resulting in skewed feature distributions or label distributions. Meanwhile, data heterogeneity is actually a key factor in improving model performance. To address this issue, we propose a new approach called PFPL in mixed heterogeneous scenarios. The method provides richer domain knowledge and unbiased convergence targets by constructing personalized, unbiased prototypes for each client. Moreover, in the local update phase, we introduce consistent regularization to align local instances with their personalized prototypes, which significantly improves the convergence of the loss function. Experimental results on Digits and Office Caltech datasets validate the effectiveness of our approach and successfully reduce the communication cost.
Problem

Research questions and friction points this paper is trying to address.

Addresses data heterogeneity in federated learning scenarios
Builds personalized unbiased prototypes for improved model performance
Reduces communication costs while maintaining privacy protection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Personalized unbiased prototypes for each client
Consistent regularization aligns instances with prototypes
Reduces communication cost while improving convergence
🔎 Similar Papers
No similar papers found.
J
Jiahao Zeng
Guangxi Normal University, Guilin, 541004, China; Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, GuangXi Normal University, Guilin, 541004, China
W
Wolong Xing
Guangxi Normal University, Guilin, 541004, China; Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, GuangXi Normal University, Guilin, 541004, China
L
Liangtao Shi
Hefei University of Technology, Hefei, 230009, China
X
Xin Huang
Guangxi Normal University, Guilin, 541004, China; Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, GuangXi Normal University, Guilin, 541004, China
Jialin Wang
Jialin Wang
Postdoctoral Researcher, The Hong Kong University of Science and Technology (Guangzhou)
Virtual RealityHuman-Computer InteractionVisual PerceptionRoboticsComputer Graphics
Z
Zhile Cao
Guangxi Normal University, Guilin, 541004, China; Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, GuangXi Normal University, Guilin, 541004, China
Z
Zhenkui Shi
Guangxi Normal University, Guilin, 541004, China; Key Lab of Education Blockchain and Intelligent Technology, Ministry of Education, GuangXi Normal University, Guilin, 541004, China