VFedMH: Vertical Federated Learning for Training Multiple Heterogeneous Models

📅 2023-10-20

📈 Citations: 0

✨ Influential: 0

career value

232K/year

🤖 AI Summary

To address slow convergence and poor generalization caused by model heterogeneity among participants in vertical federated learning (VFL), this paper proposes VFedMH—a framework enabling collaborative training of heterogeneous local models without sharing raw data. Its key contributions are: (1) a novel lightweight blind factor–based embedding protection mechanism that jointly ensures privacy preservation and computational efficiency; (2) a bidirectional collaborative optimization paradigm wherein the active party assists passive parties in gradient computation, enabling distributed, synchronized updates of heterogeneous model gradients; and (3) decoupling of embedding aggregation from backpropagation to enhance training stability. Extensive experiments on multiple benchmark datasets demonstrate that VFedMH significantly outperforms existing VFL methods, achieving faster, more stable convergence and higher prediction accuracy.

📝 Abstract

Vertical federated learning has garnered significant attention as it allows clients to train machine learning models collaboratively without sharing local data, which protects the client's local private data. However, existing VFL methods face challenges when dealing with heterogeneous local models among participants, which affects optimization convergence and generalization. To address this challenge, this paper proposes a novel approach called Vertical federated learning for training multiple Heterogeneous models (VFedMH). VFedMH focuses on aggregating the local embeddings of each participant's knowledge during forward propagation. To protect the participants' local embedding values, we propose an embedding protection method based on lightweight blinding factors. In particular, participants obtain local embedding using local heterogeneous models. Then the passive party, who owns only features of the sample, injects the blinding factor into the local embedding and sends it to the active party. The active party aggregates local embeddings to obtain global knowledge embeddings and sends them to passive parties. The passive parties then utilize the global embeddings to propagate forward on their local heterogeneous networks. However, the passive party does not own the sample labels, so the local model gradient cannot be calculated locally. To overcome this limitation, the active party assists the passive party in computing its local heterogeneous model gradients. Then, each participant trains their local model using the heterogeneous model gradients. The objective is to minimize the loss value of their respective local heterogeneous models. Extensive experiments are conducted to demonstrate that VFedMH can simultaneously train multiple heterogeneous models with heterogeneous optimization and outperform some recent methods in model performance.

Problem

Research questions and friction points this paper is trying to address.

Addresses heterogeneous model training in vertical federated learning

Protects local embeddings via lightweight blinding factors

Enables gradient computation without sample labels

Innovation

Methods, ideas, or system contributions that make the work stand out.

Aggregates local embeddings for heterogeneous models

Uses blinding factors to protect embedding values

Active party assists in computing model gradients

🔎 Similar Papers

Federated Large Language Models: Current Progress and Future Directions