🤖 AI Summary
Problem: Existing vertical federated learning (VFL) frameworks for privacy-sensitive multimodal classification in edge AI, such as mobile health diagnostics, rely on simplistic client-side feature fusion, which limits model performance. Method: We propose a novel VFL framework tailored to resource-constrained edge devices. It introduces a lightweight feature disentanglement module on each client to separate modality-specific and shared representations, and a cross-modal Transformer on the server to enable context-aware, privacy-preserving fusion. Contribution/Results: This co-designed mechanism is the first to integrate disentangled representation learning and cross-modal modeling into the VFL paradigm. Evaluated on the multimodal HAM10000 skin lesion dataset, our method significantly outperforms standard VFL baselines, improving robustness and generalization while rigorously preserving data privacy.
📝 Abstract
Vertical Federated Learning (VFL) offers a privacy-preserving paradigm for Edge AI scenarios such as mobile health diagnostics, where sensitive multimodal data reside on distributed, resource-constrained devices. Standard VFL systems, however, often suffer performance limitations due to simplistic feature fusion. This paper introduces HybridVFL, a framework that addresses this bottleneck by pairing client-side feature disentanglement with a server-side cross-modal Transformer for context-aware fusion. In a systematic evaluation on the multimodal HAM10000 skin lesion dataset, HybridVFL significantly outperforms standard federated baselines, underscoring the importance of advanced fusion mechanisms for robust, privacy-preserving learning systems.
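The client/server split described above can be sketched in a few lines of PyTorch. This is a minimal illustration under stated assumptions, not the paper's implementation: all class names, embedding sizes, the two-linear-head disentangler, and the mean-pooled Transformer encoder on the server are hypothetical choices; in HybridVFL only embeddings (never raw data) would leave the clients.

```python
import torch
import torch.nn as nn

class ClientDisentangler(nn.Module):
    """Hypothetical client-side module: maps a client's raw features to a
    modality-specific embedding and a shared embedding (names assumed)."""
    def __init__(self, in_dim: int, emb_dim: int):
        super().__init__()
        self.specific = nn.Linear(in_dim, emb_dim)
        self.shared = nn.Linear(in_dim, emb_dim)

    def forward(self, x):
        # Only these embeddings would be transmitted to the server.
        return self.specific(x), self.shared(x)

class ServerFusion(nn.Module):
    """Hypothetical server-side fusion: treat each received embedding as a
    token and fuse them with a small cross-modal Transformer encoder."""
    def __init__(self, emb_dim: int, num_classes: int, nhead: int = 4):
        super().__init__()
        layer = nn.TransformerEncoderLayer(
            d_model=emb_dim, nhead=nhead, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(emb_dim, num_classes)

    def forward(self, tokens):  # tokens: (batch, num_tokens, emb_dim)
        fused = self.encoder(tokens).mean(dim=1)  # pool over modality tokens
        return self.head(fused)

# Toy forward pass with two clients: an image-feature party and a
# metadata party, as in a dermoscopy-image + patient-record split.
img_client = ClientDisentangler(in_dim=512, emb_dim=64)
meta_client = ClientDisentangler(in_dim=16, emb_dim=64)
server = ServerFusion(emb_dim=64, num_classes=7)  # HAM10000 has 7 classes

x_img = torch.randn(8, 512)   # simulated image-branch features
x_meta = torch.randn(8, 16)   # simulated tabular metadata
# Stack the four embeddings (specific + shared per client) as tokens.
tokens = torch.stack([*img_client(x_img), *meta_client(x_meta)], dim=1)
logits = server(tokens)
print(logits.shape)  # torch.Size([8, 7])
```

In a real VFL training loop, gradients with respect to the transmitted embeddings would flow back to each client, so the disentangler and the fusion module are trained jointly without any party exposing its raw modality.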