FedBPrompt: Federated Domain Generalization Person Re-Identification via Body Distribution Aware Visual Prompts

📅 2026-03-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenge of insufficient feature discriminability in federated domain generalization for person re-identification, which arises from heterogeneous data distributions across clients and background distractions. To mitigate this, we propose a Body-aware Prompting Mechanism (BAPM) that leverages learnable global and local body-aware visual prompts to guide the attention of Vision Transformers toward the primary human regions. Coupled with a Prompt-based Fine-Tuning Strategy (PFTS), our approach updates only lightweight prompt parameters, significantly reducing communication overhead while substantially improving cross-domain generalization performance. The method seamlessly integrates into existing Vision Transformer frameworks and achieves state-of-the-art results on multiple federated ReID benchmarks with minimal aggregation rounds.

Technology Category

Application Category

📝 Abstract
Federated Domain Generalization for Person Re-Identification (FedDG-ReID) learns domain-invariant representations from decentralized data. While Vision Transformer (ViT) is widely adopted, its global attention often fails to distinguish pedestrians from high similarity backgrounds or diverse viewpoints -- a challenge amplified by cross-client distribution shifts in FedDG-ReID. To address this, we propose Federated Body Distribution Aware Visual Prompt (FedBPrompt), introducing learnable visual prompts to guide Transformer attention toward pedestrian-centric regions. FedBPrompt employs a Body Distribution Aware Visual Prompts Mechanism (BAPM) comprising: Holistic Full Body Prompts to suppress cross-client background noise, and Body Part Alignment Prompts to capture fine-grained details robust to pose and viewpoint variations. To mitigate high communication costs, we design a Prompt-based Fine-Tuning Strategy (PFTS) that freezes the ViT backbone and updates only lightweight prompts, significantly reducing communication overhead while maintaining adaptability. Extensive experiments demonstrate that BAPM effectively enhances feature discrimination and cross-domain generalization, while PFTS achieves notable performance gains within only a few aggregation rounds. Moreover, both BAPM and PFTS can be easily integrated into existing ViT-based FedDG-ReID frameworks, making FedBPrompt a flexible and effective solution for federated person re-identification. The code is available at https://github.com/leavlong/FedBPrompt.
Problem

Research questions and friction points this paper is trying to address.

Federated Learning
Domain Generalization
Person Re-Identification
Vision Transformer
Distribution Shift
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning
Person Re-Identification
Visual Prompting
Domain Generalization
Vision Transformer
🔎 Similar Papers
No similar papers found.
Xin Xu
Xin Xu
Professor of Wuhan University of Science and Technology
Person re-identificationLow-light image processingSalient object detection
W
Weilong Li
School of Computer Science and Technology, Wuhan University of Science and Technology, China
Wei Liu
Wei Liu
Wuhan University
Acoustic Signal ProcessingMicrophone ArraysSpeech Enhancement
Wenke Huang
Wenke Huang
School of Computer Science, Wuhan University
Federated LearningMLLM
Z
Zhixi Yu
School of Computer Science and Technology, Wuhan University of Science and Technology, China
B
Bin Yang
NERC for Multimedia Software, School of Computer Science, Wuhan University, China
X
Xiaoying Liao
Changsha Bus Group, China; Central South University of Forestry and Technology, China
Kui Jiang
Kui Jiang
Harbin Institute of Technology
computer visionimage processingdeep learning