DFAMS: Dynamic-flow guided Federated Alignment based Multi-prototype Search

📅 2025-08-27
🤖 AI Summary
Existing federated retrieval methods struggle with ambiguous queries in cross-domain settings, which degrades retrieval quality and downstream generation performance. To address this, we propose DFAMS, a multi-prototype alignment search framework guided by dynamic information flow. First, we introduce dynamic information flow analysis into federated retrieval, combining gradient signals and Shapley values to trace neuron activation paths, which enables fine-grained query intent identification and sub-domain boundary detection. Second, we model cross-source semantic alignment via multi-prototype contrastive learning. Evaluated on five benchmarks, our method outperforms state-of-the-art approaches by up to 14.37% in knowledge classification accuracy, 5.38% in retrieval recall, and 6.45% in downstream question-answering accuracy, improving federated retrieval for ambiguous cross-domain queries.

📝 Abstract
Federated Retrieval (FR) routes queries across multiple external knowledge sources to mitigate LLM hallucinations when the necessary external knowledge is distributed. However, existing methods struggle to retrieve high-quality, relevant documents for ambiguous queries, especially in cross-domain scenarios, which significantly limits their effectiveness in supporting downstream generation tasks. Inspired by dynamic information flow (DIF), we propose DFAMS, a novel framework that leverages DIF to identify latent query intents and construct semantically aligned knowledge partitions for accurate retrieval across heterogeneous sources. Specifically, DFAMS probes the DIF in LLMs by using gradient signals from a few annotated queries and Shapley value-based attribution to trace the neuron activation paths associated with intent recognition and subdomain boundary detection. DFAMS then uses DIF to train an alignment module via multi-prototype contrastive learning, enabling fine-grained intra-source modeling and inter-source semantic alignment across knowledge bases. Experimental results across five benchmarks show that DFAMS outperforms advanced FR methods by up to 14.37% in knowledge classification accuracy, 5.38% in retrieval recall, and 6.45% in downstream QA accuracy, demonstrating its effectiveness in complex FR scenarios.
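To make the Shapley value-based attribution mentioned in the abstract more concrete, here is a minimal sketch of exact Shapley values over a tiny set of "neurons". It is not the DFAMS implementation: the neuron names, the `toy_intent_score` value function, and the exhaustive enumeration of orderings are all illustrative assumptions. A real system would score neuron subsets with an LLM and would likely need a sampling-based approximation.

```python
from itertools import permutations


def shapley_values(players, value_fn):
    """Exact Shapley values: average each player's marginal contribution
    over every ordering in which players can join the coalition."""
    totals = {p: 0.0 for p in players}
    orderings = list(permutations(players))
    for order in orderings:
        coalition = frozenset()
        previous = value_fn(coalition)
        for p in order:
            coalition = coalition | {p}
            current = value_fn(coalition)
            totals[p] += current - previous
            previous = current
    return {p: total / len(orderings) for p, total in totals.items()}


def toy_intent_score(active):
    """Hypothetical stand-in for 'how well intent is recognized when only
    these neurons are active'. n0 helps alone; n1 and n2 only help together."""
    score = 0.0
    if "n0" in active:
        score += 1.0
    if "n1" in active and "n2" in active:
        score += 2.0
    return score


neurons = ["n0", "n1", "n2"]
phi = shapley_values(neurons, toy_intent_score)
print(phi)
```

In this toy setup the attributions sum to the score of the full neuron set minus the score of the empty set, which is the efficiency property that makes Shapley values useful for dividing credit among interacting units.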
Problem

Research questions and friction points this paper is trying to address.

Retrieving high-quality documents for ambiguous queries
Cross-domain knowledge retrieval in federated settings
Aligning heterogeneous knowledge sources for accurate retrieval
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic information flow probing identifies latent query intents
Shapley value-based attribution traces neuron activation paths
Multi-prototype contrastive learning aligns knowledge sources
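The multi-prototype idea listed above can be sketched as follows. This is a generic formulation under stated assumptions, not the loss used in the paper: each knowledge source is represented by several prototype vectors, a query is scored against a source by its closest prototype (cosine similarity), and a softmax cross-entropy over sources with a temperature acts as the contrastive objective. The source names, vectors, `temperature` value, and function names are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def source_score(embedding, protos):
    """Score a source by its best-matching prototype."""
    return max(cosine(embedding, p) for p in protos)


def multi_prototype_loss(embedding, prototypes, label, temperature=0.1):
    """Cross-entropy over sources, pulling the query toward the nearest
    prototype of its labeled source and away from other sources."""
    scores = {s: source_score(embedding, ps) / temperature
              for s, ps in prototypes.items()}
    peak = max(scores.values())  # subtract the max for numerical stability
    log_z = peak + math.log(sum(math.exp(v - peak) for v in scores.values()))
    return log_z - scores[label]


def route(embedding, prototypes):
    """Route a query to the source with the highest prototype similarity."""
    return max(prototypes, key=lambda s: source_score(embedding, prototypes[s]))


# Hypothetical prototypes: two sub-domain prototypes for one source, one for another.
prototypes = {
    "medical": [[1.0, 0.0], [0.8, 0.6]],
    "legal": [[0.0, 1.0]],
}
query = [0.9, 0.1]
chosen = route(query, prototypes)
loss_correct = multi_prototype_loss(query, prototypes, "medical")
loss_wrong = multi_prototype_loss(query, prototypes, "legal")
print(chosen, loss_correct, loss_wrong)
```

Keeping several prototypes per source lets one source cover distinct sub-domains, while the shared softmax over sources places all prototypes in the same comparison space.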
Zhibang Yang
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Xinke Jiang
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Rihong Qiu
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Ruiqing Li
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Yihang Zhang
Northeastern University, Shenyang, China
Yue Fang
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Yongxin Xu
Peking University
Large Language Models, Knowledge Graphs, Electronic Medical Record Analysis
Hongxin Ding
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Xu Chu
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China
Junfeng Zhao
Assistant Professor at Arizona State University, Director of BELIV Lab
Connected & Automated Vehicle, Motion Planning & Controls, Electric Vehicles, AI/ML
Yasha Wang
Key Laboratory of High Confidence Software Technologies, Ministry of Education; School of Computer Science, Peking University, Beijing, China