🤖 AI Summary
Federated learning faces inherent challenges in data silos, including difficulty in enhancing local knowledge, poor fairness, high communication overhead, and low reproducibility. Method: This paper proposes Knowledge-Aware Federated (KAF), a paradigm shift from global model optimization to systematic local knowledge enhancement. KAF introduces three methodological pillars—knowledge expansion, knowledge filtering, and label/feature-space calibration—integrating knowledge distillation, causal inference, subspace alignment, and uncertainty-driven active selection to handle heterogeneous, low-quality, and weakly labeled distributed data. Contribution/Results: We formally define the KAF framework and its prototype optimization objective, delineate three core technical pathways, and identify key open problems. KAF provides a principled foundation for evolving federated learning toward knowledge-empowered collaborative learning, advancing fairness, efficiency, and reproducibility while preserving data privacy and heterogeneity.
📝 Abstract
Data, as an observable form of knowledge, has become one of the most important factors of production for the development of Artificial Intelligence (AI). Meanwhile, increasing legislation and regulations on private and proprietary information results in scattered data sources also known as the ``data islands''. Although some collaborative learning paradigms such as Federated Learning (FL) can enable privacy-preserving training over decentralized data, they have inherent deficiencies in fairness, costs and reproducibility because of being learning-centric, which greatly limits the way how participants cooperate with each other. In light of this, we present a knowledge-centric paradigm termed emph{Knowledge Augmentation in Federation} (KAF), with focus on how to enhance local knowledge through collaborative effort. We provide the suggested system architecture, formulate the prototypical optimization objective, and review emerging studies that employ methodologies suitable for KAF. On our roadmap, with a three-way categorization we describe the methods for knowledge expansion, knowledge filtering, and label and feature space correction in the federation. Further, we highlight several challenges and open questions that deserve more attention from the community. With our investigation, we intend to offer new insights for what collaborative learning can bring back to decentralized data.