Federated Learning for Big Data: A Survey on Opportunities, Applications, and Future Directions

📅 2021-10-08
🏛️ arXiv.org
📈 Citations: 43
Influential: 1
🤖 AI Summary
Addressing the dual challenges of privacy leakage and data silos in cross-domain big data collaboration, this paper presents a systematic survey on the deep integration of federated learning (FL) with big data services. It examines four core stages—data acquisition, storage, analytics, and privacy preservation—and extends the analysis to representative application domains, including smart cities and intelligent healthcare. The work introduces, for the first time, a dual-dimensional “service–application” classification framework, thereby filling a critical gap in comprehensive surveys at the FL–big data intersection. Key technical challenges are distilled, including model heterogeneity, communication overhead, security and robustness, and system scalability. International benchmark projects are synthesized, and viable technical pathways are proposed. The study provides researchers with a coherent theoretical taxonomy and offers practitioners a structured, implementation-oriented reference guide for deploying FL in big data ecosystems.
📝 Abstract
Big data has evolved remarkably over the last few years, with an enormous volume of data generated by newly emerging services and applications and by a massive number of Internet-of-Things (IoT) devices. The potential of big data can be realized via analytic and learning techniques, in which data from various sources is transferred to a central cloud for centralized storage, processing, and training. However, this conventional approach faces critical data privacy issues, as the data may include sensitive information such as personal details, government data, and bank accounts. To overcome this challenge, federated learning (FL) has emerged as a promising learning technique. However, a comprehensive survey on FL for big data services and applications has yet to be conducted. In this article, we present a survey on the use of FL for big data services and applications, aiming to provide general readers with an overview of FL, big data, and the motivations behind the use of FL for big data. In particular, we extensively review the use of FL for key big data services, including big data acquisition, big data storage, big data analytics, and big data privacy preservation. Subsequently, we review the potential of FL for big data applications, such as smart city, smart healthcare, smart transportation, smart grid, and social media. Further, we summarize a number of important projects on FL-big data and discuss key challenges of this topic along with several promising solutions and directions.
Problem

Research questions and friction points this paper is trying to address.

Addressing privacy concerns in big data processing with Federated Learning
Exploring Federated Learning applications in big data services and IoT
Reviewing challenges and future directions for Federated Learning in big data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning for decentralized data training
Privacy preservation in big data analytics
FL applications in smart city services
T. Gadekallu
The College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China
Viet Quoc Pham
Highly Cited Researcher, Trinity College Dublin
Wireless AI, Edge Computing, Security & Privacy, Wireless Communications, Machine Learning
Thien Huynh-The
Ho Chi Minh City University of Technology and Education
Signal processing, Image processing, Wireless communications, Computer Vision, Deep learning
S. Bhattacharya
College of Mathematics and Computer Science, Zhejiang A&F University, Hangzhou 311300, China
Praveen Kumar Reddy Maddikunta
Department of Computer and Information Sciences, Faculty of Engineering and Environment, Northumbria University, Newcastle upon Tyne NE1 8ST, United Kingdom
Madhusanka Liyanage
Associate Professor/Ad Astra Fellow, University College Dublin, Ireland
6G, Security, Federated Learning, Blockchain, Explainable AI