FedLLM: A Privacy-Preserving Federated Large Language Model for Explainable Traffic Flow Prediction

📅 2026-04-17

📈 Citations: 0

✨ Influential: 0

career value

251K/year

🤖 AI Summary

This work addresses key limitations in existing traffic flow prediction methods—namely their reliance on centralized data, limited interpretability, and vulnerability to data heterogeneity and privacy concerns. To overcome these challenges, the authors propose FedLLM, a novel framework that integrates federated learning with domain-adapted large language models. By leveraging structured traffic prompts, lightweight LoRA-based parameter exchange, and a composite selection-scoring mechanism, FedLLM enables high-performance multi-step short-term forecasting while preserving data locality and privacy. The approach effectively mitigates non-IID data distributions, achieves prediction accuracy surpassing centralized baselines, and produces interpretable, structurally coherent reasoning outputs with strong cross-regional generalization capabilities.

Technology Category

Application Category

📝 Abstract

Traffic prediction plays a central role in intelligent transportation systems (ITS) by supporting real-time decision-making, congestion management, and long-term planning. However, many existing approaches face practical limitations. Most spatio-temporal models are trained on centralized data, rely on numerical representations, and offer limited explainability. Recent Large Language Model (LLM) methods improve reasoning capabilities but typically assume centralized data availability and do not fully capture the distributed and heterogeneous nature of real-world traffic systems. To address these challenges, this study proposes FedLLM (Federated LLM), a privacy-preserving and distributed framework for explainable multi-horizon short-term traffic flow prediction (15-60 minutes). The framework introduces four key contributions: 1) a Composite Selection Score (CSS) for data-driven freeway selection that captures structural diversity across traffic regions 2) a domain-adapted LLM fine-tuned on structured traffic prompts encoding spatial, temporal, and statistical context 3) FedLLM framework, that enables collaborative training across heterogeneous clients while exchanging only lightweight LoRA adapter parameters, 4) a structured prompt representation that supports contextual reasoning and cross-region generalization. The FedLLM design allows each client to learn from local traffic patterns while contributing to a shared global model through efficient parameter exchange, reducing communication overhead and keeping data private. This setup supports learning under non-IID traffic distributions. Experimental results show that FedLLM achieves improved predictive performance over centralized baselines, while producing structured and explainable outputs. These findings highlight the potential of combining FL with domain-adapted LLMs for scalable, privacy-aware, and explainable traffic prediction.

Problem

Research questions and friction points this paper is trying to address.

traffic flow prediction

federated learning

large language models

explainability

privacy preservation

Innovation

Methods, ideas, or system contributions that make the work stand out.

Federated Learning

Large Language Model

Traffic Flow Prediction