🤖 AI Summary
To address the challenges of limited pilot overhead, low channel state information (CSI) estimation accuracy, and poor generalization in massive MIMO systems, this paper proposes the first prediction-based foundation model framework for CSI estimation. Our method innovatively adopts a vision transformer architecture pretrained across domains, enabling joint spatiotemporal-frequency modeling and adaptive prior-measurement fusion to achieve efficient pilot encoding and strong cross-scenario transferability. This work pioneers the application of prediction-based foundation models to wireless channel estimation. Extensive evaluations demonstrate significant improvements over conventional and state-of-the-art AI-based methods across diverse deployment scenarios: CSI estimation error is reduced by 42%, pilot overhead is supported below 5%, and robustness to channel noise is substantially enhanced.
📝 Abstract
Accurate channel state information (CSI) acquisition is essential for modern wireless systems, which becomes increasingly difficult under large antenna arrays, strict pilot overhead constraints, and diverse deployment environments. Existing artificial intelligence-based solutions often lack robustness and fail to generalize across scenarios. To address this limitation, this paper introduces a predictive-foundation-model-based channel estimation framework that enables accurate, low-overhead, and generalizable CSI acquisition. The proposed framework employs a predictive foundation model trained on large-scale cross-domain CSI data to extract universal channel representations and provide predictive priors with strong cross-scenario transferability. A pilot processing network based on a vision transformer architecture is further designed to capture spatial, temporal, and frequency correlations from pilot observations. An efficient fusion mechanism integrates predictive priors with real-time measurements, enabling reliable CSI reconstruction even under sparse or noisy conditions. Extensive evaluations across diverse configurations demonstrate that the proposed estimator significantly outperforms both classical and data-driven baselines in accuracy, robustness, and generalization capability.