HeterCSI: Channel-Adaptive Heterogeneous CSI Pretraining Framework for Generalized Wireless Foundation Models

📅 2026-01-26
🤖 AI Summary
This work addresses the limited generalization of existing wireless foundation models to heterogeneous channel state information (CSI) that varies in both scale and scenario, a limitation that stems from fixed input dimensions or scale-isolated training. The authors propose HeterCSI, a channel-adaptive pretraining framework for heterogeneous CSI, and show that scale heterogeneity induces destructive gradient interference, whereas scenario diversity promotes gradient alignment. To mitigate the interference, they design a scale-aware adaptive batching strategy that groups CSI samples of similar scale, together with a double-masking mechanism that separates valid signals from padding artifacts. The model achieves strong zero-shot performance across 12 datasets without fine-tuning, reducing NMSE relative to the state-of-the-art zero-shot benchmark WiFo by 7.19 dB, 4.08 dB, and 5.27 dB for CSI reconstruction, time-domain prediction, and frequency-domain prediction, respectively. It also reduces training latency by 53% compared to existing approaches and improves generalization performance by 1.53 dB on average.
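To put the reported dB figures in perspective, the short sketch below converts an NMSE reduction in dB into the corresponding linear shrink factor of normalized error power, using the standard relation NMSE_dB = 10·log10(NMSE). The helper name `db_reduction_to_ratio` is introduced here only for illustration and is not from the paper.

```python
def db_reduction_to_ratio(gain_db: float) -> float:
    """Linear factor by which normalized error power shrinks
    when NMSE is reduced by `gain_db` decibels."""
    return 10.0 ** (-gain_db / 10.0)


# NMSE reductions quoted above, in dB.
reported_gains_db = {
    "CSI reconstruction": 7.19,
    "time-domain prediction": 4.08,
    "frequency-domain prediction": 5.27,
}

for task, gain in reported_gains_db.items():
    ratio = db_reduction_to_ratio(gain)
    print(
        f"{task}: {gain:.2f} dB lower NMSE -> "
        f"error power x{ratio:.3f} ({(1 - ratio) * 100:.0f}% smaller)"
    )
```

Under this reading, the 7.19 dB reduction for CSI reconstruction corresponds to roughly a fivefold drop in normalized error power.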

📝 Abstract
Wireless foundation models promise transformative capabilities for channel state information (CSI) processing across diverse 6G network applications, yet face fundamental challenges due to the inherent dual heterogeneity of CSI across both scale and scenario dimensions. However, current pretraining approaches either constrain inputs to fixed dimensions or isolate training by scale, limiting the generalization and scalability of wireless foundation models. In this paper, we propose HeterCSI, a channel-adaptive pretraining framework that reconciles training efficiency with robust cross-scenario generalization via a new understanding of gradient dynamics in heterogeneous CSI pretraining. Our key insight reveals that CSI scale heterogeneity primarily causes destructive gradient interference, while scenario diversity actually promotes constructive gradient alignment when properly managed. Specifically, we formulate heterogeneous CSI batch construction as a partitioning optimization problem that minimizes zero-padding overhead while preserving scenario diversity. To solve this, we develop a scale-aware adaptive batching strategy that aligns CSI samples of similar scales, and design a double-masking mechanism to isolate valid signals from padding artifacts. Extensive experiments on 12 datasets demonstrate that HeterCSI establishes a generalized foundation model without scenario-specific finetuning, achieving superior average performance over full-shot baselines. Compared to the state-of-the-art zero-shot benchmark WiFo, it reduces NMSE by 7.19 dB, 4.08 dB, and 5.27 dB for CSI reconstruction, time-domain, and frequency-domain prediction, respectively. The proposed HeterCSI framework also reduces training latency by 53% compared to existing approaches while improving generalization performance by 1.53 dB on average.
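The link between batch composition and zero-padding overhead can be illustrated with a small sketch. It is not the authors' partitioning algorithm: it assumes a hypothetical 3-D CSI shape (time, subcarriers, antennas), uses a plain sort-and-chunk heuristic in place of the optimization described above, and ignores the scenario-diversity constraint entirely. All function names are illustrative.

```python
import random
from typing import List, Tuple

# Hypothetical CSI sample shape: (time steps, subcarriers, antennas).
Shape = Tuple[int, int, int]


def volume(shape: Shape) -> int:
    """Number of CSI entries in one unpadded sample."""
    return shape[0] * shape[1] * shape[2]


def padded_volume(batch: List[Shape]) -> int:
    """Entries stored once every sample is zero-padded to the batch-wise max per axis."""
    target = tuple(max(s[d] for s in batch) for d in range(3))
    return len(batch) * volume(target)


def padding_overhead(batches: List[List[Shape]]) -> float:
    """Fraction of stored entries that are padding rather than valid CSI."""
    padded = sum(padded_volume(b) for b in batches)
    valid = sum(volume(s) for b in batches for s in b)
    return 1.0 - valid / padded


def chunk(samples: List[Shape], batch_size: int) -> List[List[Shape]]:
    """Split a list into consecutive batches of at most `batch_size` samples."""
    return [samples[i:i + batch_size] for i in range(0, len(samples), batch_size)]


def random_batches(samples: List[Shape], batch_size: int, seed: int = 0) -> List[List[Shape]]:
    """Scale-agnostic baseline: shuffle, then chunk."""
    shuffled = samples[:]
    random.Random(seed).shuffle(shuffled)
    return chunk(shuffled, batch_size)


def scale_sorted_batches(samples: List[Shape], batch_size: int) -> List[List[Shape]]:
    """Simple scale-aware heuristic: sort by size so neighbours share a batch."""
    ordered = sorted(samples, key=lambda s: (volume(s), s))
    return chunk(ordered, batch_size)


# Three made-up CSI scales, eight samples each.
shapes = [(4, 8, 2), (8, 16, 4), (16, 32, 8)]
samples = [shape for shape in shapes for _ in range(8)]

print("random batching overhead:      ", round(padding_overhead(random_batches(samples, 4)), 3))
print("scale-sorted batching overhead:", round(padding_overhead(scale_sorted_batches(samples, 4)), 3))
```

In this toy setting the scale-sorted batches contain only one shape each, so no padding is needed, whereas mixed batches pay for padding every sample up to the largest shape present. A fuller treatment would also have to keep scenarios mixed within each batch, which this sketch does not attempt.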
Problem

Research questions and friction points this paper is trying to address.

Channel State Information (CSI)
heterogeneity
wireless foundation models
generalization
pretraining
Innovation

Methods, ideas, or system contributions that make the work stand out.

Heterogeneous CSI
Wireless Foundation Model
Adaptive Batching
Gradient Alignment
Double-Masking Mechanism
Authors
Chenyu Zhang
National Engineering Research Center for Mobile Network Technologies, Beijing University of Posts and Telecommunications, Beijing 100876, China
Xinchen Lyu
Beijing University of Posts and Telecommunications
Fog computing, Edge caching, SDN
Chenshan Ren
Key Laboratory of Ethnic Language Intelligent Analysis and Security Governance of MOE, Minzu University of China, Beijing 100081, China
Shuhan Liu
China Telecom Corporation Limited Gansu Branch, Gansu 730000, China
Qimei Cui
Professor, School of Information and Communication Engineering, Beijing University of Posts and
B5G/6G wireless communications, mobile computing and IoT
Xiaofeng Tao
Beijing University of Posts and Telecommunications
wireless communication