🤖 AI Summary
This work addresses the ill-posed inverse problem of wireless channel estimation under sparse pilot signals by proposing MultiCE-Flow, a novel framework that integrates flow matching with multimodal environmental awareness. For the first time, it leverages LiDAR, camera, and positional data to provide semantic conditioning, while treating sparse pilot patterns as structural constraints. A diffusion-based Transformer enables efficient, single-step, high-fidelity channel reconstruction. By jointly exploiting semantic and structural priors, MultiCE-Flow substantially enhances generalization and robustness in out-of-distribution scenarios and across varying pilot densities. Experimental results demonstrate that the proposed method consistently outperforms both conventional and state-of-the-art generative approaches in terms of channel reconstruction accuracy, generalization capability, and robustness.
📝 Abstract
Deep generative models offer a powerful alternative to conventional channel estimation by learning complex channel distributions. By integrating the rich environmental information available in modern sensing-aided networks, this paper proposes MultiCE-Flow, a multimodal channel estimation framework based on flow matching and diffusion transformer (DiT). We design a specialized multimodal perception module that fuses LiDAR, camera, and location data into a semantic condition, while treating sparse pilots as a structural condition. These conditions guide a DiT backbone to reconstruct high-fidelity channels. Unlike standard diffusion models, we employ flow matching to learn a linear trajectory from noise to data, enabling efficient one-step sampling. By leveraging environmental semantics, our method mitigates the ill-posed nature of estimation with sparse pilots. Extensive experiments demonstrate that MultiCE-Flow consistently outperforms traditional baselines and existing generative models. Notably, it exhibits superior robustness to out-of-distribution scenarios and varying pilot densities, making it suitable for environment-aware communication systems.