Optimally Bridging Semantics and Data: Generative Semantic Communication via Schrödinger Bridge

📅 2026-04-20
📈 Citations: 0
Influential: 0
🤖 AI Summary
Existing generative semantic communication (GSC) methods start generation from a Gaussian prior, so in narrowband, high-noise channels they suffer from severe hallucinations and high computational overhead. To address this, the authors propose Schrödinger Bridge-based GSC (SBGSC), a general framework that drops the Gaussian assumption by constructing an optimal transport trajectory between the semantic and image distributions, enabling direct generative decoding. Within this framework, their Diffusion SB-based GSC (DSBGSC) reconstructs the nonlinear drift term of diffusion models using Schrödinger potentials and adds a self-consistency-based objective that teaches the model a nonlinear velocity field pointing directly toward the image, bypassing Markovian noise prediction and sharply reducing the number of sampling steps. In simulations, DSBGSC improves the Fréchet Inception Distance (FID) by at least 38%, raises the Structural Similarity Index (SSIM) by 49.3%, and speeds up inference more than eightfold compared with state-of-the-art GSC methods.

📝 Abstract
Generative Semantic Communication (GSC) is a promising solution for image transmission over narrow-band and high-noise channels. However, existing GSC methods rely on long, indirect transport trajectories from a Gaussian to an image distribution guided by semantics, causing severe hallucination and high computational cost. To address this, we propose a general framework named Schrödinger Bridge-based GSC (SBGSC). By leveraging the Schrödinger Bridge (SB) to construct optimal transport trajectories between arbitrary distributions, SBGSC breaks Gaussian limitations and enables direct generative decoding from semantics to images. Within this framework, we design Diffusion SB-based GSC (DSBGSC). DSBGSC reconstructs the nonlinear drift term of diffusion models using Schrödinger potentials, achieving direct optimal distribution transport to reduce hallucinations and computational overhead. To further accelerate generation, we propose a self-consistency-based objective guiding the model to learn a nonlinear velocity field pointing directly toward the image, bypassing Markovian noise prediction to significantly reduce sampling steps. Simulation results demonstrate that DSBGSC outperforms state-of-the-art GSC methods, improving FID by at least 38% and SSIM by 49.3%, while accelerating inference speed by over 8 times.
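
The idea of transporting directly from a semantic sample to an image, rather than from Gaussian noise, can be illustrated with a toy sketch. This is not the authors' DSBGSC implementation: it assumes paired endpoints and a Brownian reference process, in which case the bridge between the two points is a Brownian bridge. The names `bridge_sample`, `few_step_decode`, and `oracle_velocity` are hypothetical, and the oracle velocity stands in for a learned network that would not have access to the target image.

```python
import math
import random


def bridge_sample(x_sem, x_img, t, sigma=1.0, rng=random):
    """Draw x_t from a Brownian bridge pinned at x_sem (t=0) and x_img (t=1).

    Assumption: paired endpoints and a Brownian reference process,
    not the trajectory construction used in the paper.
    """
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    # The bridge variance vanishes at both endpoints.
    std = sigma * math.sqrt(t * (1.0 - t))
    return [(1.0 - t) * a + t * b + std * rng.gauss(0.0, 1.0)
            for a, b in zip(x_sem, x_img)]


def few_step_decode(x_sem, velocity, steps=4):
    """Euler-integrate dx/dt = velocity(x, t) from t=0 to t=1."""
    x = list(x_sem)
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt
        v = velocity(x, t)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


# Toy "semantic" and "image" vectors (hypothetical data).
x_sem = [0.0, 1.0, -1.0]
x_img = [2.0, 0.5, 3.0]


def oracle_velocity(x, t):
    """Velocity pointing straight at the target; a stand-in for a learned field."""
    return [(b - xi) / (1.0 - t) for xi, b in zip(x, x_img)]


decoded = few_step_decode(x_sem, oracle_velocity, steps=4)
midpoint = bridge_sample(x_sem, x_img, t=0.5, sigma=0.1)
```

With a velocity field that points directly at the target, a handful of Euler steps is enough to land on `x_img` in this toy setting, which is the intuition behind reducing the number of sampling steps.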
Problem

Research questions and friction points this paper is trying to address.

Generative Semantic Communication
Hallucination
Computational Cost
Optimal Transport
Image Transmission
Innovation

Methods, ideas, or system contributions that make the work stand out.

Generative Semantic Communication
Schrödinger Bridge
Optimal Transport
Diffusion Models
Self-consistency
Dahua Gao
School of Artificial Intelligence, Xidian University, Xi’an, Shaanxi 710071 China; and Pazhou Lab, Huangpu, Guangdong 510555 China
Ruichao Liu
School of Artificial Intelligence, Xidian University, Xi’an, Shaanxi 710071 China; and Peng Cheng Laboratory, Shenzhen, Guangdong 518055 China
Minxi Yang
School of Artificial Intelligence, Xidian University, Xi’an, Shaanxi 710071 China; and Pazhou Lab, Huangpu, Guangdong 510555 China
Shuai Ma
Peng Cheng Laboratory, Shenzhen, Guangdong 518055 China
Youlong Wu
School of Information Science and Technology, ShanghaiTech University, Shanghai 201210, China
Guangming Shi
School of Electronic Engineering, Xidian University, China; Peng Cheng Laboratory
compressed sensing; acquisition and processing of remote sensing images; multimedia image communication; medical imaging