Mixture of Semantics Transmission for Generative AI-Enabled Semantic Communication Systems

📅 2025-09-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the low semantic transmission efficiency and the trade-off between channel resource utilization and reconstruction quality in generative-AI-driven wireless semantic communication, this paper proposes a Mixture-of-Semantics (MoS) transmission strategy. MoS leverages semantic segmentation to precisely distinguish Regions of Interest (ROI) from Regions of Non-Interest (RONI), enabling differentiated semantic encoding and transmission: ROI encoding prioritizes structural fidelity, while RONI encoding emphasizes semantic consistency. At the receiver, a diffusion model jointly reconstructs the full image. The framework integrates semantic segmentation, rate-distortion-optimized coding, and generative reconstruction to realize end-to-end semantic-level communication. Experiments demonstrate that MoS achieves a 3.2 dB gain in ROI peak signal-to-noise ratio (PSNR) and an 18.7% improvement in RONI CLIP similarity over baseline methods, marking the first approach in semantic communication to simultaneously optimize visual fidelity and semantic relevance.

Technology Category

Application Category

📝 Abstract
In this paper, we propose a mixture of semantics (MoS) transmission strategy for wireless semantic communication systems based on generative artificial intelligence (AI). At the transmitter, we divide an image into regions of interest (ROI) and reigons of non-interest (RONI) to extract their semantic information respectively. Semantic information of ROI can be allocated more bandwidth, while RONI can be represented in a compact form for transmission. At the receiver, a diffusion model reconstructs the full image using the received semantic information of ROI and RONI. Compared to existing generative AI-based methods, MoS enables more efficient use of channel resources by balancing visual fidelity and semantic relevance. Experimental results demonstrate that appropriate ROI-RONI allocation is critical. The MoS achieves notable performance gains in peak signal-to-noise ratio (PSNR) of ROI and CLIP score of RONI.
Problem

Research questions and friction points this paper is trying to address.

Optimizing bandwidth allocation for ROI and RONI transmission
Balancing visual fidelity and semantic relevance in communication
Reconstructing full images using generative AI with semantic data
Innovation

Methods, ideas, or system contributions that make the work stand out.

Divides images into ROI and RONI regions
Allocates more bandwidth to ROI semantics
Uses diffusion model to reconstruct images
🔎 Similar Papers
No similar papers found.
J
Junjie Ni
Cooperative Medianet Innovation Center and Shanghai Key Laboratory of Digital Media Processing and Transmission, Shanghai Jiao Tong University, Shanghai, 200240, China
T
Tong Wu
Cooperative Medianet Innovation Center and Shanghai Key Laboratory of Digital Media Processing and Transmission, Shanghai Jiao Tong University, Shanghai, 200240, China
Zhiyong Chen
Zhiyong Chen
Shanghai Jiao Tong University
6G networksWireless CommunicationsComputing and Caching Networks
Yin Xu
Yin Xu
Beijing Jiaotong University
Power Grid ResilienceElectricity-Transportation Integrated SystemPower System High-Performance Simulation
Meixia Tao
Meixia Tao
Professor at Shanghai Jiao Tong University; Fellow of IEEE
wireless communicationscachingedge computing5G+
Wenjun Zhang
Wenjun Zhang
City University of Hong Kong
Thin film technologynanomaterials and nanodevices