Variable-Length Joint Source-Channel Coding for Semantic Communication

📅 2025-11-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
In digital semantic communication, conventional joint source-channel coding (JSCC) struggles with incompatibility between continuous semantic representations and discrete variable-length codewords, hindering fine-grained bit-level rate control. Method: This paper proposes an end-to-end trainable variable-length semantic encoding framework grounded in the information bottleneck principle. It decouples code length from semantic content via architectural design, enables direct semantic compression under finite codebooks, and employs policy gradient optimization to handle non-differentiable code-length decisions. Contribution/Results: Experiments demonstrate that the method significantly improves downstream task inference accuracy at low bitrates, outperforming existing baselines adapted for digital semantic communication systems. It is the first to achieve precise, semantic-aware bitrate regulation while maintaining high transmission efficiency—unifying semantic fidelity and bit-level controllability.

Technology Category

Application Category

📝 Abstract
This paper investigates a key challenge faced by joint source-channel coding (JSCC) in digital semantic communication (SemCom): the incompatibility between existing JSCC schemes that yield continuous encoded representations and digital systems that employ discrete variable-length codewords. It further results in feasibility issues in achieving physical bit-level rate control via such JSCC approaches for efficient semantic transmission. In this paper, we propose a novel end-to-end coding (E2EC) framework to tackle it. The semantic coding problem is formed by extending the information bottleneck (IB) theory over noisy channels, which is a tradeoff between bit-level communication rate and semantic distortion. With a structural decomposition of encoding to handle code length and content respectively, we can construct an end-to-end trainable encoder that supports the direct compression of a data source into a finite codebook. To optimize our E2EC across non-differentiable operations, e.g., sampling, we use the powerful policy gradient to support gradient-based updates. Experimental results illustrate that E2EC achieves high inference quality with low bit rates, outperforming representative baselines compatible with digital SemCom systems.
Problem

Research questions and friction points this paper is trying to address.

Resolves incompatibility between continuous JSCC encodings and digital systems
Enables physical bit-level rate control for efficient semantic transmission
Achieves direct compression into finite codebook via end-to-end trainable encoder
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end coding framework for digital semantic communication
Structural decomposition for finite codebook compression
Policy gradient optimization for non-differentiable operations
🔎 Similar Papers
No similar papers found.
Yujie Zhou
Yujie Zhou
Shanghai Jiao Tong University
Image GenerationVideo GenerationVideo Editing
R
Rulong Wang
School of Elect. Inform. & Commun., Huazhong Univ. of Science & Technology, China
Y
Yong Xiao
Peng Cheng Laboratory, Shenzhen, China; Pazhou Laboratory (Huangpu), Guangzhou, China
Y
Yingyu Li
School of Mech. Eng. and Elec. Info., China University of Geosciences (Wuhan), China
Guangming Shi
Guangming Shi
School of Electronic Engineering, Xidian University, China; Peng Cheng Laboratory
compressed sensingacquisition and processing of remote sensing imagesmultimedia image communicationmedical imaging