ARCHE: Autoregressive Residual Compression with Hyperprior and Excitation

📅 2026-03-10
🤖 AI Summary
This work addresses the high computational cost and limited parallelism of many existing learning-based image compression methods, which often pay for their improved rate-distortion performance with heavier models. The authors propose ARCHE, a framework that unifies hierarchical (hyperprior), spatial autoregressive, and channel-based priors within an efficient convolutional architecture, modeling both global and local dependencies in the latent representation without recurrent or Transformer components. Adaptive feature recalibration and residual refinement are used to improve the quality of the latent representation. The model is trained end to end and reduces BD-Rate on the Kodak dataset by approximately 48%, 30%, and 5% relative to Ballé et al., the channel-wise autoregressive model of Minnen & Singh, and VVC Intra, respectively. With 95 million parameters and a running time of 222 ms per image, it produces sharper textures and improved color fidelity in visual comparisons, particularly at lower bit rates.
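
For readers unfamiliar with the BD-Rate figures quoted above, the following is a minimal sketch of how such a number can be computed from two rate-distortion curves. It is a simplified variant: the usual Bjøntegaard procedure fits a cubic polynomial to log-rate as a function of PSNR, whereas this sketch uses piecewise-linear interpolation to stay within the standard library. The function names and the sample points are hypothetical and are not taken from the paper.

```python
import math

def _interp(x, xs, ys):
    """Piecewise-linear interpolation of ys over strictly increasing xs."""
    for i in range(len(xs) - 1):
        x0, x1 = xs[i], xs[i + 1]
        if x0 <= x <= x1:
            t = (x - x0) / (x1 - x0)
            return ys[i] + t * (ys[i + 1] - ys[i])
    raise ValueError("x lies outside the sampled range")

def bd_rate(anchor, candidate, steps=1000):
    """Average bitrate change (%) of candidate vs. anchor at equal quality.

    Each curve is a list of (bitrate, psnr_db) points with distinct PSNRs.
    Negative values mean the candidate needs fewer bits than the anchor.
    Simplification (assumption): linear interpolation, not a cubic fit.
    """
    a = sorted(anchor, key=lambda p: p[1])
    c = sorted(candidate, key=lambda p: p[1])
    a_q, a_lr = [q for _, q in a], [math.log(r) for r, _ in a]
    c_q, c_lr = [q for _, q in c], [math.log(r) for r, _ in c]

    # Only the quality range covered by both curves is compared.
    lo, hi = max(a_q[0], c_q[0]), min(a_q[-1], c_q[-1])
    if hi <= lo:
        raise ValueError("curves do not overlap in quality")

    # Trapezoidal average of the log-rate difference over [lo, hi].
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        q = min(hi, lo + i * h)  # clamp to guard against float overshoot
        diff = _interp(q, c_q, c_lr) - _interp(q, a_q, a_lr)
        total += diff * (0.5 if i in (0, steps) else 1.0)
    avg_log_diff = total * h / (hi - lo)
    return (math.exp(avg_log_diff) - 1.0) * 100.0

# Made-up curves: the candidate uses half the bits at every quality level.
anchor_curve = [(0.25, 30.0), (0.5, 33.0), (1.0, 36.0), (2.0, 39.0)]
candidate_curve = [(r / 2.0, q) for r, q in anchor_curve]
print(round(bd_rate(anchor_curve, candidate_curve), 3))
```

Under this convention, a "BD-Rate reduction of 48%" corresponds to a returned value near -48, that is, roughly 48% fewer bits on average for the same PSNR.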

📝 Abstract
Recent progress in learning-based image compression has demonstrated that end-to-end optimization can substantially outperform traditional codecs by jointly learning compact latent representations and probabilistic entropy models. However, many existing approaches achieve high rate-distortion efficiency at the expense of increased computational cost and limited parallelism. This paper presents ARCHE (Autoregressive Residual Compression with Hyperprior and Excitation), an end-to-end learned image compression framework that balances modeling accuracy and computational efficiency. The proposed architecture unifies hierarchical, spatial, and channel-based priors within a single probabilistic framework, capturing both global and local dependencies in the latent representation of the image, while employing adaptive feature recalibration and residual refinement to enhance latent representation quality. Without relying on recurrent or transformer-based components, ARCHE attains state-of-the-art rate-distortion efficiency: it reduces the BD-Rate by approximately 48% relative to the commonly used benchmark model of Ballé et al., 30% relative to the channel-wise autoregressive model of Minnen & Singh, and 5% relative to the VVC Intra codec on the Kodak benchmark dataset. The framework maintains computational efficiency with 95M parameters and a running time of 222 ms per image. Visual comparisons confirm sharper textures and improved color fidelity, particularly at lower bit rates, demonstrating that accurate entropy modeling can be achieved through efficient convolutional designs suitable for practical deployment.
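
The abstract does not spell out how the "adaptive feature recalibration" is implemented. As an illustration only, the sketch below assumes it resembles squeeze-and-excitation style channel gating: each channel is averaged over its spatial extent, a small bottleneck network turns those averages into per-channel gates in (0, 1), and each channel is rescaled by its gate. The function name, the weight shapes, and the example values are hypothetical and are not ARCHE's actual layers.

```python
import math

def channel_excitation(x, w1, b1, w2, b2):
    """Squeeze-and-excitation style gating (illustrative assumption).

    x  : list of C channels, each an H x W nested list of floats
    w1 : C_r x C weights of the bottleneck layer, b1 : C_r biases
    w2 : C x C_r weights of the expansion layer,  b2 : C biases
    Returns the rescaled feature map and the per-channel gates.
    """
    # Squeeze: global average pooling gives one scalar per channel.
    z = [sum(sum(row) for row in ch) / (len(ch) * len(ch[0])) for ch in x]

    # Excite, step 1: bottleneck fully connected layer followed by ReLU.
    hidden = [max(0.0, sum(w * zi for w, zi in zip(row, z)) + b)
              for row, b in zip(w1, b1)]

    # Excite, step 2: expansion layer followed by a sigmoid gate.
    gates = [1.0 / (1.0 + math.exp(-(sum(w * hi for w, hi in zip(row, hidden)) + b)))
             for row, b in zip(w2, b2)]

    # Scale: multiply every value in a channel by that channel's gate.
    y = [[[v * g for v in row] for row in ch] for ch, g in zip(x, gates)]
    return y, gates

# Two 2x2 channels and arbitrary made-up weights (C = 2, C_r = 1).
features = [[[1.0, 2.0], [3.0, 4.0]],
            [[0.5, 0.5], [0.5, 0.5]]]
recalibrated, gates = channel_excitation(
    features, w1=[[0.3, -0.2]], b1=[0.1], w2=[[1.0], [-1.0]], b2=[0.0, 0.0])
print(gates)
```

The gating is cheap because it adds only two small matrix products per feature map, which fits the abstract's emphasis on efficient convolutional designs without recurrent or transformer-based components.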
Problem

Research questions and friction points this paper is trying to address.

image compression
rate-distortion efficiency
computational efficiency
entropy modeling
learned compression
Innovation

Methods, ideas, or system contributions that make the work stand out.

learned image compression
autoregressive prior
hyperprior
channel excitation
rate-distortion optimization
Sofia Iliopoulou
Dept. of Electrical and Computer Engineering, University of Patras, Patras, Greece
Dimitris Ampeliotis
Dept. of Digital Media and Communication, Ionian University, Argostoli, Greece
Athanassios Skodras
University of Patras
Hand Gesture Recognition based on sEMG Signals and Deep Learning; Signal, Image and Video Coding / Processing / Analysis; HDR Ima