Optimized Learned Image Compression for Facial Expression Recognition

📅 2025-09-21
📈 Citations: 0
Influential: 0
🤖 AI Summary
To address feature degradation and accuracy loss in facial expression recognition (FER) caused by lossy image compression, this paper proposes an end-to-end learned image compression framework. A custom loss function balances compression performance against recognition performance, and the study examines how the weights of the individual loss terms shift that balance. Two training setups are compared: fine-tuning the compression model alone, and jointly optimizing the compression and classification models. Fine-tuning the compression model alone improves classification accuracy by 0.71% and compression efficiency by 49.32%; joint optimization improves accuracy by 4.04% and compression efficiency by 89.12%. The jointly optimized classifier stays accurate on both compressed and uncompressed data, and the compression model preserves image details even at high compression rates.

📝 Abstract
Efficient data compression is crucial for the storage and transmission of visual data. However, in facial expression recognition (FER) tasks, lossy compression often leads to feature degradation and reduced accuracy. To address these challenges, this study proposes an end-to-end model designed to preserve critical features and enhance both compression and recognition performance. A custom loss function is introduced to optimize the model, tailored to balance compression and recognition performance effectively. This study also examines the influence of varying loss term weights on this balance. Experimental results indicate that fine-tuning the compression model alone improves classification accuracy by 0.71% and compression efficiency by 49.32%, while joint optimization achieves significant gains of 4.04% in accuracy and 89.12% in efficiency. Moreover, the findings demonstrate that the jointly optimized classification model maintains high accuracy on both compressed and uncompressed data, while the compression model reliably preserves image details, even at high compression rates.
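
The abstract describes a custom loss whose term weights trade compression against recognition, but this page does not give its exact form. As a minimal sketch only, assuming a weighted sum of a rate term (bits per pixel), a distortion term (mean squared error), and a classification term (cross-entropy), the balance could look like the following. The function names, the three-term structure, and the weights `w_rate`, `w_dist`, and `w_cls` are illustrative assumptions, not the authors' formulation.

```python
import math


def cross_entropy(logits, label):
    """Softmax cross-entropy for one sample, computed in a numerically stable way."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]


def mse(original, reconstructed):
    """Mean squared error between two equal-length pixel sequences."""
    return sum((a - b) ** 2 for a, b in zip(original, reconstructed)) / len(original)


def joint_loss(bpp, original, reconstructed, logits, label,
               w_rate=1.0, w_dist=1.0, w_cls=1.0):
    """Hypothetical joint objective: weighted rate + distortion + classification.

    This is an assumed form for illustration; the paper's actual loss may differ.
    """
    return (w_rate * bpp
            + w_dist * mse(original, reconstructed)
            + w_cls * cross_entropy(logits, label))


# Toy values (made up for the example, not taken from the paper).
original = [0.2, 0.5, 0.9, 0.1]
reconstructed = [0.25, 0.45, 0.85, 0.15]
logits = [2.0, 0.5, -1.0]   # classifier scores for three expression classes
label = 0                   # index of the true class
bpp = 0.3                   # estimated bits per pixel of the compressed image

# Sweeping the classification weight shows how that term comes to dominate the loss.
for w_cls in (0.0, 1.0, 10.0):
    total = joint_loss(bpp, original, reconstructed, logits, label, w_cls=w_cls)
    print(f"w_cls={w_cls}: loss={total:.4f}")
```

Under this assumed form, raising `w_cls` pushes training toward preserving discriminative features, while raising `w_rate` pushes it toward smaller bitstreams; a weight study such as the one mentioned in the abstract would explore this kind of trade-off.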
Problem

Research questions and friction points this paper is trying to address.

Lossy image compression degrades facial expression recognition accuracy
Balancing compression efficiency with feature preservation for FER tasks
Optimizing compression models to maintain recognition performance on compressed data
Innovation

Methods, ideas, or system contributions that make the work stand out.

End-to-end model balancing compression and recognition
Custom loss function optimizes feature preservation
Joint optimization boosts accuracy and compression efficiency
Xiumei Li
Multimedia Communications and Signal Processing, Friedrich-Alexander-Universität, Cauerstr. 7, 91058 Erlangen
Marc Windsheimer
Multimedia Communications and Signal Processing, Friedrich-Alexander-Universität, Cauerstr. 7, 91058 Erlangen
Misha Sadeghi
Machine Learning and Data Analytics Lab, Friedrich-Alexander-Universität, Carl-Thiersch-Str. 2b, 91052 Erlangen
Björn Eskofier
Machine Learning and Data Analytics Lab, Friedrich-Alexander-Universität, Carl-Thiersch-Str. 2b, 91052 Erlangen; Translational Digital Health Group, Institute of AI for Health, Helmholtz Zentrum München - German Research Center for Environmental Health, Ingolstädter Landstr. 1, 85764 Neuherberg
André Kaup
Professor, Friedrich-Alexander University Erlangen-Nuremberg
Image and Video Coding, Multimedia Signal Processing