An Overview of the JPEG AI Learning-Based Image Coding Standard

📅 2025-10-13
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Conventional image coding standards struggle to simultaneously optimize for human visual fidelity and downstream machine vision task performance. Method: JPEG AI introduces the first international learned image coding standard jointly targeting human perception and machine understanding. It employs an end-to-end differentiable deep learning architecture that jointly optimizes multiple perceptual quality metrics—including MS-SSIM, FSIM, VIF, and VMAF—to achieve a unified representation supporting both human visualization and machine analysis within a single compact bitstream. Contribution/Results: Compared to JPEG and VVC, JPEG AI achieves average BD-rate reductions of 15–30% across key perceptual metrics (e.g., PSNR-HVS, IW-SSIM, NLPD), significantly improving compression efficiency and cross-device/application interoperability. Notably, it marks the first systematic integration of deep learning–based image coding into the ISO/IEC standardization framework.

Technology Category

Application Category

📝 Abstract
JPEG AI is an emerging learning-based image coding standard developed by Joint Photographic Experts Group (JPEG). The scope of the JPEG AI is the creation of a practical learning-based image coding standard offering a single-stream, compact compressed domain representation, targeting both human visualization and machine consumption. Scheduled for completion in early 2025, the first version of JPEG AI focuses on human vision tasks, demonstrating significant BD-rate reductions compared to existing standards, in terms of MS-SSIM, FSIM, VIF, VMAF, PSNR-HVS, IW-SSIM and NLPD quality metrics. Designed to ensure broad interoperability, JPEG AI incorporates various design features to support deployment across diverse devices and applications. This paper provides an overview of the technical features and characteristics of the JPEG AI standard.
Problem

Research questions and friction points this paper is trying to address.

Developing a learning-based image coding standard for compression
Targeting both human visualization and machine consumption applications
Achieving significant bitrate reduction over existing image standards
Innovation

Methods, ideas, or system contributions that make the work stand out.

Learning-based image coding standard for compression
Single-stream compact representation for human and machine use
Significant BD-rate reductions across multiple quality metrics
🔎 Similar Papers
No similar papers found.
S
Semih Esenlik
Bytedance Inc., 8910 University Center Ln, San Diego, CA 92122, USA
Yaojun Wu
Yaojun Wu
Bytedance Inc.
Data compression、Deep learning
Zhaobin Zhang
Zhaobin Zhang
Bytedance Inc., 8910 University Center Ln, San Diego, CA 92122, USA
Y
Ye-Kui Wang
Bytedance Inc., 8910 University Center Ln, San Diego, CA 92122, USA
K
Kai Zhang
Bytedance Inc., 8910 University Center Ln, San Diego, CA 92122, USA
L
Li Zhang
Bytedance Inc., 8910 University Center Ln, San Diego, CA 92122, USA
João Ascenso
João Ascenso
Instituto Superior Técnico - Instituto de Telecomunicações
multimedia signal processingvideo streamingimage and video retrieval
S
Shan Liu
Tencent Media Lab, 2747 Park Blvd, Palo Alto, CA 94306 USA