Exploring Perceptual Audio Quality Measurement on Stereo Processing Using the Open Dataset of Audio Quality

📅 2025-12-11
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This study investigates how stereo processing—specifically Mid/Side versus Left/Right encoding—affects subjective audio quality perception, and evaluates the predictive accuracy of mainstream objective metrics (e.g., PEAQ, ITU-R BS.1387, DNSMOS) under spatial distortions. Leveraging the ODAQ stereo extension dataset with corresponding Mean Opinion Scores (MOS), we conduct time–frequency domain metric comparisons and statistical modeling. Our analysis quantitatively reveals, for the first time, the critical interplay between bottom-up auditory mechanisms and top-down contextual factors in stereo quality prediction. Results show that timbre-oriented metrics remain robust under simple distortions but degrade significantly under spatial distortions; current models exhibit systematic bias due to their neglect of spatial dimensions. We propose a novel three-dimensional perceptual evaluation paradigm integrating temporal, spectral, and spatial cues—providing both theoretical foundation and methodological support for next-generation audio quality metrics.

Technology Category

Application Category

📝 Abstract
ODAQ (Open Dataset of Audio Quality) provides a comprehensive framework for exploring both monaural and binaural audio quality degradations across a range of distortion classes and signals, accompanied by subjective quality ratings. A recent update of ODAQ, focusing on the impact of stereo processing methods such as Mid/Side (MS) and Left/Right (LR), provides test signals and subjective ratings for the in-depth investigation of state-of-the-art objective audio quality metrics. Our evaluation results suggest that, while timbre-focused metrics often yield robust results under simpler conditions, their prediction performance tends to suffer under the conditions with a more complex presentation context. Our findings underscore the importance of modeling the interplay of bottom-up psychoacoustic processes and top-down contextual factors, guiding future research toward models that more effectively integrate both timbral and spatial dimensions of perceived audio quality.
Problem

Research questions and friction points this paper is trying to address.

Evaluating stereo processing impact on audio quality
Testing objective metrics with subjective ratings dataset
Improving models for timbral and spatial quality perception
Innovation

Methods, ideas, or system contributions that make the work stand out.

Using ODAQ dataset for stereo audio quality evaluation
Assessing Mid/Side and Left/Right stereo processing methods
Integrating timbral and spatial dimensions in quality models
🔎 Similar Papers
No similar papers found.
P
Pablo M. Delgado
Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany
S
Sascha Dick
Fraunhofer Institute for Integrated Circuits IIS, Erlangen, Germany
C
Christoph Thompson
Ball State University, Muncie, USA
Chih-Wei Wu
Chih-Wei Wu
Audio Algorithms, Netflix, Inc
Music Information Retrievalaudio signal processingsound analysis and synthesis
P
Phillip A. Williams
Netflix, Inc., Los Gatos, USA