Perceptual Noise-Masking with Music through Deep Spectral Envelope Shaping

📅 2025-02-24
📈 Citations: 0
✨ Influential: 0
📄 PDF
🤖 AI Summary
Conventional methods exhibit insufficient noise-masking capability for music in noisy environments. Method: This paper proposes an active masking enhancement approach based on deep spectral envelope reshaping. Leveraging psychoacoustic simultaneous masking, it introduces, for the first time, a differentiable psychoacoustic masking model into an end-to-end neural network training framework, jointly optimizing masking efficacy, musical fidelity, and perceptual loudness. The method employs a CNN-LSTM-based frequency-response prediction network, integrated with a custom perceptual loss function, and is trained on synthetically generated data simulating headphone listening conditions. Results: Experiments demonstrate significant improvements over state-of-the-art methods across multiple objective masking metrics; notably, masking depth in target noise frequency bands is substantially enhanced, while preserving the structural integrity of the original mix and maintaining subjective auditory consistency.

Technology Category

Application Category

📝 Abstract
People often listen to music in noisy environments, seeking to isolate themselves from ambient sounds. Indeed, a music signal can mask some of the noise's frequency components due to the effect of simultaneous masking. In this article, we propose a neural network based on a psychoacoustic masking model, designed to enhance the music's ability to mask ambient noise by reshaping its spectral envelope with predicted filter frequency responses. The model is trained with a perceptual loss function that balances two constraints: effectively masking the noise while preserving the original music mix and the user's chosen listening level. We evaluate our approach on simulated data replicating a user's experience of listening to music with headphones in a noisy environment. The results, based on defined objective metrics, demonstrate that our system improves the state of the art.
Problem

Research questions and friction points this paper is trying to address.

Enhance music's noise-masking ability
Reshape spectral envelope using neural networks
Balance noise masking and music preservation
Innovation

Methods, ideas, or system contributions that make the work stand out.

Deep Spectral Envelope Shaping
Psychoacoustic Masking Model
Perceptual Loss Function
🔎 Similar Papers
No similar papers found.