Disentangling Hierarchical Features for Anomalous Sound Detection Under Domain Shift

📅 2025-01-03

📈 Citations: 0

✨ Influential: 0

career value

212K/year

🤖 AI Summary

Anomalous sound detection (ASD) suffers from degraded cross-domain generalization under domain shift, primarily due to inconsistent acoustic feature distributions caused by varying machine operating conditions. Method: This paper proposes a gradient reversal–driven hierarchical feature disentanglement framework. It is the first to jointly integrate domain-relevant/domain-irrelevant feature separation with meta-information–guided (e.g., region ID, acoustic attributes) fine-grained hierarchical modeling, implemented via gradient reversal layers, hierarchical encoders, and a meta-aware attention mechanism for adversarial feature disentanglement. Results: Evaluated on the DCASE 2022 Task 2 dataset, the method achieves significant improvements in AUC and pAUC, outperforming baseline methods by over 8.2% on average under severe domain shift. It substantially enhances model robustness and generalization capability against environmental and operational condition variations.

Technology Category

Application Category

📝 Abstract

Anomalous sound detection (ASD) encounters difficulties with domain shift, where the sounds of machines in target domains differ significantly from those in source domains due to varying operating conditions. Existing methods typically employ domain classifiers to enhance detection performance, but they often overlook the influence of domain-unrelated information. This oversight can hinder the model's ability to clearly distinguish between domains, thereby weakening its capacity to differentiate normal from abnormal sounds. In this paper, we propose a Gradient Reversal-based Hierarchical feature Disentanglement (GRHD) method to address the above challenge. GRHD uses gradient reversal to separate domain-related features from domain-unrelated ones, resulting in more robust feature representations. Additionally, the method employs a hierarchical structure to guide the learning of fine-grained, domain-specific features by leveraging available metadata, such as section IDs and machine sound attributes. Experimental results on the DCASE 2022 Challenge Task 2 dataset demonstrate that the proposed method significantly improves ASD performance under domain shift.

Problem

Research questions and friction points this paper is trying to address.

Anomaly Sound Detection

Environmental Variability

Model Generalization

Innovation

Methods, ideas, or system contributions that make the work stand out.

Gradient Reversal

Hierarchical Feature Decoupling

Anomaly Sound Detection

🔎 Similar Papers

Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation

2024-09-01InterspeechCitations: 2

Bosch Group

Renningen, BW, DE

Master Thesis Deep Learning for Multi-Channel Vision in Product Manufacturing

Bosch Group

Renningen, BW, DE

Authors to Follow