DDSC: Dynamic Dual-Signal Curriculum for Data-Efficient Acoustic Scene Classification under Domain Shift

📅 2025-10-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Acoustic scene classification (ASC) suffers from domain shift induced by device heterogeneity, leading to substantial performance degradation under label scarcity. Conventional curriculum learning approaches employ static sample ordering or weighting schemes, failing to adapt to the dynamic evolution of sample difficulty and marginal utility during training. To address this, we propose a dynamic dual-signal curriculum learning framework that jointly models two time-varying signals: domain invariance—capturing cross-device commonalities—and learning progress—reflecting sample difficulty and model convergence status. These signals jointly inform real-time sample weighting: early training emphasizes domain-invariant features, while device-specific modeling is progressively incorporated later. Evaluated on the DCASE 2024 Task 1 benchmark, our method significantly improves cross-device classification accuracy across diverse labeling budgets and demonstrates superior generalization to unseen devices, validating the efficacy of data-efficient, adaptive curriculum design.

Technology Category

Application Category

📝 Abstract
Acoustic scene classification (ASC) suffers from device-induced domain shift, especially when labels are limited. Prior work focuses on curriculum-based training schedules that structure data presentation by ordering or reweighting training examples from easy-to-hard to facilitate learning; however, existing curricula are static, fixing the ordering or the weights before training and ignoring that example difficulty and marginal utility evolve with the learned representation. To overcome this limitation, we propose the Dynamic Dual-Signal Curriculum (DDSC), a training schedule that adapts the curriculum online by combining two signals computed each epoch: a domain-invariance signal and a learning-progress signal. A time-varying scheduler fuses these signals into per-example weights that prioritize domain-invariant examples in early epochs and progressively emphasize device-specific cases. DDSC is lightweight, architecture-agnostic, and introduces no additional inference overhead. Under the official DCASE 2024 Task~1 protocol, DDSC consistently improves cross-device performance across diverse ASC baselines and label budgets, with the largest gains on unseen-device splits.
Problem

Research questions and friction points this paper is trying to address.

Addresses device-induced domain shift in acoustic scene classification
Overcomes limitations of static curriculum learning approaches
Enhances cross-device performance under limited label conditions
Innovation

Methods, ideas, or system contributions that make the work stand out.

Dynamic curriculum adapts online using dual signals
Fuses domain-invariance and learning-progress signals
Prioritizes domain-invariant examples then device-specific cases
🔎 Similar Papers
No similar papers found.
Peihong Zhang
Peihong Zhang
Professor of Physics, University at Buffalo
condensed matter physicselectronic structure theorymaterials science
Y
Yuxuan Liu
School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Suzhou, China
R
Rui Sang
School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Suzhou, China
Zhixin Li
Zhixin Li
Syracuse University School of Information Studies
Social MachinesChildrenNon-human Relationship
Y
Yiqiang Cai
School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Suzhou, China
Y
Yizhou Tan
School of Advanced Technology, Xi’an Jiaotong-Liverpool University, Suzhou, China
Shengchen Li
Shengchen Li
Xi'an Jiaotong-Liverpool University
Machine Listening