Single-Channel Distance-Based Source Separation for Mobile GPU in Outdoor and Indoor Environments

📅 2025-01-06
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of distance-unaware separation and poor deployability on mobile devices for single-channel speech separation in complex real-world acoustic environments—both indoor and outdoor. Methodologically, it introduces the first distance-aware source separation framework tailored for realistic scenarios: (i) extending distance-based separation to non-ideal outdoor conditions; (ii) proposing a Relation-aware Self-Attention (RSA) mechanism that models linear geometric relationships between distance and spectral features, preserving physical interpretability while enhancing contextual modeling efficiency; and (iii) adopting a two-stage Conformer architecture accelerated via TensorFlow Lite GPU delegation for on-device inference. Experiments demonstrate state-of-the-art separation performance across both indoor and outdoor settings, alongside significant improvements in mobile energy efficiency and real-time inference speed. The proposed approach establishes a new paradigm for robust, edge-deployable speech separation.

Technology Category

Application Category

📝 Abstract
This study emphasizes the significance of exploring distance-based source separation (DSS) in outdoor environments. Unlike existing studies that primarily focus on indoor settings, the proposed model is designed to capture the unique characteristics of outdoor audio sources. It incorporates advanced techniques, including a two-stage conformer block, a linear relation-aware self-attention (RSA), and a TensorFlow Lite GPU delegate. While the linear RSA may not capture physical cues as explicitly as the quadratic RSA, the linear RSA enhances the model's context awareness, leading to improved performance on the DSS that requires an understanding of physical cues in outdoor and indoor environments. The experimental results demonstrated that the proposed model overcomes the limitations of existing approaches and considerably enhances energy efficiency and real-time inference speed on mobile devices.
Problem

Research questions and friction points this paper is trying to address.

Audio Source Separation
Mobile Graphics Processor
Indoor and Outdoor Environment
Innovation

Methods, ideas, or system contributions that make the work stand out.

Mobile GPU
Monoaural Sound Source Separation
Outdoor Environment Optimization
🔎 Similar Papers
No similar papers found.
Hanbin Bae
Hanbin Bae
Speech Intelligence Team, SAMSUNG RESEARCH
Speech Analysis & Synthesis
Byungjun Kang
Byungjun Kang
AI Solution Team, Samsung Research
Speech & Sound AI6G
J
Jiwon Kim
AI Solution Team, Samsung Research, Seoul, Republic of Korea
J
Jaeyong Hwang
AI Solution Team, Samsung Research, Seoul, Republic of Korea
H
Hosang Sung
AI Solution Team, Samsung Research, Seoul, Republic of Korea
H
Hoon-Young Cho
AI Solution Team, Samsung Research, Seoul, Republic of Korea