🤖 AI Summary
This work addresses the performance degradation of WiFi-based gesture recognition in cross-environment deployments caused by distribution shifts. To overcome the limitations of conventional approaches that rely on physical-domain labels, the authors propose GesFi, a novel system that introduces a latent domain discovery mechanism to automatically uncover and implicitly align domain factors responsible for distribution shifts directly from raw WiFi signals. GesFi integrates CSI-ratio denoising, short-time Fourier transform, class-level adversarial learning, and unsupervised clustering to achieve robust cross-domain recognition without requiring physical labels. Experimental results demonstrate that GesFi improves performance by up to 78% in cross-domain tasks under both single-pair and multi-pair transceiver configurations, significantly outperforming state-of-the-art generalization methods.
📝 Abstract
In this paper, we propose GesFi, a novel WiFi-based gesture recognition system that introduces WiFi latent domain mining to redefine domains directly from the data itself. GesFi first processes raw sensing data collected from WiFi receivers using CSI-ratio denoising, Short-Time Fast Fourier Transform, and visualization techniques to generate standardized input representations. It then employs class-wise adversarial learning to suppress gesture semantic and leverages unsupervised clustering to automatically uncover latent domain factors responsible for distributional shifts. These latent domains are then aligned through adversarial learning to support robust cross-domain generalization. Finally, the system is applied to the target environment for robust gesture inference. We deployed GesFi under both single-pair and multi-pair settings using commodity WiFi transceivers, and evaluated it across multiple public datasets and real-world environments. Compared to state-of-the-art baselines, GesFi achieves up to 78% and 50% performance improvements over existing adversarial methods, and consistently outperforms prior generalization approaches across most cross-domain tasks.