Wireless Hearables With Programmable Speech AI Accelerators

📅 2025-03-24

📈 Citations: 0

✨ Influential: 0

career value

243K/year

🤖 AI Summary

Battery- and size-constrained wireless hearing aids face a fundamental trade-off between high computational demands and strict low-power requirements for on-device real-time speech AI processing. To address this, we propose a fully on-device speech AI system featuring: (1) the first ear-worn platform integrating a programmable speech AI accelerator; (2) a low-latency dual-path CNN-RNN hybrid architecture enabling frame-level streaming inference at 6 ms frame shift; and (3) a hardware-software co-designed mixed-precision quantization framework with quantization-aware training. The system achieves real-time inference latency of 5.54 ms per frame and power consumption of only 71.6 mW. In a user study with 28 participants, it significantly outperforms existing on-device solutions in speech quality (PESQ) and noise suppression performance, thereby overcoming the deployment bottleneck for streaming deep learning on ultra-compact wearable devices.

Technology Category

Application Category

📝 Abstract

The conventional wisdom has been that designing ultra-compact, battery-constrained wireless hearables with on-device speech AI models is challenging due to the high computational demands of streaming deep learning models. Speech AI models require continuous, real-time audio processing, imposing strict computational and I/O constraints. We present NeuralAids, a fully on-device speech AI system for wireless hearables, enabling real-time speech enhancement and denoising on compact, battery-constrained devices. Our system bridges the gap between state-of-the-art deep learning for speech enhancement and low-power AI hardware by making three key technical contributions: 1) a wireless hearable platform integrating a speech AI accelerator for efficient on-device streaming inference, 2) an optimized dual-path neural network designed for low-latency, high-quality speech enhancement, and 3) a hardware-software co-design that uses mixed-precision quantization and quantization-aware training to achieve real-time performance under strict power constraints. Our system processes 6 ms audio chunks in real-time, achieving an inference time of 5.54 ms while consuming 71.6 mW. In real-world evaluations, including a user study with 28 participants, our system outperforms prior on-device models in speech quality and noise suppression, paving the way for next-generation intelligent wireless hearables that can enhance hearing entirely on-device.

Problem

Research questions and friction points this paper is trying to address.

Enabling real-time speech AI on battery-constrained wireless hearables

Bridging deep learning and low-power hardware for speech enhancement

Achieving efficient on-device noise suppression under strict power limits

Innovation

Methods, ideas, or system contributions that make the work stand out.

Wireless hearable platform with speech AI accelerator

Optimized dual-path neural network for speech enhancement

Hardware-software co-design with mixed-precision quantization

🔎 Similar Papers

Artificial Intelligence for Cochlear Implants: Review of Strategies, Challenges, and Perspectives