HuPER: A Human-Inspired Framework for Phonetic Perception

📅 2026-02-02
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This work addresses the challenges of speech perception modeling in low-resource, multilingual, and acoustically complex environments by drawing inspiration from human auditory mechanisms. It proposes an adaptive inference framework that integrates acoustic-phonological modeling with language-knowledge-guided adaptation. The approach achieves state-of-the-art performance with only 100 hours of training data and enables zero-shot transfer to 95 unseen languages. Notably, the model attains the lowest phoneme error rates on five English benchmarks, demonstrating substantial improvements in cross-lingual generalization capabilities.

Technology Category

Application Category

📝 Abstract
We propose HuPER, a human-inspired framework that models phonetic perception as adaptive inference over acoustic-phonetics evidence and linguistic knowledge. With only 100 hours of training data, HuPER achieves state-of-the-art phonetic error rates on five English benchmarks and strong zero-shot transfer to 95 unseen languages. HuPER is also the first framework to enable adaptive, multi-path phonetic perception under diverse acoustic conditions. All training data, models, and code are open-sourced. Code and demo avaliable at https://github.com/HuPER29/HuPER.
Problem

Research questions and friction points this paper is trying to address.

phonetic perception
zero-shot transfer
acoustic-phonetics
linguistic knowledge
adaptive inference
Innovation

Methods, ideas, or system contributions that make the work stand out.

human-inspired
adaptive inference
zero-shot transfer
phonetic perception
multi-path perception
🔎 Similar Papers
No similar papers found.
C
Chenxu Guo
Zhejiang University, China
Jiachen Lian
Jiachen Lian
UC Berkeley
precision healthcarespeech processingmachine learning
Y
Yisi Liu
University of California, Berkeley, USA
Baihe Huang
Baihe Huang
University of California, Berkeley
machine learning
S
Shriyaa Narayanan
University of California, Berkeley, USA
Cheol Jun Cho
Cheol Jun Cho
UC Berkeley, EECS
AIMachine LearningSpeech ProcessingNeuroscienceBrain-Computer Interfaces
G
Gopala Anumanchipalli
University of California, Berkeley, USA