Published 10+ first/co-first author papers at top-tier Speech & NLP conferences (ACL, EMNLP, ICASSP, Interspeech, ASRU, SLT).
Best Paper Award at IEEE SLT 2022.
ICLR 2025 Notable Reviewer.
Multiple papers accepted by ASRU 2025, ACL 2025, EMNLP 2024, etc.
Released Full-Duplex-Bench, the first benchmark for full-duplex spoken dialogue models.
Received IEEE SPS Travel Grant for ICASSP 2024.
Internship work with Amazon Alexa accepted by ICASSP 2023.
Paper with Prof. Nigel Ward won Best Paper Award at IEEE SLT 2022!
ISCA Travel Grant for Interspeech 2022.
Two first-author papers accepted at Interspeech 2022.
Research Experience
2025 Fall: Meta Superintelligence Lab, Research Scientist Intern at the Voice Modeling Team in Menlo Park, USA, working with Naoyuki Kanda on full-duplex speech LLM.
2025 Spring: Google DeepMind, Student Researcher at Gemini Speech team (New York City), collaborating with Kartik Audhkhasi, Soheil Khorram, and Bhuvana Ramabhadran to develop methods enhancing Gemini speech capabilities in low-resource languages.
2024 Summer: Amazon AGI, Applied Scientist Intern at Speech team in Seattle, USA (under Ivan Bulyko’s team), working with Prashanth Gurunath Shivakumar, Yile Gu, and Ankur Gandhe on Align-SLM, the first end-to-end spoken language model with reinforcement learning.
2023 Summer: Amazon Alexa AI, Applied Scientist Intern at Speech Recognition and LM team in Seattle, USA (under Ivan Bulyko’s team), working with Prashanth Gurunath Shivakumar and Andreas Stolcke on a paralinguistics-enhanced LLM.
2022 Summer: Amazon Alexa AI, Applied Scientist Intern in Cambridge, USA (under Chao Wang’s team), working with Chieh-Chi Kao and Qingming Tang on acoustic event classification using neural architecture search.
Education
Ph.D. in Communication Engineering, EECS, National Taiwan University [2021/9 - 2025/12]. Advisor: Prof. Hung-yi Lee. Transferred from M.S. program in Feb. 2023.
Background
Research interests include Speech LLMs, Full-Duplex Interaction, Spoken Language Understanding / Generation, and Test-Time Adaptation for Automatic Speech Recognition. Currently a Ph.D. student at the Speech Processing and Machine Learning Lab, National Taiwan University, under the guidance of Prof. Hung-yi Lee.
Miscellany
Beyond academia, he enjoys singing, photography, and watching MLB games.