Scholar

Guan-Ting Lin

Google Scholar ID: gojQWGIAAAAJ

National Taiwan University

Speech ProcessingNature Language ProcessingMachine Learning

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,521

H-index

i10-index

Publications

Co-authors

list available

Contact

Emaildaniel094144@gmail.com CVOpen ↗TwitterOpen ↗GitHubOpen ↗LinkedInOpen ↗

Publications

11 items

Escaping the Procrustean Bed: Groupwise Orthogonal Connectors for Audio-Language Models

2026

Cited

Rethinking Entropy Minimization in Test-Time Adaptation for Autoregressive Models

2026

Cited

ASPIRin: Action Space Projection for Interactivity-Optimized Reinforcement Learning in Full-Duplex Speech Language Models

2026

Cited

Full-Duplex-Bench-v3: Benchmarking Tool Use for Full-Duplex Voice Agents Under Real-World Disfluency

2026

Cited

AV-EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Omni-modal LLMS with Audio-visual Cues

2025

Cited

Game-Time: Evaluating Temporal Dynamics in Spoken Language Models

2025

Cited

EMO-Reasoning: Benchmarking Emotional Reasoning Capabilities in Spoken Dialogue Systems

2025

Cited

SUTA-LM: Bridging Test-Time Adaptation and Language Model Rescoring for Robust ASR

2025

Cited

Resume (English only)

Academic Achievements

Published 10+ first/co-first author papers at top-tier Speech & NLP conferences (ACL, EMNLP, ICASSP, Interspeech, ASRU, SLT).
Best Paper Award at IEEE SLT 2022.
ICLR 2025 Notable Reviewer.
Multiple papers accepted by ASRU 2025, ACL 2025, EMNLP 2024, etc.
Released Full-Duplex-Bench, the first benchmark for full-duplex spoken dialogue models.
Received IEEE SPS Travel Grant for ICASSP 2024.
Internship work with Amazon Alexa accepted by ICASSP 2023.
Paper with Prof. Nigel Ward won Best Paper Award at IEEE SLT 2022!
ISCA Travel Grant for Interspeech 2022.
Two first-author papers accepted at Interspeech 2022.

Research Experience

2025 Fall: Meta Superintelligence Lab, Research Scientist Intern at the Voice Modeling Team in Menlo Park, USA, working with Naoyuki Kanda on full-duplex speech LLM.
2025 Spring: Google DeepMind, Student Researcher at Gemini Speech team (New York City), collaborating with Kartik Audhkhasi, Soheil Khorram, and Bhuvana Ramabhadran to develop methods enhancing Gemini speech capabilities in low-resource languages.
2024 Summer: Amazon AGI, Applied Scientist Intern at Speech team in Seattle, USA (under Ivan Bulyko’s team), working with Prashanth Gurunath Shivakumar, Yile Gu, and Ankur Gandhe on Align-SLM, the first end-to-end spoken language model with reinforcement learning.
2023 Summer: Amazon Alexa AI, Applied Scientist Intern at Speech Recognition and LM team in Seattle, USA (under Ivan Bulyko’s team), working with Prashanth Gurunath Shivakumar and Andreas Stolcke on a paralinguistics-enhanced LLM.
2022 Summer: Amazon Alexa AI, Applied Scientist Intern in Cambridge, USA (under Chao Wang’s team), working with Chieh-Chi Kao and Qingming Tang on acoustic event classification using neural architecture search.

Education

Ph.D. in Communication Engineering, EECS, National Taiwan University [2021/9 - 2025/12]. Advisor: Prof. Hung-yi Lee. Transferred from M.S. program in Feb. 2023.

Background

Research interests include Speech LLMs, Full-Duplex Interaction, Spoken Language Understanding / Generation, and Test-Time Adaptation for Automatic Speech Recognition. Currently a Ph.D. student at the Speech Processing and Machine Learning Lab, National Taiwan University, under the guidance of Prof. Hung-yi Lee.

Miscellany