Scholar
Zhehuai Chen
Google Scholar ID: AZrMB-AAAAAJ
NVIDIA
Speech Recognition
Speech Synthesis
LLM
Citations & Impact
All-time
Citations
2,224
H-index
22
i10-index
49
Publications
20
Co-authors
35
list available
Publications
15 items
Full-Duplex-Bench-v3: Benchmarking Tool Use for Full-Duplex Voice Agents Under Real-World Disfluency
2026
Cited
0
How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation
2026
Cited
0
Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception
2026
Cited
0
SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models
2025
Cited
0
Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations
2025
Cited
0
SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models
2025
Cited
0
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment
2025
Cited
0
Word Level Timestamp Generation for Automatic Speech Recognition and Translation
2025
Cited
0
Co-authors
35 total
Bhuvana Ramabhadran
Director/Principal Research Scientist, Google DeepMind
Andrew Rosenberg
Google DeepMind
Boris Ginsburg
NVIDIA
Gary Wang
Google
Kai Yu(俞凯)
Shanghai Jiao Tong University
Yonghui Wu
Head of Research, ByteDance Seed