Scholar

Zhehuai Chen

Google Scholar ID: AZrMB-AAAAAJ

NVIDIA

Speech RecognitionSpeech SynthesisLLM

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

2,224

H-index

22

i10-index

49

Publications

20

Co-authors

35

list available

Contact

No contact links provided.

Publications

15 items

Full-Duplex-Bench-v3: Benchmarking Tool Use for Full-Duplex Voice Agents Under Real-World Disfluency

2026

Cited

0

How Auditory Knowledge in LLM Backbones Shapes Audio Language Models: A Holistic Evaluation

2026

Cited

0

Speech-Hands: A Self-Reflection Voice Agentic Approach to Speech Recognition and Audio Reasoning with Omni Perception

2026

Cited

0

SAKE: Towards Editing Auditory Attribute Knowledge of Large Audio-Language Models

2025

Cited

0

Investigating Safety Vulnerabilities of Large Audio-Language Models Under Speaker Emotional Variations

2025

Cited

0

SpeechIQ: Speech Intelligence Quotient Across Cognitive Levels in Voice Understanding Large Language Models

2025

Cited

0

DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment

2025

Cited

0

Word Level Timestamp Generation for Automatic Speech Recognition and Translation

2025

Cited

0

Resume (English only)

Co-authors

35 total

Bhuvana Ramabhadran

Director/Principal Research Scientist, Google DeepMind

Andrew Rosenberg

Google DeepMind

Kai Yu（俞凯）

Shanghai Jiao Tong University

Head of Research, ByteDance Seed