Zhixian Zhao
Google Scholar ID: WuR350wAAAAJ
Northwestern Polytechnical University
Emotion Speech Recognition
Understanding and Generation
Citations & Impact
All-time
Citations
129
H-index
4
i10-index
3
Publications
11
Co-authors
0
Contact
No contact links provided.
Publications
11 items
Seeing the Context: Rich Visual Context-Aware Speech Recognition via Multimodal Reasoning
2026
Cited
0
EmoOmni: Bridging Emotional Understanding and Expression in Omni-Modal LLMs
2026
Cited
0
Integrating Fine-Grained Audio-Visual Evidence for Robust Multimodal Emotion Reasoning
2026
Cited
0
dLLM-ASR: A Faster Diffusion LLM-based Framework for Speech Recognition
2026
Cited
0
The ICASSP 2026 HumDial Challenge: Benchmarking Human-like Spoken Dialogue Systems in the LLM Era
arXiv.org · 2026
Cited
1
Serial-Parallel Dual-Path Architecture for Speaking Style Recognition
2025
Cited
0
OSUM-EChat: Enhancing End-to-End Empathetic Spoken Chatbot via Understanding-Driven Spoken Dialogue
2025
Cited
0
DualDub: Video-to-Soundtrack Generation via Joint Speech and Background Audio Synthesis
2025
Cited
0