Scholar
Yiwen Shao
Google Scholar ID: CX2Eo2MAAAAJ
Johns Hopkins University
speech recognition
machine learning
deep learning
Natural Language Processing
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
358
H-index
8
i10-index
7
Publications
19
Co-authors
10
list available
Contact
No contact links provided.
Publications
11 items
Unlocking Strong Supervision: A Data-Centric Study of General-Purpose Audio Pre-Training Methods
2026
Cited
0
JAEGER: Joint 3D Audio-Visual Grounding and Reasoning in Simulated Physical Environments
2026
Cited
0
Towards Comprehensive Semantic Speech Embeddings for Chinese Dialects
2026
Cited
0
TagSpeech: End-to-End Multi-Speaker ASR and Diarization with Fine-Grained Temporal Grounding
2026
Cited
1
Revisiting Audio-language Pretraining for Learning General-purpose Audio Representation
2025
Cited
0
Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding
2025
Cited
0
Taming the Chaos: Coordinated Autoscaling for Heterogeneous and Disaggregated LLM Inference
2025
Cited
0
DualSpeechLM: Towards Unified Speech Understanding and Generation via Dual Speech Token Modeling with Large Language Models
2025
Cited
0
Load more
Resume (English only)
Co-authors
10 total
Sanjeev Khudanpur
The Johns Hopkins University
Dong Yu (俞栋)
Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow
Daniel Povey
Chief Speech Scientist, Xiaomi Corp.
Shinji Watanabe
Carnegie Mellon University
Co-author 5
Yiming Wang
Microsoft
Sonal Joshi
Johns Hopkins University
Jesús Villalba
Johns Hopkins University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up