Scholar
Chao Weng
Google Scholar ID: pRA19-8AAAAJ
Anuttacon
Audio LLMs
Multimodal LLMs
Citations & Impact (all-time)
Citations: 4,190
H-index: 31
i10-index: 58
Publications: 20
Co-authors: 9 (list available)
Contact
No contact links provided.
Publications (4 items)
Speech-DRAME: A Framework for Human-Aligned Benchmarks in Speech Role-Play (2025). Cited: 0
LSZone: A Lightweight Spatial Information Modeling Architecture for Real-time In-car Multi-zone Speech Separation (2025). Cited: 0
VisuRiddles: Fine-grained Perception is a Primary Bottleneck for Multimodal Large Language Models in Abstract Visual Reasoning (2025). Cited: 0
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement (2025). Cited: 0
Co-authors (9 total)
Dong Yu (俞栋): Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA Fellow
Shinji Watanabe: Carnegie Mellon University
Co-author 3
Mike Seltzer: Facebook
Daniel Povey: Chief Speech Scientist, Xiaomi Corp.
Co-author 6
Jinyu Li: Partner Applied Science Manager, Microsoft
Co-author 8