Scholar
Han Yin
Google Scholar ID: K3pFKkIAAAAJ
Tongyi Speech Lab, Alibaba Group
Audio Understanding
Multimodal LLM
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
81
H-index
5
i10-index
2
Publications
20
Co-authors
13
list available
Contact
No contact links provided.
Publications
17 items
The First Environmental Sound Deepfake Detection Challenge: Benchmarking Robustness, Evaluation, and Insights
2026
Cited
0
Focus Then Listen: Exploring Plug-and-Play Audio Enhancer for Noise-Robust Large Audio Language Models
2026
Cited
0
PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio
2026
Cited
0
ESDD2: Environment-Aware Speech and Sound Deepfake Detection Challenge Evaluation Plan
2026
Cited
0
MoEScore: Mixture-of-Experts-Based Text-Audio Relevance Score Prediction for Text-to-Audio System Evaluation
arXiv.org · 2026
Cited
0
Can Large Audio Language Models Understand Audio Well? Speech, Scene and Events Understanding Benchmark for LALMs
2025
Cited
0
Dynamic Fusion Multimodal Network for SpeechWellness Detection
2025
Cited
0
ASCMamba: Multimodal Time-Frequency Mamba for Acoustic Scene Classification
2025
Cited
0
Load more
Resume (English only)
Co-authors
13 total
Jisheng Bai
School of Marine Science and Technology, Northwestern Polytechnical University
Mou Wang
Institute of Acoustics, Chinese Academy of Sciences
Yang Xiao
The University of Melbourne
Rohan Kumar Das
Fortemedia Singapore
Woon-Seng Gan
Professor of Audio Engineering and Director of Smart Nation Lab @ Nanyang Technological University,
Dongyuan Shi
Research Assistant Professor, Nanyang Technological University
Susanto Rahardja, FIEEE, FSEng, FAIIA, FAAIA
Zhejiang University
Chong Deng
alibaba group
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up