Han Yin

Google Scholar ID: K3pFKkIAAAAJ

Tongyi Speech Lab, Alibaba Group

Audio Understanding, Multimodal LLM

Citations & Impact

All-time

Citations

81

H-index

5

i10-index

2

Publications

20

Co-authors

13

Publications

22 items

AudioDER: A Deduplication-Enhanced Reasoning Dataset for Post-Training Large Audio-Language Models

2026

Cited

0

Overview of ESDD2: Environment-Aware Speech and Sound Deepfake Detection Challenge

2026

Cited

0

Why Can't They Remember? Uncovering Representation and Retrieval Bottlenecks in Multi-Turn Acoustic Memory

2026

Cited

0

ESI-Bench: Towards Embodied Spatial Intelligence that Closes the Perception-Action Loop

2026

Cited

0

Towards Generalist Game Players: An Investigation of Foundation Models in the Game Multiverse

2026

Cited

0

The First Environmental Sound Deepfake Detection Challenge: Benchmarking Robustness, Evaluation, and Insights

2026

Cited

0

Focus Then Listen: Exploring Plug-and-Play Audio Enhancer for Noise-Robust Large Audio Language Models

2026

Cited

0

PolyBench: A Benchmark for Compositional Reasoning in Polyphonic Audio

2026

Cited

0

Co-authors

13 total

School of Marine Science and Technology, Northwestern Polytechnical University

Institute of Acoustics, Chinese Academy of Sciences

The University of Melbourne

Rohan Kumar Das

Fortemedia Singapore

Professor of Audio Engineering and Director of Smart Nation Lab @ Nanyang Technological University

Research Assistant Professor, Nanyang Technological University

Susanto Rahardja, FIEEE, FSEng, FAIIA, FAAIA

Zhejiang University