Scholar
Yudong Yang
Google Scholar ID: NOQDMrAAAAAJ
Tsinghua University
Multimodal LLM
Speech Processing
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
38
H-index
4
i10-index
1
Publications
7
Co-authors
6
list available
Contact
No contact links provided.
Publications
12 items
Learning to Attend to Depression-Related Patterns: An Adaptive Cross-Modal Gating Network for Depression Detection
2026
Cited
0
From Speech to Profile: A Protocol-Driven LLM Agent for Psychological Profile Generation
2026
Cited
0
SPX-VIX Risk Computations Via Perturbed Optimal Transport
2026
Cited
0
Speech-Audio Compositional Attacks on Multimodal LLMs and Their Mitigation with SALMONN-Guard
2025
Cited
0
video-SALMONN S: Streaming Audio-Visual LLMs Beyond Length Limits via Memory
2025
Cited
0
UTI-LLM: A Personalized Articulatory-Speech Therapy Assistance System Based on Multimodal Large Language Model
2025
Cited
0
video-SALMONN 2: Captioning-Enhanced Audio-Visual Large Language Models
2025
Cited
0
ACVUBench: Audio-Centric Video Understanding Benchmark
2025
Cited
0
Load more
Resume (English only)
Co-authors
6 total
Guangzhi Sun
University of Cambridge
Changli Tang
Tsinghua University
Yixuan Li
Department of Electronic Engineering, Tsinghua University
Siyin Wang
Tsinghua University
Qiuqiang Kong
The Chinese University of Hong Kong
Zhan Liu
Tsinghua University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up