Scholar
Yong Man Ro
Google Scholar ID: IPzfF7cAAAAJ
Professor of Electrical Engineering, KAIST, ICT Endowed Chair Professor
Multimodal learning
Vision Language integration
Image processing and Computer vision
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
6,408
H-index
38
i10-index
153
Publications
20
Co-authors
31
list available
Contact
No contact links provided.
Publications
10 items
STRIDE: When to Speak Meets Sequence Denoising for Streaming Video Understanding
2026
Cited
0
Recursive Think-Answer Process for LLMs and VLMs
2026
Cited
0
GCAgent: Long-Video Understanding via Schematic and Narrative Episodic Memory
2025
Cited
0
Emotion-Coherent Reasoning for Multimodal LLMs via Emotional Rationale Verifier
2025
Cited
0
Unified Reinforcement and Imitation Learning for Vision-Language Models
2025
Cited
0
Towards Inclusive Communication: A Unified LLM-Based Framework for Sign Language, Lip Movements, and Audio Understanding
2025
Cited
0
Remote Sensing Large Vision-Language Model: Semantic-augmented Multi-level Alignment and Semantic-aware Expert Modeling
2025
Cited
0
Language-guided Learning for Object Detection Tackling Multiple Variations in Aerial Images
2025
Cited
0
Load more
Resume (English only)
Co-authors
31 total
Co-author 1
Konstantinos N Plataniotis
Professor, ECE Department, University of Toronto
Co-author 3
Wesley De Neve
Associate Professor at Ghent University (Belgium) & Ghent University Global Campus (Korea)
Truong, Cong Thang
The University of Aizu
Co-author 6
Minsu Kim
Google DeepMind
Seong Tae Kim
Assistant Professor of Computer Science, Kyung Hee University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up