Xin Lai
Scholar

Xin Lai

Google Scholar ID: tqNDPA4AAAAJ
ByteDance
Multimodal UnderstandingMultimodal Agent
Citations & Impact
All-time
Citations
3,085
 
H-index
14
 
i10-index
14
 
Publications
16
 
Co-authors
15
list available
Resume (English only)
Academic Achievements
  • Published several papers, including Mini-o3, Step-DPO, and LISA. Some of these works have been accepted by top conferences such as CVPR and ICCV, and have gained significant attention on GitHub.
Research Experience
  • Working as a researcher at TikTok, previously involved in multiple research projects during his doctoral studies.
Education
  • Obtained a Ph.D. degree in 2024 from the Chinese University of Hong Kong (CUHK), supervised by Prof. Jiaya Jia and Prof. Liwei Wang; received a Bachelor's Degree from Harbin Institute of Technology (HIT) in 2020.
Background
  • Currently a researcher at TikTok, focusing on Large Multimodal Models (LMMs). Research interests include multimodal understanding and multimodal agents.
Miscellany
  • Actively hiring research interns, interested candidates are welcome to contact him via email with their resume.