Haiwen Diao
Scholar

Haiwen Diao

Google Scholar ID: 46eCjHQAAAAJ
Nanyang Technological University
Computer VisionVision-and-LanguageTransfer LearningMultimodal LLM
Citations & Impact
All-time
Citations
763
 
H-index
10
 
i10-index
11
 
Publications
17
 
Co-authors
19
list available
Resume (English only)
Academic Achievements
  • Published works in vision-language retrieval: SGRAF (AAAI'21), RCAR (TIP'23), DBL (TIP'24), GSSF (TIP'24).
  • Published works in efficient transfer learning: UniPT (CVPR'24), SHERL (ECCV'24), ReSoRA (ACMMM'25).
  • Published works in multi-modality perception: EVE (NeurIPS'24), EVEv2 (ICCV'25), NEO (2025), DenseFusion (NeurIPS'24), Infinity-MM (2024), Visual Jigsaw (2025).
  • Published works in multi-modality generation: NOVA (ICLR'25), MoTrans (ACMMM'24).
  • Proposed ETT (NeurIPS'25) for multi-modality unification.
  • Multiple papers accepted or under review at top venues including NeurIPS, ICLR, ICCV, CVPR, ECCV, ACMMM, AAAI, and TIP.
  • Maintains open-source resource lists: Awesome_Matching_Pretraining_Transfering and Awesome_Image_Text_Retrieval_Benchmark.