Published works in vision-language retrieval: SGRAF (AAAI'21), RCAR (TIP'23), DBL (TIP'24), GSSF (TIP'24).
Published works in efficient transfer learning: UniPT (CVPR'24), SHERL (ECCV'24), ReSoRA (ACMMM'25).
Published works in multi-modality perception: EVE (NeurIPS'24), EVEv2 (ICCV'25), NEO (2025), DenseFusion (NeurIPS'24), Infinity-MM (2024), Visual Jigsaw (2025).
Published works in multi-modality generation: NOVA (ICLR'25), MoTrans (ACMMM'24).
Proposed ETT (NeurIPS'25) for multi-modality unification.
Multiple papers accepted or under review at top venues including NeurIPS, ICLR, ICCV, CVPR, ECCV, ACMMM, AAAI, and TIP.
Maintains open-source resource lists: Awesome_Matching_Pretraining_Transfering and Awesome_Image_Text_Retrieval_Benchmark.