Handong Li

Google Scholar ID: -LnWwgIAAAAJ
Institute of Automation, Chinese Academy of Sciences
Research interests: cross-modality pretraining
Citations & Impact
All-time
  • Citations: 279
  • H-index: 3
  • i10-index: 3
  • Publications: 6
  • Co-authors: 10
Academic Achievements
  • "Breaking the Encoder Barrier for Seamless Video-Language Understanding", CoRR 2025
  • "Scaling Omni-modal Pretraining with Multimodal Context", ICLR 2025 (Withdrawn)
  • "Explore the Limits of Omni-modal Pretraining at Scale", CoRR 2024
  • "COSA: Concatenated Sample Pretrained Vision-Language Foundation Model", ICLR 2024
  • "VAST: A Vision-Audio-Subtitle-Text Omni-Modality Foundation Model and Dataset", NeurIPS 2023
  • "Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner", ACM Multimedia 2023