Yalong Bai (白亚龙)
Scholar

Yalong Bai (白亚龙)

Google Scholar ID: iYMBoHwAAAAJ
Canva AI
Representation LearningMultimediaComputer Vision
Citations & Impact
All-time
Citations
1,655
 
H-index
16
 
i10-index
18
 
Publications
20
 
Co-authors
8
list available
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Top 3% Paper Award at ICASSP 2023
  • First Prize, Science and Technology Progress Award, China Society of Image and Graphics, 2022
  • Third Place in Open World Image Classification Challenge at CVPR 2021
  • First Place in AliProducts Challenge: Large-scale Product Recognition at CVPR 2020
  • Second Place in iMet: Fine-grained Attributes Recognition Challenge at CVPR 2020
  • First Place in iMaterialist Challenge on Product Recognition at CVPR 2019
  • First Place in Fieldguide Challenge: Moths and Butterflies at CVPR 2019
  • Second Place in iFood Challenge at FGVC workshop, CVPR 2019
  • Rank 1st in the track of without using extra data and 2nd in all teams at MSR Image Recognition Challenge at IEEE ICME 2016
  • ACM Multimedia 2015 Student Travel Grant
  • First Place in MSR-Bing Image Retreival Challenge at ACM MM 2014
Research Experience
  • Canva, CORE CN (2025.09 -- Now): Staff Research Scientist, working on multi-layer image generation, design editing, etc.
  • Du Xiaoman Financial, Multimedia Research Team for In2X (2024.05 -- 2025.09): Research Manager, leading multimodal content generation initiatives, including text-to-image (T2I), image-to-video (I2V), text-to-speech (TTS), any-to-any multimodal LLM and more.
  • JD AI Research, CV Lab (2018.02 -- 2023.06): Senior Researcher, working on snapshop, VQA, fine-grained recognition, relationships modeling in images, 3D imaging, etc.
  • Microsoft Research Asia, Web Search and Mining Group (2013.06 -- 2018.02): Research intern working on deep learning for image representation and computer vision.
  • Microsoft Research Asia, Web Search and Data Mining Group (2012.01 -- 2012.07): Research intern working on document retrieval results re-ranking.
Background
  • Compute Vision Researcher, with research interests in multimodal content generation, image-text multimodal correlation learning, etc.