Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Top 3% Paper Award at ICASSP 2023
First Prize, Science and Technology Progress Award, China Society of Image and Graphics, 2022
Third Place in Open World Image Classification Challenge at CVPR 2021
First Place in AliProducts Challenge: Large-scale Product Recognition at CVPR 2020
Second Place in iMet: Fine-grained Attributes Recognition Challenge at CVPR 2020
First Place in iMaterialist Challenge on Product Recognition at CVPR 2019
First Place in Fieldguide Challenge: Moths and Butterflies at CVPR 2019
Second Place in iFood Challenge at FGVC workshop, CVPR 2019
Rank 1st in the track of without using extra data and 2nd in all teams at MSR Image Recognition Challenge at IEEE ICME 2016
ACM Multimedia 2015 Student Travel Grant
First Place in MSR-Bing Image Retreival Challenge at ACM MM 2014
Research Experience
Canva, CORE CN (2025.09 -- Now): Staff Research Scientist, working on multi-layer image generation, design editing, etc.
Du Xiaoman Financial, Multimedia Research Team for In2X (2024.05 -- 2025.09): Research Manager, leading multimodal content generation initiatives, including text-to-image (T2I), image-to-video (I2V), text-to-speech (TTS), any-to-any multimodal LLM and more.
JD AI Research, CV Lab (2018.02 -- 2023.06): Senior Researcher, working on snapshop, VQA, fine-grained recognition, relationships modeling in images, 3D imaging, etc.
Microsoft Research Asia, Web Search and Mining Group (2013.06 -- 2018.02): Research intern working on deep learning for image representation and computer vision.
Microsoft Research Asia, Web Search and Data Mining Group (2012.01 -- 2012.07): Research intern working on document retrieval results re-ranking.
Background
Compute Vision Researcher, with research interests in multimodal content generation, image-text multimodal correlation learning, etc.