In 2024, two papers were accepted by NeurIPS 2024: 'DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation' and 'Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model'. Another paper 'InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning' was accepted by the 4th MATH-AI Workshop at NeurIPS 24. Additionally, a paper 'COCO is “ALL’’ You Need for Visual Instruction Fine-tuning' was accepted by ICME 2024.
Research Experience
Senior Research Scientist at ByteDance Seed; Senior Applied Scientist at Microsoft Azure AI Computer Vision Team.
Education
M.S. from Duke University; B.S. from University of Science and Technology of China (USTC).
Background
Currently a Researcher at OpenAI, focusing on multimodal. Formerly a Senior Research Scientist at ByteDance Seed and a Senior Applied Scientist at Microsoft Azure AI Computer Vision Team. Research interests include computer vision, multimodal, reinforcement learning, and deep learning.