- Ph.D. research: Working with Prof. Yan Yan in Computer Science at the University of Illinois Chicago
- Internships: Adobe, SonyAI, Tencent, SenseTime
- Visiting Scholar: University of Central Florida, working with Prof. Mubarak Shah
- Teaching Assistant: CS 577 (Deep Learning) at Illinois Institute of Technology
Education
- Degree: Ph.D. candidate
- University: University of Illinois Chicago
- Advisor: Prof. Yan Yan
- Expected Graduation: 2027
- Major: Computer Science
- Bachelor's Degree: Mathematics from Sun Yat-sen University
- Graduation Year: 2022
- Honors: Outstanding Student Scholarship, awarded each year
Background
- Research Interests: Fine-grained multimodal understanding across image, GUI, 3D, and video domains. Focuses on building multimodal large language models (e.g., Robin3D) with optimal paradigm design (e.g., ExpVG) and training strategies (e.g., GuirlVG). Explores how to scale up higher-quality data, design stronger supervision signals (e.g., AttBalance, SegVG), and establish better benchmarks (e.g., Intent3D). Also works on improving overall system efficiency (e.g., ACTRESS, 3DResT, INTP-Video-LLM), empowering AI agents (e.g., InfantAgent-Next), and making their decision-making mechanisms more interpretable (e.g., SaCo, TokenTM).