Publications: Latent Visual Reasoning, QLIP: A Dynamic Quadtree Vision Prior Enhances MLLM Performance Without Retraining, BLINK: Multimodal Large Language Models Can See but Not Perceive, etc.; Awards: Best Demo Paper for COVID-19 Literature Knowledge Graph Construction and Drug Repurposing Report Generation at NAACL 2021.
Research Experience
Fall 2025: Student Researcher at Google, Advisors: Chen Qu, Jianmo Ni; Summer 2025: Research Intern at AMD GenAI, Advisors: Ximeng Sun, Zicheng Liu.
Education
Bachelor's Degree: University of Illinois at Urbana-Champaign, Advisor: Dr. Jiawei Han; Doctoral Degree: UC Davis, Advisor: Dr. Muhao Chen.
Background
Research Interests: Multimodal Large Language Models (MLLMs), with a focus on understanding and reasoning across text and vision; Professional Field: Computer Science; Brief Introduction: PhD candidate in Computer Science at UC Davis, previously at the University of Southern California.
Miscellany
Actively seeking full-time opportunities; Expected to graduate in Summer 2026.