Has published multiple papers on topics such as embodied cognition evaluation, spatial mental modeling, and multi-turn visual state reasoning, and received awards including the Best Paper Award at ICCV 2025 (SP4V Workshop).
Research Experience
Collaborates closely with the Stanford Vision and Learning Lab (SVL), working with Prof. Li Fei-Fei and Prof. Jiajun Wu on spatial intelligence and embodied agents.
Education
Currently a second-year Ph.D. student in Computer Science at Northwestern University, advised by Prof. Manling Li; received a bachelor's degree from Zhejiang University.
Background
CS Ph.D. student with research interests in applying foundation models to embodied agents, spatial intelligence, and decision-making.
Miscellany
Looking for 2026 summer internships focused on foundation models (MLLMs) for embodied agents.