Papers accepted to NeurIPS 2025, IJCAI 2025, TMM, and more; awarded as a top reviewer in NeurIPS 2024; released a curated multimodal reasoning MLLMs collection repository; several preprint papers under review.
Research Experience
Started a research internship at Spotify in June 2024; involved in multiple research projects such as URPA, INT, etc.; gave talks at BMVA workshop and OpenCompass.
Education
Before joining the Computer Vision Group at QMUL, he was at Shanghai Jiao Tong University, supervised by Prof. Hongya Tuo and working closely with Prof. Junchi Yan; currently studying at Queen Mary, University of London, supervised by Prof. Shaogang Gong.
Background
Currently a final year PhD student in the Computer Vision Group at the School of Electronic Engineering and Computer Science, Queen Mary, University of London, focusing on deep learning and computer vision, with an emphasis on test-time training and adaptation of Multi-modal Large Language Models (MLLMs) and their applications.