Has authored several publications in top-tier conferences and journals, including CVPR, AAAI, ACL, ACM-MM, and IEEE TIP. Also serves as a reviewer for esteemed conferences and journals such as NIPS, ECCV, AAAI, IEEE TIP, and IEEE TNNLS.
Research Experience
2024.11 - Present, Research Intern @ CFAR, A*STAR, focusing on Multimodal Agents for Image Generation.
2024.03 - 2024.10, Research Intern @ Shanghai AI Lab, focusing on Multimodal Agents for OS (Operating System).
2022.12 - 2024.03, Research Intern @ SGIT AI Lab, State Grid Corporation of China, focusing on Controllable Image Generation.
Ph.D. candidate in Computer Science at Xi'an Jiaotong University, supervised by Prof. Minnan Luo. Research interests include controllable image generation, multimodal learning, and object detection & identification.
Miscellany
Currently on the job market for Fall 2025. Please feel free to reach out!