Several papers have been accepted at top conferences and journals, including NeurIPS, SIGGRAPH Asia, CVPR, MIA, IEEE TMI, AAAI, and ICCV. Notable publications include 'Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding' and 'HOComp: Interaction-Aware Human-Object Composition'.
Research Experience
During his Ph.D. studies, he has been involved in multiple research projects, including open-source projects such as HunyuanWorld 1.0.
Education
Ph.D. Candidate: City University of Hong Kong, supervised by Prof. Rynson W.H. Lau; Master's Degree: Dalian University of Technology; Bachelor's Degree: Zhengzhou University.
Background
Research interests include computer vision and graphics. Currently a third-year Ph.D. candidate at City University of Hong Kong, supervised by Prof. Rynson W.H. Lau. Has collaborated with Dr. Tengfei Wang of Tencent Hunyuan3D, Dr. Ke Xu of CityU HK, and Prof. Qing Guo of A*STAR.