TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation (ACM MM, 2025)
Research Experience
Worked in the video game industry at Virtuos. Current research focuses on video analysis, controllable video/image generation, and non-rigid object manipulation in robotics.
Education
PhD from the College of Computer Science at Zhejiang University, supervised by Prof. Jianke Zhu. Visited the University of California, San Diego in 2018 and worked with Prof. Michael Yip in the ECE department. Worked as a part-time researcher at Alibaba-Zhejiang University Joint Institute of Frontier Technologies during his PhD.
Background
Associate Professor at the School of Computer Science and Technology, East China Normal University. Leading the Visual Perception + X group, focusing on the intersection of computer vision, computer graphics, and robotics.