Currently a researcher at ByteDance Seed Team, working on cutting-edge large multimodal models and world models.
Education
Ph.D.: HUST Vision Lab, Huazhong University of Science and Technology
Background
Research Interests: Enabling machines/robots to comprehend world knowledge and interact with environments like human beings. Specialization: Large multimodal models and foundational visual-language models.