Published multiple papers in top conferences and journals such as NeurIPS, TPAMI, ICCV, CVPR, and AAAI. Notable works include:
- Panoptic Captioning: Seeking An Equivalency Bridge for Image and Text
- ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations
- Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization
- Mitigating the Human-Robot Domain Discrepancy in Visual Pre-training for Robotic Manipulation
- Decoupled Distillation to Erase: A General Unlearning Method for Any Class-centric Tasks
- ParGo: Bridging Vision-Language with Partial and Global Views
- Rethinking CLIP-based Video Learners in Cross-Domain Open-Vocabulary Action Recognition
Research Experience
Currently a post-doctoral research fellow at the University of Hong Kong, under the supervision of Prof. Kai Han.
Education
PhD from Sun Yat-sen University, supervised by Prof. Wei-Shi Zheng; visiting student at MMLab@NTU, supervised by Prof. Chen Change Loy and Prof. Henghui Ding; Bachelor's and Master's degrees from Sun Yat-sen University.
Background
Research interests include computer vision and machine learning.