Collaborative Static and Dynamic Vision-Language Streams for Spatio-Temporal Video Grounding, CVPR 2023
Hierarchical Semantic Correspondence Networks for Video Paragraph Grounding, CVPR 2023
Augmented 2D-TAN: A Two-stage Approach for Human-centric Spatio-Temporal Video Grounding, CVPR Workshop 2021 (The 3rd Person in Context (PIC) Challenge)
Predictive Feature Learning for Future Segmentation Prediction, ICCV 2021
Action-guided 3D Human Motion Prediction, NeurIPS 2021
APANet: Auto-Path Aggregation for Future Instance Segmentation Prediction, TPAMI 2021
Interactive Video Object Segmentation via Spatio-temporal Context Aggregation and Online Learning, CVPR Workshop 2019 (The 2019 DAVIS Challenge on Video Object Segmentation)
Predicting Future Instance Segmentation with Contextual Pyramid ConvLSTMs, ACM MM 2019
Background
Currently a master’s student at Sun Yat-Sen University, with research interests in vision-language cross-modal understanding and video understanding.