2025: 'Hybrid-Level Instruction Injection for Video Token Compression in Multi-modal Large Language Models' published in CVPR 2025.
2025: 'Denoised and Dynamic Alignment Enhancement for Zero-Shot Learning' published in IEEE Transactions on Image Processing (TIP 2025).
2025: 'UFO: A Unified Approach to Fine-grained Visual Perception via Open-ended Language Interface' published in NeurIPS 2025 (Spotlight).
2025: 'What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Coverage of MLLMs' published in NeurIPS 2025.
2025: 'UniLiP: Adapting CLIP for Unified Multimodal Understanding, Generation and Editing' published in NeurIPS 2025.
2024: 'FuseTeacher: Modality-Fused Encoders are Strong Vision Supervisors' published in ECCV 2024.
2024: 'AlignZeg: Mitigating Objective Misalignment for Zero-shot Semantic Segmentation' published in ECCV 2024.
2024: 'Towards balanced alignment: Modal-enhanced semantic modeling for video moment retrieval' published in AAAI 2024.
2023: 'MomentDiff: Generative Video Moment Retrieval from Random to Real' published in NeurIPS 2023.
2023: 'Progressive Spatio-Temporal Prototype Matching for Text-Video Retrieval' published in ICCV 2023 (Oral Presentation, 2%).
2023: 'Balanced Classification: A Unified Framework for Long-Tailed Object Detection' published in IEEE Transactions on Multimedia (TMM 2023).
2022: 'Dual-Stream Knowledge-Preserving Hashing for Unsupervised Video Retrieval' published in ECCV 2022.
2022: 'Neighborhood-Adaptive Structure Augmented Metric Learning' published in AAAI 2022 (Oral Presentation, 4.5%).
2022: 'Deep Fourier Ranking Quantization for Semi-supervised Image Retrieval' published in IEEE Transactions on Image Processing (TIP 2022).
2022: 'Online Residual Quantization Via Streaming Data Correlation Preserving' published in IEEE Transactions on Multimedia (TMM 2022).
2022: 'Neighborhood-Adaptive Multi-cluster Ranking for Deep Metric Learning' published in AAAI 2022.
Research Experience
Currently a researcher at Alibaba TongYi Lab.
Education
Received Ph.D. degree from the University of Science and Technology of China (USTC) in 2024.
Background
Researcher. Research interests include general video understanding and generation.