- VoLTA: Vision-Language Transformer with Weakly-Supervised Local-Feature Alignment
- Masked Siamese ConvNets
- TiCo: Transformation Invariance and Covariance Contrast for Self-Supervised Visual Representation Learning
Research Experience
Currently a fifth-year PhD candidate in Computer Science at NYU Courant. Visiting Researcher at FAIR, Meta, hosted by Zhuang Liu and Koustuv Sinha.
Education
PhD, Computer Science, New York University, 2020 - Now; Advisor: Yann LeCun
MSc, Computer Science, New York University, 2018 - 2020
BSc, Computer Science, The Hong Kong Polytechnic University, 2010 - 2015
Background
Research interests: self-supervised learning for images and videos, as well as pretraining vision encoders for vision-language models (VLMs). Also interested in understanding the design of all kinds of neural network architectures.