Muhammad Uzair Khattak
Scholar

Muhammad Uzair Khattak

Google Scholar ID: M6fFL4gAAAAJ
EPFL
Computer VisionMulti-modal LearningVideo Processing
Citations & Impact
All-time
Citations
2,125
 
H-index
9
 
i10-index
9
 
Publications
17
 
Co-authors
17
list available
Resume (English only)
Background
  • Research interests include adapting foundational multi-modal models for vision tasks such as image recognition, object detection, and video action recognition. The goal is to steer these foundational models for downstream tasks with limited data (few-/zero-shot) while maintaining their pre-trained generalization for novel tasks.
Miscellany
  • Invited talks on multi-modal learning at Amazon Prime Video and Cohere For AI on the ProText work.