- 1 paper on audio-visual speech denoising accepted to IEEE TMM
- 1 paper accepted to CVPR 2025
- Patent on audio-visual speech denoising accepted
- 1 paper accepted to CVIU (Journal) 2022
- 1 paper accepted to WACV 2022 and CVPRW 2021
- 1 paper accepted to CVPR 2021
- 1 paper accepted to WACV 2021
- 1 paper accepted to WACV 2020
- 1 paper accepted to NCC 2017
- Outstanding Reviewer @ ICML 2022
Research Experience
- Chief Engineer at Samsung R&D Institute India-Bangalore (SRI-B)
- Postdoctoral fellow at the University of Bristol, working with Prof. Dima Damen as part of the EPSRC VisualAI Grant
- Research Scientist at TensorTour Inc.
Education
- Ph.D. in Computer Science and Engineering from the Indian Institute of Technology Kanpur, advised by Dr. Gaurav Sharma and Prof. Manindra Agrawal
- Master’s in Medical Imaging and Informatics from the Indian Institute of Technology Kharagpur, advised by Dr. Rajiv Ranjan Sahay and Prof. Pranab Kumar Dutta
Background
Research interests: the intersection of computer vision and machine learning, with a particular emphasis on multimodal perception. His current work focuses on audio generation conditioned on multimodal inputs, with the aim of improving cross-modal synthesis and representation learning.