- Paper 'Dual Caption Preference Optimization for Diffusion Models' accepted to Transactions on Machine Learning Research (TMLR), 2025
- Paper 'Evaluating Multimodal Large Language Models Across Distribution Shifts and Augmentations' presented at CVPR EvGenFM Workshop, 2024
- Paper 'Making the V in Text-VQA Matter' presented at CVPR O-DRUM Workshop, 2023
- Paper 'Weakly Supervised Visual Question Answer Generation' presented at CVPR O-DRUM Workshop, 2023
- Preprint 'ChartQA-X: Generating Explanations for Visual Chart Reasoning' released, June 2025
- Nominated as a reviewer for NeurIPS 2025, ACM MM 2025, ACL 2025, RBFM Workshop NeurIPS 2024
Research Experience
- Joined Bosch Global Software Technologies as a Software Development Intern, January 2023
- Involved in various research projects, including projects in the VQA domain
Education
- Master's in Computer Science at Arizona State University, advised by Yezhou Yang, started August 2023
- Bachelor's in Computer Science Engineering from KLE Technological University, advised by Shankar Gangisetty, graduated July 2023
Background
Research interests include computer vision, machine learning, generative AI, and natural language processing. Focused on combining information from different sources (like text, images, and video) to help machines develop more robust, reliable, and safe commonsense reasoning.
Miscellany
Thank you to Jon Barron for the source code for the website!