- Publications: Two papers accepted at ICML 2025, 'UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction' and 'LIVS: A Pluralistic Alignment Dataset for Inclusive Public Spaces'; one paper accepted at ICLR 2025, 'BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks'.
- Awards: Received the Mila EDI in research scholarship of 8000 CAD for research on cultural understanding in AI systems; received the ESP AI Scholarship of 5000 CAD from the University of Montreal.
Research Experience
- Current Work: Working at ServiceNow Research, focusing on multimodal desktop agents, building the UI-Vision dataset, and developing foundational visual grounding agents for graphical user interfaces.
- Past Experience: Spent two years at Microsoft working on the PowerPoint mobile team; worked on reinforcement learning for energy management in microgrid networks at IISc; did dubbing work at LT Lab in Hamburg; and conducted machine translation for low-resource languages at the University of Toronto.
Education
- Degree: Research Master's (soon PhD)
- University: Mila and University of Montreal
- Advisor: Prof. Aishwarya Agrawal
- Time: Current
- Major: Computer Science
Background
- Research Interests: Cultural understanding in AI systems, including recognizing, respecting, and adapting to different cultural perspectives.
- Field: Computer Science
- Introduction: Committed to creating inclusive, safe, and fair AI and addressing broader challenges such as pluralistic alignment.
Miscellany
- Personal Interests: Cooking, history, culture, and archaeology.
- Blog: Shares cooking experiences and personal insights.
- Reading: Currently reading 'Early Indians' by Tony Joseph and 'The Golden Road' by William Dalrymple.