2024: 'Fairness Incentives in Response to Unfair Dynamic Pricing', presented at ESIF Economics and AI+ML Meeting
2023: 'Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning', presented at CoLLAs 2023 and ICLR 2022 Workshop
2023: 'DEUP: Direct Epistemic Uncertainty Prediction', published in Transactions on Machine Learning Research (TMLR), co-authored with Yoshua Bengio
2021: 'Continuous Coordination As a Realistic Scenario for Lifelong Learning', presented at ICML 2021 (NERL Workshop Spotlight) and ICLR 2021; proposed the Lifelong Hanabi benchmark
2020: 'The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning', published at NeurIPS; introduced the LoCA regret metric
Contributed to 'Algorithmic Analysis and Improvements in Multi-Agent Reinforcement Learning in Partially Observable Setting', introducing Multi Strategy LOLA
Background
Final-year PhD candidate at Mila and the University of Montreal, supervised by Sarath Chandar
Research focuses on studying fundamental principles of human intelligence by building agents that adapt to new situations and collaborate with diverse partners, aiming to develop cooperative intelligence
Specializes in Cooperative Multi-agent, Offline, and Model-based Reinforcement Learning, and Lifelong Learning
Has closely collaborated with Yoshua Bengio and Aditya Mahajan