Published papers such as 'Rubrics as Rewards' (arXiv, 2025), 'QGFN' (NeurIPS 2024), 'Deep Conservative RL for Ventilation' (AAAI 2023); participated in workshops like DGFN at NeurIPS 2023 WS and Replay Buffers for Mode Discovery at ICML 2023 WS; project 'Browser Agents Jailbreaks' accepted to ICLR 2025.
Research Experience
Was a researcher at Valence Labs – Recursion, developing QGFN by combining GFlowNets with action-value guidance for diverse solution discovery in drug discovery; was a graduate researcher at McGill University and Mila; currently a Machine Learning Research Engineer at Scale AI, building large-scale reasoning datasets and training pipelines.
Education
Started M.Sc. at Mila/McGill in May 2022, advised by Doina Precup & Emmanuel Bengio; focused on GFlowNets + RL.
Background
Machine Learning Research Engineer, focusing on building reasoning datasets, training pipelines, and safety evaluation frameworks for LLM-based browser agents. Collaborates with customers and research groups to deliver applied ML systems.