Papers published: 'To Infinity and Beyond: Tool-Use Unlocks Length Generalization in State Space Models' (with Eran Malach, Omid Saremi, Sinead Williamson, Arwen Bradley, Emmanuel Abbe, Josh Susskind, Etai Littwin); 'RL for Reasoning by Adaptively Revealing Rationales' (with Mohammad Hossein Amani, Nicolas Mario Baldwin, Samy Bengio, Mehrdad Farajtabar, Emmanuel Abbe, Robert West).
Research Experience
Currently a Research Scientist at Apple Machine Learning Research. During his PhD, he was supported by the Apple Scholars in AI/ML PhD fellowship and interned at Apple under Samy Bengio. Competed in math Olympiads, including the International Mathematical Olympiad (IMO), during high school.
Education
PhD in Computer Science from EPFL, advised by Emmanuel Abbe; BSc in Computer Engineering from Sharif University of Technology, Iran.
Background
Research interests include understanding and improving the reasoning capabilities of neural networks and large language models, exploring topics such as length generalization, chain-of-thought methodologies, curriculum learning, and reinforcement learning. Also interested in the theoretical aspects of machine learning.
Miscellany
Personal interests include participating in math Olympiads.