- Treeffuser: Probabilistic Predictions via Conditional Diffusions with Gradient-Boosted Trees. NeurIPS 2024
- Hypothesis testing the circuit hypothesis in LLMs. NeurIPS 2024
Research Experience
- Apple Health AI, Machine learning research scientist (part-time), Feb 2024 - Dec 2024: Worked on foundation models of time series from wearables to understand user health and fitness levels.
- Apple Health AI, Machine learning research scientist (intern), Jan 2022 - Aug 2022: Estimated the causal impact of the Watch's notifications on user behavior with novel causal estimators.
- Apple Health AI, Machine learning research scientist (intern), May 2021 - Aug 2021: Identified and designed new health biomarkers from the sensor data of Apple devices.
- Palantir Technologies, Forward deployed software engineer (intern), Jun 2020 - Aug 2020: Scoped, prototyped, and deployed data-driven algorithms to reduce costs for a US healthcare insurer.
- Yosef Lab, University of California, Berkeley, Research assistant, Apr 2019 - Aug 2019: Developed scvi-tools, an open-source Python package for single-cell data analysis (1.3k+ GitHub stars); designed generative models to impute unobserved genes in spatial genomics using sc-RNA data.
- Akwa Group, Machine learning consultant (alongside M.S.), Sep 2018 - May 2019: Designed signals and models to predict the performance of new gas stations, surpassing human experts by 25%.
- IMC Trading, Software engineer (intern), Jun 2018 - Sep 2018: Distributed model training pipelines on a cluster for faster overnight training (HFT, futures).
- Bernardaud, Operations research consultant (alongside B.S.), Feb 2018 - Jun 2018: Designed algorithms to find optimal production processes under factory constraints; created a user-friendly full-stack website connecting these algorithms to the databases.
- Ministry of Defense, Junior data scientist (intern), Nov 2016 - Apr 2017: Developed graph-mining and NLP models for social network analysis to produce intelligence.
Education
- Columbia University, Ph.D. in Computer Science, Jan 2021 - May 2025
- Columbia University, M.S. in Computer Science, Aug 2019 - Dec 2020
- École Polytechnique, M.S. in Applied Mathematics and Computer Science, Aug 2018 - Apr 2019
- École Polytechnique, B.S. in Applied Mathematics and Computer Science, Aug 2016 - Jun 2018
- École Spéciale Militaire de Saint-Cyr, Accelerated track to the rank of army officer (second lieutenant), Aug 2016 - Nov 2016
- Lycée Privé Sainte-Geneviève, Prépa in pure mathematics, physics, and computer science, Aug 2014 - Jul 2016
Background
- Research Interests: Probabilistic and generative models, focusing on scaling them to large datasets and interpreting what they learn.
- Specialization: Computer Science.
- Brief Introduction: Ph.D. student in computer science at Columbia University, advised by Prof. David Blei and Prof. Elham Azizi. Research orientation towards practical impact, prioritizing simplicity and usability.