Published multiple papers including 'Automated Hypothesis Validation with Agentic Sequential Falsifications' (ICML, 2025), 'BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery' (NeurIPS Workshop on Scaling Environments for Agents, 2025), 'CriticAL: Model Criticism Automation with Language Models' (NeurIPS Statistical Foundations of LLMs and Foundation Models Workshop, 2024), and more.
Research Experience
Has conducted research at Microsoft Research and Two Sigma; currently focuses on enhancing the reasoning capabilities of large language models, previously worked on various topics in statistical machine learning such as Sequential Monte Carlo, variational inference, Gaussian processes, and point processes.
Education
Undergraduate degree from Princeton (summa cum laude, Phi Beta Kappa), advised by Tom Griffiths and Ryan Adams; PhD student at Stanford, advised by Noah Goodman and Emily Fox.
Background
PhD student in Computer Science, with research interests in enhancing the reasoning capabilities of large language models and statistical machine learning.
Miscellany
Website design and source code from Jon Barron's website.