A Research Scientist at Meta and an Adjunct Professor at the University of Pittsburgh.
Education
Received a Ph.D. in Operations Research and Financial Engineering from Princeton University.
Background
Research interests include reinforcement learning, RLHF (Reinforcement Learning from Human Feedback), Bayesian optimization, and adaptive experimentation. More broadly, he is interested in sequential decision-making from the perspectives of both operations research and AI. His recent work on RLHF for generative ads has been integrated into Facebook and Instagram's advertising systems.