2025: Published 'Command A: An Enterprise-Ready Large Language Model'; 2024: Published 'Data-Efficient Policy Evaluation Through Behavior Policy Search' (with Josiah P. Hanna, Philip S. Thomas, Martha White, Peter Stone, Scott Niekum) and 'Information Directed Tree Search: Reasoning and Planning with Language Agents' (with HyunJi Nam, Allen Nie, Jonathan Lee, Emma Brunskill).
Research Experience
Currently working on the post-training team at Cohere. Previously, worked as a postdoc for Prof. Emma Brunskill at Stanford University.
Education
PhD from the University of Massachusetts, advised by Prof. Philip Thomas.
Background
Research Interests: Reinforcement learning and large language models. Previously worked as a postdoc at Stanford University.