Published papers in top-tier conferences and journals, including 'Copilot Arena: A Platform for Code LLM Evaluation in the Wild', 'Specialized Foundation Models Struggle to Beat Supervised Baselines', and 'ScribeAgent: Towards Specialized Web Agents Using Production-Scale Workflow Data'.
Research Experience
Published papers at international conferences such as ICML, ICLR, and CHI. Involved in projects including 'This Time is Different: An Observability Perspective on Time Series Foundation Models'.
Background
Associate Professor in the Machine Learning Department at CMU and Chief Scientist at Datadog. Current research interests include AI for science, human-AI interaction, and the development of specialized models and agents.