- Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas (with Justin Whitehouse, Kimberly Truong, Kenneth Holstein, Zhiwei Steven Wu)
- Validating LLM-as-a-Judge Systems under Rating Indeterminacy (with Solon Barocas, Kenneth Holstein, Hanna Wallach, Zhiwei Steven Wu, Alexandra Chouldechova)
- Bridging Prediction and Intervention Problems in Social Systems (with Lydia T. Liu, Inioluwa Deborah Raji, Angela Zhou, Jessica Hullman, Daniel Malinsky, et al.)
- Measurement as Bricolage: How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Awards/Funding: NSF Graduate Research Fellowship, Center for Advancing Safety of Machine Intelligence, National Institute for Standards and Technology (NIST).
Research Experience
Research projects involve the development of tools and techniques for evaluating AI systems, particularly in the context of human-AI interaction within a sociotechnical framework.
Education
Pursuing a Ph.D. at Carnegie Mellon University, advised by Ken Holstein and Steven Wu.
Background
A Ph.D. student in the Human–Computer Interaction Institute (HCII) within the School of Computer Science at Carnegie Mellon University. Research interests include developing tools for measuring the capabilities, risks, and limitations of AI systems; studying statistical approaches for evaluating AI systems themselves as well as frameworks for understanding the broader sociotechnical context in which humans operate and interact with AI systems. His work bridges ideas from ML, Statistics, Human–Computer Interaction, and the Quantitative Social Sciences.