Scholar

Luke Guerdan

Google Scholar ID: XPrjbvoAAAAJ

Carnegie Mellon University

MeasurementEvaluation ScienceMachine LearningResponsible AIHuman-Computer Interaction

Citations & Impact

All-time

Citations

261

H-index

i10-index

Publications

Co-authors

Contact

Publications

4 items

2025

Cited

2025

Cited

2025

Cited

2025

Cited

Resume (English only)

Academic Achievements

Publications:
- Doubly-Robust LLM-as-a-Judge: Externally Valid Estimation with Imperfect Personas (with Justin Whitehouse, Kimberly Truong, Kenneth Holstein, Zhiwei Steven Wu)
- Validating LLM-as-a-Judge Systems under Rating Indeterminacy (with Solon Barocas, Kenneth Holstein, Hanna Wallach, Zhiwei Steven Wu, Alexandra Chouldechova)
- Bridging Prediction and Intervention Problems in Social Systems (with Lydia T. Liu, Inioluwa Deborah Raji, Angela Zhou, Jessica Hullman, Daniel Malinsky, et al.)
- Measurement as Bricolage: How Data Scientists Construct Target Variables for Predictive Modeling Tasks
Awards/Funding: NSF Graduate Research Fellowship, Center for Advancing Safety of Machine Intelligence, National Institute for Standards and Technology (NIST).

Research Experience

Research projects involve the development of tools and techniques for evaluating AI systems, particularly in the context of human-AI interaction within a sociotechnical framework.

Education

Pursuing a Ph.D. at Carnegie Mellon University, advised by Ken Holstein and Steven Wu.

Background

A Ph.D. student in the Human–Computer Interaction Institute (HCII) within the School of Computer Science at Carnegie Mellon University. Research interests include developing tools for measuring the capabilities, risks, and limitations of AI systems; studying statistical approaches for evaluating AI systems themselves as well as frameworks for understanding the broader sociotechnical context in which humans operate and interact with AI systems. His work bridges ideas from ML, Statistics, Human–Computer Interaction, and the Quantitative Social Sciences.

Co-authors

0 total

Co-authors: 0 (list not available)