Scholar
Alex Oesterling
Google Scholar ID: N7KLEsMAAAAJ
PhD Candidate, Harvard University
Information Theory
Interpretability
Machine Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
128
H-index
6
i10-index
4
Publications
14
Co-authors
6
list available
Contact
No contact links provided.
Publications
3 items
Understanding Annotator Safety Policy with Interpretability
2026
Cited
0
Inference-Time Reward Hacking in Large Language Models
2025
Cited
0
Multi-Group Proportional Representation for Text-to-Image Models
2025
Cited
0
Resume (English only)
Co-authors
6 total
Flavio du Pin Calmon
Harvard University
Himabindu Lakkaraju
Assistant Professor, Harvard University; Senior Staff Research Scientist, Google.
Usha Bhalla
Ph.D. Student, Harvard University
Suraj Srinivas
Research Scientist at Bosch
Cynthia Rudin
Professor of Computer Science, ECE, Statistics, and Biostatistics & Bioinformatics, Duke University
Jiaqi W. Ma
Assistant Professor, University of Illinois Urbana-Champaign
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up