Scholar
Jesse Mu
Google Scholar ID: djLcGEQAAAAJ
Anthropic
Natural Language Processing
Machine Learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,650
H-index
15
i10-index
16
Publications
20
Co-authors
39
list available
Contact
No contact links provided.
Publications
2 items
Forecasting Rare Language Model Behaviors
2025
Cited
0
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming
2025
Cited
0
Resume (English only)
Co-authors
39 total
Noah D. Goodman
Stanford University
Eric Zelikman
Stanford University
Co-author 3
Concha Bielza
Professor of Statistics and Operations Research, Technical University of Madrid / ELLIS Fellow
Co-author 5
Pedro Larrañaga
Professor of Artificial Intelligence - Universidad Politécnica de Madrid
Co-author 7
Xiang Lisa Li
Stanford University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up