Jesse Mu

Google Scholar ID: djLcGEQAAAAJ
Anthropic
Natural Language Processing, Machine Learning
Citations & Impact (all-time)
Citations: 2,650
H-index: 15
i10-index: 16
Publications: 20
Co-authors: 39
Contact
No contact links provided.
Publications (2 items)
Forecasting Rare Language Model Behaviors (2025). Cited: 0
Constitutional Classifiers: Defending against Universal Jailbreaks across Thousands of Hours of Red Teaming (2025). Cited: 0
Co-authors (39 total)
Noah D. Goodman, Stanford University
Eric Zelikman, Stanford University
Concha Bielza, Professor of Statistics and Operations Research, Technical University of Madrid / ELLIS Fellow
Pedro Larrañaga, Professor of Artificial Intelligence, Universidad Politécnica de Madrid
Xiang Lisa Li, Stanford University
