Scholar
Erik Jones
Google Scholar ID: _-CU2CsAAAAJ
UC Berkeley
Machine Learning
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
2,082
H-index
14
i10-index
14
Publications
18
Co-authors
16
list available
Contact
No contact links provided.
Publications
7 items
Abstractive Red-Teaming of Language Model Character
2026
Cited
1
Eliciting Harmful Capabilities by Fine-Tuning On Safeguarded Outputs
2026
Cited
3
Beyond Data Filtering: Knowledge Localization for Capability Removal in LLMs
2025
Cited
0
Poisoning Attacks on LLMs Require a Near-constant Number of Poison Samples
2025
Cited
0
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
2025
Cited
0
Forecasting Rare Language Model Behaviors
2025
Cited
0
How Do Large Language Monkeys Get Their Power (Laws)?
2025
Cited
0
Resume (English only)
Co-authors
16 total
Jacob Steinhardt
Stanford University
Andrew Ng
Stanford University
Pranav Rajpurkar, PhD
Associate Professor of Biomedical Informatics, Harvard Medical School
Hamid Palangi
Google and University of Washington
Aditi Raghunathan
Assistant professor, Carnegie Mellon University
Anca D Dragan
Assistant Professor at UC Berkeley // Director, AI Safety and Alignment, Google DeepMind
Percy Liang
Associate Professor of Computer Science, Stanford University
Ece Kamar
Microsoft Research
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up