Scholar
Richard Ren
Google Scholar ID: o-Vl80UAAAAJ
University of Pennsylvania
AI safety
evaluations
adversarial robustness
Citations & Impact
All-time
Citations: 982
H-index: 9
i10-index: 8
Publications: 11
Co-authors: 0
Publications
5 items
The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems. 2025. Cited: 0
Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs. 2025. Cited: 0
Humanity's Last Exam. 2025. Cited: 0
Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress? arXiv.org, 2024. Cited: 5
Representation Engineering: A Top-Down Approach to AI Transparency. arXiv.org, 2023. Cited: 298