Scholar

Richard Ren

Google Scholar ID: o-Vl80UAAAAJ

University of Pennsylvania

AI safety, evaluations, adversarial robustness

Citations & Impact

All-time

Citations: 982

H-index: 9

i10-index: 8

Publications: 11

Co-authors: 0

Publications

5 items

The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems
2025 · Cited: 0

Utility Engineering: Analyzing and Controlling Emergent Value Systems in AIs
2025 · Cited: 0

Humanity's Last Exam
2025 · Cited: 0

Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
arXiv.org · 2024 · Cited: 5

Representation Engineering: A Top-Down Approach to AI Transparency
arXiv.org · 2023 · Cited: 298

Co-authors

0 total