Scholar
Paul Röttger
Google Scholar ID: 7rpmd9cAAAAJ
Postdoctoral Researcher, Bocconi University
Large Language Models
Safety and Societal Impacts of AI Systems
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
2,701
H-index
21
i10-index
28
Publications
20
Co-authors
7
list available
Contact
Email
paul.rottger@unibocconi.it
Twitter
Open ↗
GitHub
Open ↗
Publications
13 items
Diffusion Language Models Are Natively Length-Aware
2026
Cited
0
SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors
2025
Cited
0
Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation
2025
Cited
0
No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models
2025
Cited
0
Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance
2025
Cited
0
The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models
2025
Cited
0
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
2025
Cited
0
AILuminate: Introducing v1.0 of the AI Risk and Reliability Benchmark from MLCommons
arXiv.org · 2025
Cited
0
Load more
Resume (English only)
Academic Achievements
Jul 2025: HateDay project won Outstanding Paper Award at ACL 2025
Feb 2025: Awarded UK AISI Grant to study distortive effects of AI writing assistance
Dec 2024: PRISM alignment dataset won Best Paper (D&B) at NeurIPS 2024
Aug 2024: Work on LLM values and opinions won Outstanding Paper at ACL 2024
Jun 2024: Presented XSTest and organized WOAH workshop at NAACL 2024
Jan 2024: Launched SafetyPrompts.com, a living catalogue of open datasets for LLM safety
Jul 2023: Organized Sexism Detection Task won Best Paper at SemEval 2023
May 2023: Led HateCheck project that won Stanford AI Audit Challenge
Co-authors
7 total
Bertie Vidgen
Oxford, Mercor
Dirk Hovy
Bocconi University
Hannah Rose Kirk
University of Oxford
Janet B. Pierrehumbert
Prof. of Language Modelling, Univ. of Oxford Dept. of Engineering Science
Helen Margetts
Professor of Society and the Internet, University of Oxford
Giuseppe Attanasio
Postdoctoral Researcher, Instituto de Telecomunicações
Debora Nozza
Assistant Professor, Bocconi University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up