Agora | Research Hub

Citations & Impact

All-time

Citations

2,701

H-index

21

i10-index

28

Publications

20

Co-authors

7

list available

Contact

Emailpaul.rottger@unibocconi.it TwitterOpen ↗GitHubOpen ↗

Publications

15 items

Measuring and Mitigating Persona Distortions from AI Writing Assistance

2026

Cited

0

The Enforcement and Feasibility of Hate Speech Moderation on Twitter

2026

Cited

0

Diffusion Language Models Are Natively Length-Aware

2026

Cited

0

SimBench: Benchmarking the Ability of Large Language Models to Simulate Human Behaviors

2025

Cited

0

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

2025

Cited

0

No for Some, Yes for Others: Persona Prompts and Other Sources of False Refusal in Language Models

2025

Cited

0

Principled Personas: Defining and Measuring the Intended Effects of Persona Prompting on Task Performance

2025

Cited

0

The Pluralistic Moral Gap: Understanding Judgment and Value Differences between Humans and Large Language Models

2025

Cited

0

Resume (English only)

Academic Achievements

Jul 2025: HateDay project won Outstanding Paper Award at ACL 2025
Feb 2025: Awarded UK AISI Grant to study distortive effects of AI writing assistance
Dec 2024: PRISM alignment dataset won Best Paper (D&B) at NeurIPS 2024
Aug 2024: Work on LLM values and opinions won Outstanding Paper at ACL 2024
Jun 2024: Presented XSTest and organized WOAH workshop at NAACL 2024
Jan 2024: Launched SafetyPrompts.com, a living catalogue of open datasets for LLM safety
Jul 2023: Organized Sexism Detection Task won Best Paper at SemEval 2023
May 2023: Led HateCheck project that won Stanford AI Audit Challenge

Co-authors

7 total