Publications
- Auditing Language Model Unlearning via Information Decomposition
- Differentially Private Steering for Large Language Model Alignment
- Socratic Reasoning Improves Positive Text Rewriting
These papers are under review or have been presented at venues such as ICLR, TPDP, and the CLPsych Workshop.
Research Experience
PhD student in the ELLIS PhD program, researching the privacy and safety of large language models.
Education
ELLIS PhD student, supervised by Prof. Iryna Gurevych (UKP Lab, TU Darmstadt) and co-supervised by Prof. Amartya Sanyal (University of Copenhagen); previously spent two years at IIIT Hyderabad, India, with Prof. Ponnurangam Kumaraguru.
Background
Research interests: privacy and safety of large language models; recent work focuses on developing methods for protecting sensitive information when working with billion-parameter models.
Miscellany
Outside of research, enjoys cricket and traveling.