Scholar
Marius Mosbach
Google Scholar ID: O7RwHEkAAAAJ
Mila - Quebec AI Institute, McGill University
NLP
Interpretability
Machine learning
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,698
H-index
14
i10-index
19
Publications
20
Co-authors
21
list available
Contact
Email
marius.mosbach@mila.quebec
GitHub
Open ↗
Publications
12 items
The Illusion of Superposition? A Principled Analysis of Latent Thinking in Language Models
2026
Cited
0
LLM2Vec-Gen: Generative Embeddings from Large Language Models
2026
Cited
0
Operationalising the Superficial Alignment Hypothesis via Task Complexity
2026
Cited
0
LatentLens: Revealing Highly Interpretable Visual Tokens in LLMs
2026
Cited
0
CLaS-Bench: A Cross-Lingual Alignment and Steering Benchmark
2026
Cited
0
Do Generalisation Results Generalise?
2025
Cited
0
Value Drifts: Tracing Value Alignment During LLM Post-Training
2025
Cited
0
Understanding the Influence of Synthetic Data for Text Embedders
2025
Cited
0
Load more
Resume (English only)
Academic Achievements
Awards: Best Paper Award at COLING 2022, Best Theme Paper Award at ACL 2023, Most Interesting Paper Award at BabyLM Challenge 2023
Papers: Impact of data frequency on LLM unlearning, Analysis of reasoning chains of DeepSeek-R1, etc.
Workshops: Actionable Interpretability workshop accepted by ICML 2025
Research Experience
Position: Postdoctoral Researcher
Work Experience: Formerly a postdoctoral researcher at the Language Science and Technology Department of Saarland University
Research Projects: Focused on the critical adaptation stage of LLMs to make them more specialized, safe, and aligned with specific requirements
Education
PhD: Department of Computer Science, Saarland University
Advisor: Not mentioned
Time: Not specifically mentioned
Background
Research Interests: Reliable, controllable, and trustworthy Natural Language Processing (NLP) systems, particularly Large Language Models (LLMs)
Field: Computer Science
Introduction: Currently a postdoctoral researcher at Mila - Quebec AI Institute and a postdoctoral fellow at McGill University in Montréal, Canada.
Miscellany
Personal Interests: CrossFit, playing soccer, occasional baking
Co-authors
21 total
Dietrich Klakow
Saarland University, Saarland Informatics Campus, PharmaScienceHub
Siva Reddy
McGill University, Mila Quebec AI Institute
Maksym Andriushchenko
ELLIS Institute Tübingen & Max Planck Institute for Intelligent Systems
Co-author 4
Jesujoba Oluwadara Alabi
Saarland University
David Ifeoluwa Adelani
McGill University and Mila - Quebec AI Institute and Canada CIFAR AI Chair
Parishad BehnamGhader
Student at McGill University / Mila -- Quebec AI Institute
Shauli Ravfogel
Faculty Fellow, NYU
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up