Scholar
Arnab Sen Sharma
Google Scholar ID: 8ihSLrwAAAAJ
Ph.D. student at Northeastern University
Machine Learning
Natural Language Processing
Interpretable AI
Citations & Impact (all-time)
Citations: 1,283
H-index: 11
i10-index: 11
Publications: 14
Co-authors: 14
Publications (6 items)
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers · 2025 · Cited: 0
LLMs Process Lists With General Filter Heads · 2025 · Cited: 0
Language Models use Lookbacks to Track Beliefs · 2025 · Cited: 0
Elucidating Mechanisms of Demographic Bias in LLMs for Healthcare · 2025 · Cited: 0
The Quest for the Right Mediator: A History, Survey, and Theoretical Grounding of Causal Interpretability · arXiv.org · 2024 · Cited: 34
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals · 2024 · Cited: 0
Co-authors (14 total)
David Bau · Assistant Professor at Northeastern University
Yonatan Belinkov · Technion
Co-author 3
Aaron Mueller · Boston University
Eric Todd · PhD Student at Northeastern University
Ruhul Amin, PhD · Computer and Information Sciences, Fordham University
Alex Andonian · PhD in EECS, MIT
Byron Wallace · Associate Professor, Northeastern University