AgoraResearch hub
ExploreLibraryProfile
Account
Julian Minder
Scholar

Julian Minder

Google Scholar ID: mu1tLSoAAAAJ
EPFL/ETHZ
Mechanistic InterpretabilityNLPGraph Learning
Homepage↗Google Scholar↗
Citations & Impact
All-time
Citations
66
 
H-index
4
 
i10-index
3
 
Publications
8
 
Co-authors
23
list available
Contact
No contact links provided.
Publications
6 items
Activation Oracles: Training and Evaluating LLMs as General-Purpose Activation Explainers
2025
Cited
0
Believe It or Not: How Deeply do LLMs Believe Implanted Facts?
2025
Cited
0
Narrow Finetuning Leaves Clearly Readable Traces in Activation Differences
2025
Cited
0
The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?
2025
Cited
0
Robustly identifying concepts introduced during chat fine-tuning using crosscoders
2025
Cited
0
Controllable Context Sensitivity and the Knob Behind It
arXiv.org · 2024
Cited
0
Resume (English only)
Co-authors
23 total
Neel Nanda
Neel Nanda
Mechanistic Interpretability Team Lead, Google DeepMind
Clément Dumas
Clément Dumas
ENS Paris-Saclay
Katya Mirylenka
Katya Mirylenka
Zalando Switzerland
Co-author 4
Co-author 4
Co-author 5
Co-author 5
Co-author 6
Co-author 6
Bilal Chughtai
Bilal Chughtai
Google DeepMind
Niklas Stoehr
Niklas Stoehr
Google DeepMind, ETH Zurich

Welcome back

Sign in to Agora

Welcome back! Please sign in to continue.

Do not have an account?