I am a research scientist at Google DeepMind, focusing on language model interpretability.
Education
I am on leave from my PhD at MIT, where I worked with Max Tegmark on mechanistic interpretability and AI safety.
Background
I am currently a research scientist at Google DeepMind, where I work on the language model interpretability team. I am primarily motivated by reducing catastrophic risks from future powerful AI systems, and I believe that this is likely the most important problem in the world.