December 2024: Paper identifying (in)abilities of SAEs awarded Best Paper at the NeurIPS Foundation Model Interventions workshop!
September 2024: Paper on hidden capabilities in generative models accepted as a spotlight at NeurIPS, 2024.
August 2024: Preprint on a percolation model of emergent capabilities is now on arXiv.
June 2024: Paper on identifying how jailbreaks bypass safety mechanisms accepted at NeurIPS, 2024.
October 2023: Paper on analyzing in-context learning as a subjective randomness task accepted to ICLR, 2024.
October 2023: Our work on multiplicative emergence of compositional abilities was accepted to NeurIPS, 2023.
April 2023: Our work on a mechanistic understanding of loss landscapes was accepted to ICML, 2023.
January 2023: Our work analyzing loss landscape of self-supervised objectives was accepted to ICLR, 2023.
October 2021: Our work on dynamics of normalization layers was accepted to NeurIPS, 2021.
March 2021: Our work on theory of pruning was accepted as a spotlight at ICLR, 2021.
Research Experience
He is currently a research fellow at the CBS-NTT Program in Physics of Intelligence at Harvard University, where he leads the phenomenological theory team and frequently collaborates with Hidenori Tanaka, David Krueger, and Demba Ba. His undergraduate research focused primarily on embedded systems, such as energy-efficient machine vision systems.
Education
He graduated with a Bachelor's degree in ECE from the Indian Institute of Technology (IIT) Roorkee in 2019, and completed his PhD co-affiliated with EECS at the University of Michigan and CBS at Harvard, advised by Robert Dick and Hidenori Tanaka.
Background
His research interests include designing (faithful) abstractions of phenomena relevant to controlling or aligning neural networks, and better understanding the training dynamics of neural networks, especially from a statistical physics perspective.