Scholar

Wes Gurnee

Google Scholar ID: 5sxXSfwAAAAJ

Anthropic

Mechanistic Interpretability · AI Alignment · Optimization · Governance

Citations & Impact

All-time

Citations

1,536

H-index

12

i10-index

13

Publications

18

Co-authors

14

Publications

5 items

A Shared Subcircuit Lets LLMs Count Down Across Tasks

2026

Cited

0

Emotion Concepts and their Function in a Large Language Model

2026

Cited

0

When Models Manipulate Manifolds: The Geometry of a Counting Task

arXiv.org · 2026

Cited

10

The Remarkable Robustness of LLMs: Stages of Inference?

arXiv.org · 2024

Cited

48

Not All Language Model Features Are One-Dimensionally Linear

2024

Cited

40

Co-authors

14 total

Mechanistic Interpretability Team Lead, Google DeepMind

Dimitris Bertsimas

Boeing Professor of Operations Research, MIT

Professor of Physics, MIT

Northeastern University

Nina Panickssery

Carnegie Mellon University

Google DeepMind