Scholar

Joshua Engels

Google Scholar ID: yVPnVK8AAAAJ

Google Deepmind

Mechanistic InterpretabilityAI Safety

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

295

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailjosh.adam.engels@gmail.com TwitterOpen ↗GitHubOpen ↗

Publications

14 items

How Transparent is DiffusionGemma?

2026

Cited

Training on Documents About Monitoring Leads to CoT Obfuscation

2026

Cited

Building Production-Ready Probes For Gemini

2026

Cited

CleANN: Efficient Full Dynamism in Graph-based Approximate Nearest Neighbor Search

2025

Cited

Simple Mechanistic Explanations for Out-Of-Context Reasoning

2025

Cited

Dense SAE Latents Are Features, Not Bugs

2025

Cited

Scaling Laws For Scalable Oversight

2025

Cited

Are Sparse Autoencoders Useful? A Case Study in Sparse Probing

2025

Cited

Resume (English only)

Research Experience

I am a research scientist at Google DeepMind, focusing on language model interpretability.

Education

I am on leave from my PhD at MIT, where I worked with Max Tegmark on mechanistic interpretability and AI safety.

Background

I am currently a research scientist at Google DeepMind, where I work on the language model interpretability team. I am primarily motivated by reducing catastrophic risks from future powerful AI systems, and I believe that this is likely the most important problem in the world.

Co-authors

9 total