Scholar
Noah Y. Siegel
Google Scholar ID: l2E0LR4AAAAJ
Google DeepMind
AI Alignment
Large Language Models
Scalable Oversight
Reinforcement Learning
Robotics
Citations & Impact
All-time
Citations
2,326
H-index
16
i10-index
20
Publications
20
Co-authors
61
list available
Publications
4 items
A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior
2026
Cited
0
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring
2025
Cited
0
Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts
2025
Cited
0
Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance
2025
Cited
0
Co-authors
61 total
Nicolas Heess
DeepMind
Roland Hafner
DeepMind
Martin Riedmiller
DeepMind
Abbas Abdolmaleki
DeepMind
Michael Neunert
Google DeepMind
Thomas Lampe
DeepMind
Markus Wulfmeier
Google DeepMind
Josh Merel
Fauna