Noah Y. Siegel

Google Scholar ID: l2E0LR4AAAAJ
Google DeepMind
Research interests: AI Alignment, Large Language Models, Scalable Oversight, Reinforcement Learning, Robotics
Citations & Impact (all-time)
Citations: 2,326
H-index: 16
i10-index: 20
Publications: 20
Co-authors: 61
Publications
4 items
A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior (2026). Cited: 0
LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring (2025). Cited: 0
Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts (2025). Cited: 0
Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance (2025). Cited: 0
Co-authors
61 total
Nicolas Heess, DeepMind
Roland Hafner, DeepMind
Martin Riedmiller, DeepMind
Abbas Abdolmaleki, DeepMind
Michael Neunert, Google DeepMind
Thomas Lampe, DeepMind
Markus Wulfmeier, Google DeepMind
Josh Merel, Fauna
