Scholar

Noah Y. Siegel

Google Scholar ID: l2E0LR4AAAAJ

Google DeepMind

AI AlignmentLarge Language ModelsScalable OversightReinforcement LearningRobotics

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

2,326

H-index

16

i10-index

20

Publications

20

Co-authors

61

list available

Contact

No contact links provided.

Publications

4 items

A Positive Case for Faithfulness: LLM Self-Explanations Help Predict Model Behavior

2026

Cited

0

LLMs Can Covertly Sandbag on Capability Evaluations Against Chain-of-Thought Monitoring

2025

Cited

0

Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts

2025

Cited

0

Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance

2025

Cited

0

Resume (English only)

Co-authors

61 total

Martin Riedmiller

Abbas Abdolmaleki

Michael Neunert

Google DeepMind

Markus Wulfmeier

Google DeepMind