Amanda Askell

Google Scholar ID: NYOJzM4AAAAJ

Anthropic

artificial intelligence, machine learning, philosophy, ethics, decision theory

Citations & Impact

All-time

Citations

145,756

Publications

2 items

2025

International Conference on Learning Representations · 2023 · Cited: 178

Resume (English only)

Academic Achievements

No specific academic achievements, such as awards or patents, are mentioned.

Research Experience

Works on finetuning and AI alignment at Anthropic. Formerly a research scientist on the policy team at OpenAI, where she focused on AI safety via debate and human baselines for AI performance.

Education

PhD in Philosophy from NYU, with a thesis on infinite ethics; BPhil in Philosophy from the University of Oxford; undergraduate degree in Philosophy from the University of Dundee, where she started out as a fine art and philosophy student at the Duncan of Jordanstone art school.

Background

Philosopher working on finetuning and AI alignment at Anthropic. Her team trains models to be more honest and to have good character traits, and works on developing new finetuning techniques so that their interventions can scale to more capable models. Previously, she worked as a research scientist on the policy team at OpenAI, where she focused on AI safety via debate and human baselines for AI performance.

Miscellany

Member of Giving What We Can. She has pledged to donate at least 10% of her lifetime income to charity and aims to raise that to more than 50%, donating primarily to global poverty charities.

Co-authors

0 total (list not available)