Amanda Askell
Scholar

Amanda Askell

Google Scholar ID: NYOJzM4AAAAJ
Anthropic
artificial intelligencemachine learningphilosophyethicsdecision theory
Citations & Impact
All-time
Citations
145,756
 
H-index
48
 
i10-index
53
 
Publications
20
 
Co-authors
0
 
Resume (English only)
Academic Achievements
  • No specific academic achievements mentioned, such as publications, awards, or patents.
Research Experience
  • Working on finetuning and AI alignment at Anthropic; Formerly a research scientist on the policy team at OpenAI, focusing on AI safety via debate and human baselines.
Education
  • PhD in Philosophy from NYU with a thesis on infinite ethics; BPhil in Philosophy from the University of Oxford; Undergraduate degree in Philosophy from the University of Dundee, starting out as a fine art and philosophy student at the Duncan of Jordanstone art school.
Background
  • Philosopher working on finetuning and AI alignment at Anthropic. Her team trains models to be more honest and to have good character traits, and works on developing new finetuning techniques so that their interventions can scale to more capable models. Previously, she worked as a research scientist on the policy team at OpenAI, where she focused on AI safety via debate and human baselines for AI performance.
Miscellany
  • Member of Giving What We Can, pledged to donate at least 10% of her lifetime income to charity, aiming to make that more than 50%, primarily donating to global poverty charities.
Co-authors
0 total
Co-authors: 0 (list not available)