International Conference on Learning Representations · 2023
Cited
178
Resume (English only)
Academic Achievements
No specific academic achievements mentioned, such as publications, awards, or patents.
Research Experience
Working on finetuning and AI alignment at Anthropic; Formerly a research scientist on the policy team at OpenAI, focusing on AI safety via debate and human baselines.
Education
PhD in Philosophy from NYU with a thesis on infinite ethics; BPhil in Philosophy from the University of Oxford; Undergraduate degree in Philosophy from the University of Dundee, starting out as a fine art and philosophy student at the Duncan of Jordanstone art school.
Background
Philosopher working on finetuning and AI alignment at Anthropic. Her team trains models to be more honest and to have good character traits, and works on developing new finetuning techniques so that their interventions can scale to more capable models. Previously, she worked as a research scientist on the policy team at OpenAI, where she focused on AI safety via debate and human baselines for AI performance.
Miscellany
Member of Giving What We Can, pledged to donate at least 10% of her lifetime income to charity, aiming to make that more than 50%, primarily donating to global poverty charities.