Sapana Chaudhary

Google Scholar ID: dsb5VjkAAAAJ
AWS AI
Reinforcement Learning, Post-Training, Online Optimization
Citations & Impact (all-time)
  • Citations: 143
  • H-index: 7
  • i10-index: 4
  • Publications: 12
  • Co-authors: 2 (list available)
Resume (English only)
Academic Achievements
  • Paper on reasoning distillation out on arXiv! (Mar 2025)
  • AgentOccam accepted to ICLR 2025! (Jan 2025)
  • Paper on risk-averse RLHF accepted to NeurIPS 2024! (Sep 2024)
  • Paper on Pedagogical Alignment of LLMs accepted to EMNLP 2024! (Sep 2024)
  • Paper on safe distributed online convex optimization (OCO) accepted to TMLR! (Aug 2023)
  • Paper on meta-RL in sparse reward environments accepted to NeurIPS 2022! (Sep 2022)
  • Paper on safe online convex optimization accepted to AAAI 2022! (Dec 2021)
Background
  • Applied Scientist at Amazon, specializing in reinforcement learning (RL) post-training. Research interests include safety in online learning, RL, and reinforcement learning from human feedback (RLHF).
Miscellany
  • Hobbies include hiking, cooking, painting, and photography.