International Conference on Learning Representations · 2024
Cited
18
Resume (English only)
Academic Achievements
Paper on reasoning distillation out on arxiv! (Mar 2025)
AgentOccam accepted to ICLR 2025! (Jan 2025)
Paper on Risk Averse RLHF accepted to Neurips 2024! (Sep 2024)
Paper on Pedagogical Alignment of LLMs accepted to EMNLP 2024! (Sep 2024)
Paper on Safe distributed OCO accepted to TMLR! (Aug 2023)
Paper on meta-RL in sparse reward environments accepted to NeurIPS 2022! (Sep 2022)
Paper on Safe online convex optimization accepted to AAAI 2022! (Dec 2021)
Background
Applied Scientist at Amazon, specializing in reinforcement learning (RL) Post-training. Research interests include safety in online learning, RL, and reinforcement learning from human feedback (RLHF).
Miscellany
Hobbies include hiking, cooking, painting, and photography.