Mudit Verma
Scholar

Mudit Verma

Google Scholar ID: 8TtypKwAAAAJ
Google
AI AgentsRLHFHuman AI Interaction
Citations & Impact
All-time
Citations
1,262
 
H-index
12
 
i10-index
15
 
Publications
20
 
Co-authors
10
list available
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
  • Ph.D. dissertation: 'Guidance Priors to Reduce Human Feedback Burden in Sequential Decision Making' (2024).
  • Paper 'Hindsight PRIORs for Reward Learning from Human Preferences' accepted at ICLR 2024.
  • Work 'Symbol Guided Hindsight Priors for Reward Learning from Human Preferences' presented at IROS RLCONFORM and NeurIPS HILL 2022.
  • Awarded Best Intern Project at SSIR in 2018; finalist in 2017.
  • Gold Medalist during undergraduate studies with a CGPA of 9.51/10.0.
Research Experience
  • Sept. 2024 – Present: Research Scientist at Google LLC, Gemini/Bard group, Mountain View, CA, USA.
  • Summer 2023: Machine Learning Research Intern at Apple Inc., worked on reward learning from human preferences, advised by Rin Metcalf Susa and Barry Theobald.
  • Summer 2022: Machine Learning Research Intern at Apple Inc., conducted preference-based reinforcement learning research, advised by Rin Metcalf Susa and Barry Theobald.
  • Summer 2021: Deep Learning Software Engineering Intern at Intel Corporation, analyzed and optimized float32 ResNet50 on Intel IceLake, advised by Wei Wang.
  • Summer 2018: Software Engineering Intern at Samsung Semiconductor India Research, developed DRAM Bank Simulator and novel redundancy analysis algorithms, advised by Atishay Kumar.
  • Summer 2017: Software Engineering Intern at Samsung Semiconductor India Research, built SSD simulator and LSTM-based algorithm STRASDAC to reduce write-wear, advised by Sandeep Sammatshetti.