Scholar

Mudit Verma

Google Scholar ID: 8TtypKwAAAAJ

Google

AI AgentsRLHFHuman AI Interaction

Citations & Impact

All-time

Citations

1,262

H-index

i10-index

Publications

Co-authors

list available

Contact

Publications

20 items

Browse publications on Google Scholar (top-right) ↗

Resume (English only)

Academic Achievements

Ph.D. dissertation: 'Guidance Priors to Reduce Human Feedback Burden in Sequential Decision Making' (2024).
Paper 'Hindsight PRIORs for Reward Learning from Human Preferences' accepted at ICLR 2024.
Work 'Symbol Guided Hindsight Priors for Reward Learning from Human Preferences' presented at IROS RLCONFORM and NeurIPS HILL 2022.
Awarded Best Intern Project at SSIR in 2018; finalist in 2017.
Gold Medalist during undergraduate studies with a CGPA of 9.51/10.0.

Research Experience

Sept. 2024 – Present: Research Scientist at Google LLC, Gemini/Bard group, Mountain View, CA, USA.
Summer 2023: Machine Learning Research Intern at Apple Inc., worked on reward learning from human preferences, advised by Rin Metcalf Susa and Barry Theobald.
Summer 2022: Machine Learning Research Intern at Apple Inc., conducted preference-based reinforcement learning research, advised by Rin Metcalf Susa and Barry Theobald.
Summer 2021: Deep Learning Software Engineering Intern at Intel Corporation, analyzed and optimized float32 ResNet50 on Intel IceLake, advised by Wei Wang.
Summer 2018: Software Engineering Intern at Samsung Semiconductor India Research, developed DRAM Bank Simulator and novel redundancy analysis algorithms, advised by Atishay Kumar.
Summer 2017: Software Engineering Intern at Samsung Semiconductor India Research, built SSD simulator and LSTM-based algorithm STRASDAC to reduce write-wear, advised by Sandeep Sammatshetti.

Co-authors

10 total