Scholar
Debajoy Mukherjee
Google Scholar ID: Tyhed3QAAAAJ
PhD Computer Science
Reinforcement learning
large language models
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
38
H-index
2
i10-index
1
Publications
5
Co-authors
1
list available
Contact
No contact links provided.
Publications
2 items
MAVIS: Multi-Objective Alignment via Value-Guided Inference-Time Search
2025
Cited
0
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback
International Conference on Learning Representations · 2024
Cited
2
Resume (English only)
Co-authors
1 total
Ujwal Dinesha
Doctoral Candidate, Texas A&M University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up