Scholar
Amirhossein Kazemnejad
Google Scholar ID: b4qOuDYAAAAJ
Mila
Post-Training
RLHF
Reasoning
Generalization
Transformer Architecture
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
515
H-index
7
i10-index
6
Publications
13
Co-authors
11
list available
Contact
No contact links provided.
Publications
5 items
The Markovian Thinker
2025
Cited
0
The Promise of RL for Autoregressive Image Editing
2025
Cited
0
AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories
2025
Cited
0
DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning
2025
Cited
0
VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment
arXiv.org · 2024
Cited
39
Resume (English only)
Co-authors
11 total
Siva Reddy
McGill University, Mila Quebec AI Institute
Alessandro Sordoni
Microsoft Research
Aaron Courville
Professor, DIRO, Université de Montréal, Mila, Cifar CAI chair
Nicolas Le Roux
McGill, UdeM
Payel Das
Manager and Principal Research Staff Member, AI research, IBM Watson, NY
Dieuwke Hupkes
Meta
Mahdieh Soleymani Baghshah
Associate Professor, Computer Engineering Department, Sharif University of Technology
Koustuv Sinha
Research Scientist, Meta AI (Fundamental AI Research), McGill University (MSc, PhD)
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up