Scholar

Amirhossein Kazemnejad

Google Scholar ID: b4qOuDYAAAAJ

Mila

Post-TrainingRLHFReasoningGeneralizationTransformer Architecture

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

515

H-index

7

i10-index

6

Publications

13

Co-authors

11

list available

Contact

No contact links provided.

Publications

5 items

The Markovian Thinker

2025

Cited

0

The Promise of RL for Autoregressive Image Editing

2025

Cited

0

AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

2025

Cited

0

DeepSeek-R1 Thoughtology: Let's think about LLM Reasoning

2025

Cited

0

VinePPO: Unlocking RL Potential For LLM Reasoning Through Refined Credit Assignment

arXiv.org · 2024

Cited

39

Resume (English only)

Co-authors

11 total

McGill University, Mila Quebec AI Institute

Alessandro Sordoni

Microsoft Research

Aaron Courville

Professor, DIRO, Université de Montréal, Mila, Cifar CAI chair

Nicolas Le Roux

Manager and Principal Research Staff Member, AI research, IBM Watson, NY

Mahdieh Soleymani Baghshah

Associate Professor, Computer Engineering Department, Sharif University of Technology

Research Scientist, Meta AI (Fundamental AI Research), McGill University (MSc, PhD)