Scholar
Ramé Alexandre
Google Scholar ID: 7znwivwAAAAJ
Google DeepMind
Deep Learning
Generalization
Alignment
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
4,016
H-index
19
i10-index
21
Publications
20
Co-authors
6
list available
Contact
Email
alexandre.rame.cl@gmail.com
CV
Open ↗
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
20 items
Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
PhD thesis awarded the 2024 SSFAM prize for best French PhD in Machine Learning
Contributed to Gemma 3 Technical Report (arXiv), introducing a novel post-training recipe that significantly improves math and chat capabilities
Proposed Diversity-Rewarded CFG Distillation (ICLR 2025), a fine-tuning method combining RL and weight averaging to enhance distillation
Introduced WARP (arXiv), leveraging weight-averaged policies to improve the KL-reward trade-off in RLHF
Proposed WARM (ICML 2024), a reward modeling strategy that merges multiple reward models for improved robustness and reduced reward hacking
PhD research (2023) analyzed how weight-averaged ensembling improves out-of-distribution generalization and alignment
Co-authored ICLR 2024 paper on hallucinations and explainability in large multimodal models
Co-authors
6 total
Matthieu Cord
Professor Sorbonne University / Scientific Director valeo.ai
Johan Ferret
Research Scientist, Google DeepMind
Arthur Douillard
Research Scientist, DeepMind
Olivier Bachem
Research Scientist, Google Brain
Guillaume Couairon
INRIA
Léonard Hussenot
Google DeepMind
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up