Scholar
Matthieu Zimmer
Google Scholar ID: 6z-GF2sAAAAJ
RL Research Scientist @ Huawei Noah’s Ark Lab
artificial intelligence : learning
developmental learning
reinforcement learning
neural networks
Citations & Impact
All-time
Citations
811
H-index
12
i10-index
14
Publications
20
Co-authors
4
list available
Contact
No contact links provided.
Publications
9 items
The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $λ$-Calculus
2026
Cited
0
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers
2026
Cited
0
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks
2026
Cited
0
Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening
2026
Cited
2
Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective
2025
Cited
0
Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning
2025
Cited
0
Bourbaki: Self-Generated and Goal-Conditioned MDPs for Theorem Proving
2025
Cited
0
Almost Surely Safe Alignment of Large Language Models at Inference-Time
2025
Cited
0
Co-authors
4 total
Paul Weng
Duke Kunshan University
Juan Rojas
Lipscomb University
Stephane Doncieux
Professor of Computer Science, ISIR, Sorbonne University-CNRS