Matthieu Zimmer

Google Scholar ID: 6z-GF2sAAAAJ

RL Research Scientist @ Huawei Noah’s Ark Lab

artificial intelligence: learning, developmental learning, reinforcement learning, neural networks

Citations & Impact

All-time

Citations: 811

H-index: 12

i10-index: 14

Publications: 20

Co-authors: 4 (list available)

Publications

11 items

Risk-Controlled Lean-as-Judge for Natural-Language Mathematical Reasoning

2026

Cited

0

The Model Knows, the Decoder Finds: Future Value Guided Particle Power Sampling

2026

Cited

0

The $\mathbf{Y}$-Combinator for LLMs: Solving Long-Context Rot with $\lambda$-Calculus

2026

Cited

0

Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers

2026

Cited

0

Multi-Task GRPO: Reliable LLM Reasoning Across Tasks

2026

Cited

0

Scalable Power Sampling: Unlocking Efficient, Training-Free Reasoning for LLMs via Distribution Sharpening

2026

Cited

2

Rethinking Large Language Model Distillation: A Constrained Markov Decision Process Perspective

2025

Cited

0

Tree-OPO: Off-policy Monte Carlo Tree-Guided Advantage Optimization for Multistep Reasoning

2025

Cited

0

Co-authors

4 total

Duke Kunshan University

Lipscomb University

Stephane Doncieux

Professor of Computer Science, ISIR, Sorbonne University-CNRS