Received the ICML Oral Award, the ICASSP Oral Award, and the QBIN Best Flash Talk Award for his research; one of his recent articles was featured in Forbes. Several publications have been accepted or released as preprints, including 'Easing Optimization Paths: A Circuit Perspective' (accepted at ICASSP 2025), 'Clustering Heads' (preprint), and 'Large Language Models as Markov Chains' (also featured in Forbes).
Research Experience
Currently a Ph.D. student at Huawei Noah’s Ark Lab & Inria in Paris; has presented research at leading institutions such as EPFL, ENS Ulm, and Criteo; and has contributed to open-source libraries.
Education
Graduated from École des Ponts ParisTech in 2023 and holds a master’s degree in Mathematics, Vision, and Machine Learning (MVA) from ENS Paris-Saclay. Supervised by Romain Tavenard, Laetitia Chapel, and Ievgen Redko.
Background
Interested in improving the core understanding of Transformers, particularly large language models; out-of-distribution generalization; Transformer training and fine-tuning; Vision Transformers; and time-series forecasting.
Miscellany
Enjoys working both in small groups of collaborators and as part of larger teams; maintains a research blog named logB.