Scholar

Jean Mercat

Google Scholar ID: xmUDkvQAAAAJ

Research scientist at Toyota Research Institute

Neural networks

Citations & Impact

All-time

Citations

1,292

H-index

i10-index

Publications

Co-authors

Contact

Publications

3 items

2026

Cited

2026

Cited

Neural Information Processing Systems · 2024

Cited

Resume (English only)

Academic Achievements

A Careful Examination of Large Behavior Models for Multitask Dexterous Manipulation (Jul 7, 2025)
OpenThoughts: Data Recipes for Reasoning Models (Jun 4, 2025)
Should VLMs be Pre-trained with Image Data? (Mar 10, 2025)
DataComp-LM: In Search of the Next Generation of Training Sets for Language Models (Jun 17, 2024)
Linearizing Large Language Models (May 14, 2024)
Language Models Scale Reliably with Over-Training and on Downstream Tasks (Mar 14, 2024)

Research Experience

Senior research scientist at Toyota Research Institute, working on pre-training, uptraining, fine-tuning, experimentation, and research with Large Language Models, Vision Language Models, and Large Behavior Models. Attempts to understand and improve large models, their evaluation process, and their training data. Applies large models to robotic manipulation, pushing the boundary of open-ended embodied intelligence.

Education

PhD in Machine Learning from Paris Saclay University, L2S and Renault; MEng in Scientific Computing from ENSEIRB-MatMéca, Bordeaux, France.

Background

A senior machine learning research scientist specializing in transformers, large language models, vision language models, and large behavior models. Passionate about self-driving cars, robotics, and language processing. Emphasizes a careful scientific process, including thorough evaluation-driven experiments, aiming for a broad downstream impact of his research and continuous learning from awesome coworkers.

Co-authors

0 total

Co-authors: 0 (list not available)