Papers accepted at NeurIPS 2024, Findings of EMNLP 2024, COLM 2024, ICML 2024, ICLR 2024, and other top conferences. Projects include LLM routing, LoRA compression, and tinyBenchmarks.
Research Experience
Currently a Staff AI Scientist at the IFM MBZUAI Silicon Valley Lab, leading data mixing for LLM pre-training. Previously a research manager at the MIT-IBM Watson AI Lab, leading the Statistical Large Language Modeling group.
Education
PhD in Statistics from the University of Michigan, advised by Prof. Long Nguyen.
Background
Interested in a variety of LLM-related problems—pre- and post-training, data quality, reasoning, evaluation, routing, and efficient inference—and enjoys exploring statistical modeling approaches to solve them. Has also worked on OOD generalization, algorithmic fairness, optimal transport, federated learning, and Bayesian nonparametrics.
Miscellany
Continues to serve as Project Advisor for Break Through Tech AI, a role held since 2024.