Scholar

Sujay Sanghavi

Google Scholar ID: O-DazBUAAAAJ

Professor, Electrical and Computer Engineering, University of Texas, Austin

Machine LearningOptimization and AlgorithmsNetworks

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

5,582

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailemail@email.com TwitterOpen ↗GitHubOpen ↗

Publications

19 items

Test-Time Speculation

2026

Cited

MURPHY: Multi-Turn GRPO for Self Correcting Code Generation

2025

Cited

Sculpting Latent Spaces With MMD: Disentanglement With Programmable Priors

2025

Cited

HiSpec: Hierarchical Speculative Decoding for LLMs

2025

Cited

Enabling Approximate Joint Sampling in Diffusion LMs

2025

Cited

SFT Doesn't Always Hurt General Capabilities: Revisiting Domain-Specific Fine-Tuning in LLMs

2025

Cited

Context-Free Synthetic Data Mitigates Forgetting

2025

Cited

Geometric Median Matching for Robust k-Subset Selection from Noisy Data

2025

Cited

Resume (English only)

Academic Achievements

ICML 2025 Papers:
- Upweighting Easy Samples in Fine-Tuning Mitigates Forgetting
- Learning Mixtures of Experts with EM: A Mirror Descent Perspective
- Retraining with Predicted Hard Labels Provably Increases Model Accuracy
- Geometric Median Matching for Robust k-Subset Selection from Noisy Data
ICLR 2025 Papers:
- Enhancing Language Model Agents using Diversity of Thoughts
- Infilling Score: A Pretraining Data Detection Algorithm for Large Language Models
Graduated PhD students: Rudrajit Das (headed to Google Research) and Anish Acharya (headed to AWS)
Completed MS degrees: Atula Tejaswi (continuing at UT for a PhD) and Vijay Lingam (headed to Amazon Q)
Invited talk in Workshop on Theoretical Perspectives on LLMs (March 2025)

Research Experience

Since 2019, he has been a Principal Research Scientist and then an Amazon Scholar, working in Amazon’s Search org; Visiting Scientist at Google Research; Senior quant and founding member of an algorithmic trading team at the hedge fund Engineers Gate.

Background

Research interests are in machine learning, with a current focus on understanding and improving the architecture and training of large-scale models for language and representation learning. Also broadly interested in understanding machine learning from a theoretical perspective.

Co-authors

42 total