2025: 'The Best Instruction-Tuning Data are Those That Fit' (NeurIPS)
2025: 'The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning' (NeurIPS)
2025: 'Reinforcement Learning Finetunes Small Subnetworks in Large Language Models' (NeurIPS)
Background
Assistant Professor at the Department of Computer Science, University of Illinois at Urbana-Champaign (UIUC)
Current research focuses on large language models (LLMs)
Works on solving complex reasoning problems in a generalizable way, emphasizing learning from experience (e.g., reinforcement learning) and insights from human cognition
Interested in causal understanding and reasoning about the world
Committed to positively impacting society through AI
Aims to advance the frontier of human knowledge and contribute to scientific discovery as the ultimate demonstration of true generalization beyond training data