Hao Peng

Google Scholar ID: 6Y37nm0AAAAJ

University of Illinois Urbana-Champaign

Natural Language Processing, Machine Learning

Citations & Impact

Citations (all-time): 6,203

Contact

Email: haopeng@illinois.edu

Publications

10 items

2026: 'MedGuideX: Internalizing Decision Logic from Executable Guidelines into Large Language Models for Clinical Reasoning'
2026: 'CausaLab: A Scalable Environment for Interactive Causal Discovery Toward AI Scientists'
2026: 'Towards a Universal Causal Reasoner'
2026: 'RoPE Distinguishes Neither Positions Nor Tokens in Long Contexts, Provably'
2026: 'Useful Memories Become Faulty When Continuously Updated by LLMs'
2026: 'Improving Clinical Diagnosis with Counterfactual Multi-Agent Reasoning'
2026: 'Countdown-Code: A Testbed for Studying The Emergence and Generalization of Reward Hacking in RLVR'
2026: 'Do We Need Adam? Surprisingly Strong and Sparse Reinforcement Learning with SGD in LLMs'

Resume

Academic Achievements

2025: 'From f(x) and g(x) to f(g(x)): LLMs Learn New Skills in RL by Composing Old Ones'
2025: Co-authored 'Probing the Critical Point (CritPt) of AI Reasoning: a Frontier Physics Research Benchmark' (58 authors total)
2025: 'Executable Counterfactuals: Improving LLMs’ Causal Reasoning Through Code'
2025: 'Context Length Alone Hurts LLM Performance Despite Perfect Retrieval' (EMNLP Findings)
2025: 'The Best Instruction-Tuning Data are Those That Fit' (NeurIPS)
2025: 'The Unreasonable Effectiveness of Entropy Minimization in LLM Reasoning' (NeurIPS)
2025: 'Reinforcement Learning Finetunes Small Subnetworks in Large Language Models' (NeurIPS)

Background

Assistant Professor in the Department of Computer Science, University of Illinois at Urbana-Champaign (UIUC)
Current research focuses on large language models (LLMs)
Works on solving complex reasoning problems in a generalizable way, emphasizing learning from experience (e.g., reinforcement learning) and insights from human cognition
Interested in causal understanding and reasoning about the world
Committed to positively impacting society through AI
Aims to advance the frontier of human knowledge and contribute to scientific discovery, viewing such discovery as the ultimate demonstration of true generalization beyond training data

Co-authors

Co-authors: 0 (list not available)