Scholar
Weihua Du
Google Scholar ID: 6CqQ4L8AAAAJ
LTI, Carnegie Mellon University
language models
reinforcement learning
embodied AI
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
466
H-index
5
i10-index
3
Publications
11
Co-authors
13
list available
Contact
Email
weihuada@cs.cmu.edu
CV
Open ↗
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
8 items
Mind the Sim2Real Gap in User Simulation for Agentic Tasks
2026
Cited
0
GradAlign: Gradient-Aligned Data Selection for LLM Reinforcement Learning
2026
Cited
0
Training Proactive and Personalized LLM Agents
2025
Cited
0
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
2025
Cited
0
Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
2025
Cited
0
Agentic-R1: Distilled Dual-Strategy Reasoning
2025
Cited
0
Optimizing Temperature for Language Models with Multi-Sample Inference
2025
Cited
0
Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge
Neural Information Processing Systems · 2024
Cited
1
Resume (English only)
Co-authors
13 total
Hongxin Zhang
University of Massachusetts, Amherst
Chuang Gan
UMass Amherst | MIT-IBM Watson AI Lab
Yilun Du
Harvard University
Joshua B. Tenenbaum
MIT
Qinhong Zhou
University of Massachusetts Amherst
Tianmin Shu
Assistant Professor, JHU
Zehui Chen
USTC
Wenwei Zhang
Shanghai AI Laboratory
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up