Tengyu Xu
Google Scholar ID: g0Rc2skAAAAJ
OpenAI
Reinforcement Learning
Optimization
Large Language Model
Citations & Impact
All-time
Citations: 1,304
H-index: 19
i10-index: 20
Publications: 20
Co-authors: 4 (list available)
Publications
7 items
Boosting LLM Reasoning via Spontaneous Self-Correction
2025. Cited: 0
LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Training
2025. Cited: 0
Reinforcement Learning from User Feedback
2025. Cited: 0
Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation
2025. Cited: 0
HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback
2025. Cited: 0
Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization
2025. Cited: 0
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback
2025. Cited: 0
Co-authors
4 total
Yingbin Liang
The Ohio State University
Guanghui (George) Lan
Professor, Georgia Institute of Technology
Zhaoran Wang
Associate Professor at Northwestern University