Scholar

Tengyu Xu

Google Scholar ID: g0Rc2skAAAAJ

OpenAI

Reinforcement LearningOptimizationLarge Language Model

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,304

H-index

19

i10-index

20

Publications

20

Co-authors

4

list available

Contact

No contact links provided.

Publications

7 items

Boosting LLM Reasoning via Spontaneous Self-Correction

2025

Cited

0

LlamaRL: A Distributed Asynchronous Reinforcement Learning Framework for Efficient Large-scale LLM Training

2025

Cited

0

Reinforcement Learning from User Feedback

2025

Cited

0

Learning Auxiliary Tasks Improves Reference-Free Hallucination Detection in Open-Domain Long-Form Generation

2025

Cited

0

HyperZero: A Customized End-to-End Auto-Tuning System for Recommendation with Hourly Feedback

2025

Cited

0

Think Smarter not Harder: Adaptive Reasoning with Inference Aware Optimization

2025

Cited

0

Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback

2025

Cited

0

Resume (English only)

Co-authors

4 total

The Ohio State University

Guanghui (George) Lan

Professor, Georgia Institute of Technology

Associate Professor at Northwestern University