Scholar
Xuesong Yao
Google Scholar ID: YcgQGDMAAAAJ
Master of Mechanics, Peking University
Machine Learning
Large language model
Citations & Impact
All-time
Citations: 160
H-index: 4
i10-index: 3
Publications: 8
Co-authors: 0
Publications
9 items
Critique-RL: Training Language Models for Critiquing through Two-Stage Reinforcement Learning
2025. Cited: 0
Scaling Long-Horizon LLM Agent via Context-Folding
2025. Cited: 0
Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
2025. Cited: 0
Generalizable End-to-End Tool-Use RL with Synthetic CodeGym
2025. Cited: 0
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
2025. Cited: 0
Feedback-Driven Tool-Use Improvements in Large Language Models via Automated Build Environments
2025. Cited: 0
Seed-Thinking-v1.5: Advancing Superb Reasoning Models with Reinforcement Learning
2025. Cited: 1
Recitation over Reasoning: How Cutting-Edge Language Models Can Fail on Elementary School-Level Reasoning Problems?
2025. Cited: 0