Scholar
Xueru Wen
Google Scholar ID: MGhwXeUAAAAJ
School of Computer Science and Technology, University of Chinese Academy of Sciences
Natural Language Processing
Alignment
Large Language Model
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
66
H-index
5
i10-index
2
Publications
12
Co-authors
13
list available
Contact
No contact links provided.
Publications
9 items
Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards
2026
Cited
0
Coupled Variational Reinforcement Learning for Language Model General Reasoning
2025
Cited
0
The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models
2025
Cited
0
Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch
2025
Cited
0
Scalable Oversight for Superhuman AI via Recursive Self-Critiquing
2025
Cited
0
Transferable Post-training via Inverse Value Learning
North American Chapter of the Association for Computational Linguistics · 2024
Cited
1
Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?
arXiv.org · 2024
Cited
1
Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic
arXiv.org · 2024
Cited
14
Load more
Resume (English only)
Co-authors
13 total
Le Sun
Institute of Software, CAS
Yaojie Lu
Institute of Software, Chinese Academy of Sciences
Han Xianpei
Professor, Institute of Software, Chinese Academy of Sciences
Jie Lou
Xiaohongshu
Hongyu Lin
Institute of Software, Chinese Academy of Sciences
Xinyu Lu
UCAS
Ben He
Professor, University of Chinese Academy of Sciences
Xinyan Guan
Institute of Software, Chinese Academy of Sciences
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up