Scholar

Xueru Wen

Google Scholar ID: MGhwXeUAAAAJ

School of Computer Science and Technology, University of Chinese Academy of Sciences

Natural Language ProcessingAlignmentLarge Language Model

Google Scholar↗

Citations & Impact

All-time

Citations

H-index

i10-index

Publications

Co-authors

list available

Contact

No contact links provided.

Publications

10 items

Learning from Failures: Correction-Oriented Policy Optimization with Verifiable Rewards

2026

Cited

Decoupling Reasoning and Confidence: Resurrecting Calibration in Reinforcement Learning from Verifiable Rewards

2026

Cited

Coupled Variational Reinforcement Learning for Language Model General Reasoning

2025

Cited

The Devil Is in the Details: Tackling Unimodal Spurious Correlations for Generalizable Multimodal Reward Models

2025

Cited

Cheems: A Practical Guidance for Building and Evaluating Chinese Reward Models from Scratch

2025

Cited

Scalable Oversight for Superhuman AI via Recursive Self-Critiquing

2025

Cited

Transferable Post-training via Inverse Value Learning

North American Chapter of the Association for Computational Linguistics · 2024

Cited

Rethinking Reward Model Evaluation: Are We Barking up the Wrong Tree?

arXiv.org · 2024

Cited

Resume (English only)

Co-authors

13 total

Le Sun

Institute of Software, CAS

Yaojie Lu

Institute of Software, Chinese Academy of Sciences

Han Xianpei

Professor, Institute of Software, Chinese Academy of Sciences

Jie Lou

Xiaohongshu

Hongyu Lin

Institute of Software, Chinese Academy of Sciences

Xinyu Lu

UCAS

Ben He

Professor, University of Chinese Academy of Sciences

Xinyan Guan

Institute of Software, Chinese Academy of Sciences