Scholar
Rui Yuan
Google Scholar ID: 4QZgrj0AAAAJ
Unknown affiliation
Machine learning
Deep learning
Reinforcement learning
Optimization
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
256
H-index
5
i10-index
4
Publications
11
Co-authors
7
list available
Contact
Email
yy42606r@gmail.com
CV
Open ↗
Twitter
Open ↗
GitHub
Open ↗
LinkedIn
Open ↗
Publications
10 items
Beyond Uniform Credit: Causal Credit Assignment for Policy Optimization
2026
Cited
1
AlignTune: Modular Toolkit for Post-Training Alignment of Large Language Models
2026
Cited
0
Beyond KL Divergence: Policy Optimization with Flexible Bregman Divergences for LLM Reasoning
2026
Cited
1
SLAM-LLM: A Modular, Open-Source Multimodal Large Language Model Framework and Best Practice for Speech, Language, Audio and Music Processing
IEEE Journal on Selected Topics in Signal Processing · 2026
Cited
0
The Initialization Determines Whether In-Context Learning Is Gradient Descent
2025
Cited
0
Wireless Power Transfer and Intent-Driven Network Optimization in AAVs-assisted IoT for 6G Sustainable Connectivity
2025
Cited
0
A digital SRAM-based compute-in-memory macro for weight-stationary dynamic matrix multiplication in Transformer attention score computation
2025
Cited
0
From predictions to confidence intervals: an empirical study of conformal prediction methods for in-context learning
2025
Cited
0
Load more
Resume (English only)
Academic Achievements
Understanding In-Context Learning in Transformers, ICLR Blogposts Track, 2024
A Novel Framework for Policy Mirror Descent with General Parameterization and Linear Convergence, NeurIPS, 2023
Thesis: Stochastic Second Order Methods and Finite Time Analysis of Policy Gradient Methods, 2023
Linear Convergence of Natural Policy Gradient Methods with Log-Linear Policies, ICLR, 2023
A General Sample Complexity Analysis of Vanilla Policy Gradient, AISTATS, 2022
SAN: Stochastic Average Newton Algorithm for Minimizing Finite Sums, AISTATS, 2022
Sketched Newton-Raphson, SIAM Journal on Optimization (SIOPT), 2022
Co-authors
7 total
Robert Mansel Gower
Research Scientist, Center for Computational Mathematics, Flatiron Institute, Simons Foundation
Alessandro Lazaric
Research Scientist, Facebook Artificial Intelligence Research
Lin Xiao
Meta AI, FAIR (Fundamental AI Research)
Simon Shaolei Du
Associate Professor, School of Computer Science and Engineering, University of Washington
Carlo Alfano
University of Oxford
Simone Rossi
Assistant Professor, EURECOM
Guillaume Garrigos
Université Paris Cité
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up