Bolian Li

Google Scholar ID: wNDoepwAAAAJ

Purdue University

LLM Post-Training, AI Safety, Bayesian Deep Learning

Citations & Impact

Citations (all-time): 293

Publications

15 items

Uniform-Correct Policy Optimization: Breaking RLVR's Indifference to Diversity

2026

Addressing Performance Saturation for LLM RL via Precise Entropy Curve Control

2026

SARL: Label-Free Reinforcement Learning by Rewarding Reasoning Topology

2026

Learning Self-Correction in Vision-Language Models via Rollout Augmentation

2026

Why Reasoning Fails to Plan: A Planning-Centric Analysis of Long-Horizon Decision Making in LLM Agents

2026

Structure-R1: Dynamically Leveraging Structural Knowledge in LLM Reasoning through Reinforcement Learning

2025

From Personal to Collective: On the Role of Local and Global Memory in LLM Personalization

2025

DRIFT: Learning from Abundant User Dissatisfaction in Real-World Preference Learning

2025

Resume

Academic Achievements

One short paper accepted at EMNLP 2025; two papers accepted at COLM 2025; one paper accepted at ICML 2025; one paper accepted at ICLR 2025; one paper accepted by TMLR in 2024.

Education

PhD: Department of Computer Science, Purdue University (advisor: Ruqi Zhang).

BE: Computer Science and Technology, Tianjin University (advisor: Changqing Zhang).

Background

Research Interests: Building statistical frameworks that improve the stability and efficiency of LLM post-training, with a recent focus on the sampling process in RLVR (reinforcement learning with verifiable rewards) algorithms. Also broadly interested in preference alignment, (multimodal) LLM safety, and Bayesian deep learning.
