Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Published 'Value-Guided Search for Efficient Chain-of-Thought Reasoning' and 'Q#: Provably Optimal Distributional RL for LLM Post-Training' at NeurIPS 2025.
Published 'Conditional Language Policy: A General Framework for Steerable Multi-Objective Finetuning' at EMNLP 2024.
Published 'Provable Benefits of Representational Transfer in Reinforcement Learning' at COLT 2023.
Published survey paper 'The Central Role of the Loss Function in Reinforcement Learning' in Statistical Science 2025.
Published 'More Benefits of Being Distributional: Second-Order Bounds for Reinforcement Learning' at ICML 2024.
Published 'The Benefits of Being Distributional: Small-Loss Bounds for Reinforcement Learning' at NeurIPS 2023.
Published 'A Reductions Approach to Risk-Sensitive Reinforcement Learning with Optimized Certainty Equivalents' at ICML 2025.
Published 'Near-Minimax-Optimal Risk-Sensitive Reinforcement Learning with CVaR' at ICML 2023.
Published 'Efficient and Sharp Off-Policy Evaluation in Robust Markov Decision Processes' at NeurIPS 2024.