Browse publications on Google Scholar (top-right) ↗
Resume (English only)
Academic Achievements
Published “Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL” at NeurIPS 2025
Published “POSITION BIAS MITIGATES POSITION BIAS: Mitigate Position Bias Through Inter-Position Knowledge Distillation” at EMNLP 2025
Published “A Closed-Loop Architecture with Knowledge-of-Results Feedback for Neural-Symbolic Planning” in Knowledge-Based Systems (KBS), vol. 326, 114041, July 2025
Published “Enhancing LLM Reasoning with Iterative DPO: A Comprehensive Empirical Investigation” at COLM 2025
Published “Learning Dynamics in Continual Pre-Training for Large Language Models” at ICML 2025 (Spotlight/Oral)
Published “Uncertainty Unveiled: Can Exposure to More In-context Examples Mitigate Uncertainty for Large Language Models?” in Findings of ACL 2025
Published “Evaluating Generalization Capability of Language Models across Abductive, Deductive and Inductive Logical Reasoning” at COLING 2025
Published “A Symbolic Incubator for Training Planners via Recognition-based Feedback” at IEEE ISI 2025
Published “Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons” at EMNLP 2024
Published “Learning Strategy Representation for Imitation Learning in Multi-Agent Games” at AAAI 2025
Published two papers at AAMAS 2025: “CPE: A New Paradigm for Policy Extraction in Offline Reinforcement Learning” and “Offline Meta Reinforcement Learning with Weighted Policy Constraints and Proximal Context Collection”