* 'Walk Wisely on Graph: Knowledge Graph Reasoning with Dual Agents via Efficient Guidance-Exploration', AAAI Conference on Artificial Intelligence (AAAI), 2025
* 'Windows deep transformer Q-networks: an extended variance reduction architecture for partially observable reinforcement learning', Applied Intelligence, 2024
* 'Controllable Flow Matching for Online Reinforcement Learning', AAAI Conference on Artificial Intelligence (AAAI), 2026
* 'Decomposing Scientific Paper Queries with Draft-and-Follow Policy Optimization', International Conference on Learning Representations (ICLR), 2026, Under Review
- Awards:
* National Scholarship (Ministry of Education, Top 0.2%), 2024
* First Prize, The Chinese Mathematics Competitions for College Students, 2023
* Outstanding Student Prize, China University of Petroleum, 2023-2024 (2 consecutive years)
* Outstanding Star Scholarship for Innovation and Entrepreneurship, China University of Petroleum, 2024-2025 (2 consecutive years)
Research Experience
- Position: Research Intern
- Institution: X-LANCE Lab, Shanghai Jiao Tong University
- Time: Apr 2025 - Present
- Project: LLM Post-training & Agent-RL within the LLM group
- Supervisor: Prof. Lu Chen
- Position: Undergraduate Researcher
- Institution: Department of Artificial Intelligence, China University of Petroleum (East China)
- Time: Sep 2023 - Present
- Project: Research on Graph Reinforcement Learning & Model-based Reinforcement Learning
- Advisor: Prof. Bin Wang
Education
- Degree: Bachelor's
- School: China University of Petroleum (East China)
- Time: Sep 2022 - Jun 2026
- Major: Computer Science and Technology
- GPA: 4.18/5.00 (91.8/100)
- Ranking: 2nd out of 88 students
- Core Courses: Data Structure, Machine Learning, Deep Learning, Computer Vision, Natural Language Processing, etc.
- Degree: Doctorate
- School: Shanghai Jiao Tong University
- Advisor: Prof. Lu Chen
- Time: Sep 2026 - Present
- Lab: X-LANCE Lab
Background
- Research Interests: Reinforcement Learning (especially Graph RL and Model-based RL)
- Field: Computer Science and Technology
- Brief Introduction: Currently a 4th year undergraduate student at China University of Petroleum (East China), and will be a Ph.D. Student at X-LANCE Lab, Shanghai Jiao Tong University from Fall 2026.