Published in Nature Machine Intelligence: 'Efficient and Scalable Reinforcement Learning for Large-scale Network Control', covered by Xinhua Net, Science and Technology Daily, and Peking University News
ICLR 2025: 'Magnetic Preference Optimization: Achieving Last-iterate Convergence for Language Model Alignment'
AAMAS 2025: 'Mean Field Correlated Imitation Learning'
NeurIPS 2024: 'Panacea: Pareto Alignment via Preference Adaptation for LLMs'
arXiv 2023: 'Evolving Diverse Red-team Language Models in Multi-round Multi-agent Games'
Co-instructor for ICML 2025 Tutorial: 'Alignment Methods for Language Models: A Machine Learning Perspective'
Invited talks in 2024 at the Distributed Artificial Intelligence (DAI) conference, National Key Lab of Autonomous Intelligent Unmanned Systems (BIT), and Cognitive Computing and Reasoning Lab (BIGAI)