- GeometryZero: Improving Geometry Solving for LLM with Group Contrastive Policy Optimization
- VisuoThink: Empowering LVLM Reasoning with Multimodal Tree Search
- Uncertainty Aware Learning for Language Model Alignment
- RRescue: Ranking LLM Responses to Enhance Reasoning Over Context
- LLM-DA: Data Augmentation via Large Language Models for Few-Shot Named Entity Recognition
Research Experience
Tencent Hunyuan, Shanghai, China - Research Intern, Topic: Frontier visual reasoning model; supervised finetuning and reinforcement learning (Jul. 2025 ~ Present)
Shanghai Artificial Intelligence Lab, Shanghai, China - LLM Research Intern, Topic: LLM pretraining, reward modeling (Mar. 2024 ~ Sept. 2024)
Education
Ph.D. Student at Fudan University & SII (Joint Program), supervised by Prof. Dacheng Tao
Background
Research Interests: Multimodal LLMs and LLM agents. Previously worked with Dr. Xuanjing Huang, Dr. Fei Liu, and Dr. Yiran Chen on robust watermarking.