- Paper: Visual-RFT: Visual Reinforcement Fine-Tuning, accepted at ICCV 2025
- Paper: MIA-DPO: Multi-Image Augmented Direct Preference Optimization For Large Vision-Language Models, accepted at ICLR 2025
- Paper: MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs, accepted at NeurIPS 2024
- Workshop organization: Visual Perception via Learning in an Open World: The 4th Workshop on Open World Vision and the V3Det Challenge at CVPR 2024
Research Experience
- Intern at Shanghai AI Laboratory
- Contributed to multiple research projects, including Visual-ARFT, Visual-RFT, MIA-DPO, and MMDU
Education
- PhD in progress, Shanghai Jiao Tong University
- Intern, Shanghai AI Laboratory
Background
- Research interests: Multimodal large language models (MLLMs), reinforcement fine-tuning (RFT), reinforcement learning from human feedback (RLHF), and retrieval-augmented generation (RAG)
- Currently an intern at Shanghai AI Laboratory and pursuing a PhD at Shanghai Jiao Tong University.