Scholar
Zejun Li
Google Scholar ID: FYppLbUAAAAJ
Fudan University
vision-language
multi-modality
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
226
H-index
8
i10-index
7
Publications
20
Co-authors
10
list available
Contact
No contact links provided.
Publications
8 items
SpatialNav: Leveraging Spatial Scene Graphs for Zero-Shot Vision-and-Language Navigation
arXiv.org · 2026
Cited
0
Mixture-of-Visual-Thoughts: Exploring Context-Adaptive Reasoning Mode Selection for General Visual Reasoning
2025
Cited
0
Simple o3: Towards Interleaved Vision-Language Reasoning
2025
Cited
0
MoIIE: Mixture of Intra- and Inter-Modality Experts for Large Vision Language Models
2025
Cited
0
AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs
2025
Cited
0
OViP: Online Vision-Language Preference Learning
2025
Cited
0
Activating Distributed Visual Region within LLMs for Efficient and Effective Vision-Language Training and Inference
arXiv.org · 2024
Cited
0
VoCoT: Unleashing Visually Grounded Multi-Step Reasoning in Large Multi-Modal Models
arXiv.org · 2024
Cited
3
Resume (English only)
Co-authors
10 total
Zhongyu Wei (魏忠钰)
Associate Professor at School of Data Science, Fudan University
Huang Xuanjing (黄萱菁)
Professor of Computer Science, Fudan University
Zhihao Fan
Qwen Team; Fudan University
Siyuan Wang
University of Southern California
Jingjing Chen
Fudan University
Jiwen Zhang
Fudan University
Mengfei Du
Fudan University
Ruipu Luo
Bytedance
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up