Scholar
Yiyi Zhou
Google Scholar ID: w3_2ep0AAAAJ
Xiamen University
deep learning
language and vision
Citations & Impact
All-time
Citations
2,788
H-index
23
i10-index
38
Publications
20
Co-authors
8
list available
Publications
13 items
Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism
2026
Cited
0
ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling
2026
Cited
0
DeepInv: A Novel Self-supervised Learning Approach for Fast and Accurate Diffusion Inversion
arXiv.org · 2026
Cited
0
Towards Effective and Efficient Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval
2025
Cited
0
Omni-Referring Image Segmentation
2025
Cited
0
Grounded Chain-of-Thought for Multimodal Large Language Models
2025
Cited
0
AdaFlow: Efficient Long Video Editing via Adaptive Attention Slimming And Keyframe Selection
2025
Cited
0
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy
2025
Cited
0
Co-authors
8 total
Rongrong Ji 纪荣嵘
Professor, Xiamen University
Xiaoshuai Sun 孙晓帅
Professor, Xiamen University
Gen Luo
Shanghai AI Laboratory
Jinsong Su
Xiamen University
Qiong Wu
Xiamen University
Chaoyang Zhu
CSE, Hong Kong University of Science and Technology
Xinghao Ding
Unknown affiliation
Chia-Wen Lin (林嘉文)
Distinguished Professor of Electrical Engineering, National Tsing Hua University, Taiwan