Scholar

Yiyi Zhou

Google Scholar ID: w3_2ep0AAAAJ

Xiamen University

deep learninglanguage and vision

Google Scholar↗

Citations & Impact

All-time

Citations

2,788

H-index

23

i10-index

38

Publications

20

Co-authors

8

list available

Contact

No contact links provided.

Publications

16 items

QCA: Query- and Content-Aware Keyframe Selection for Long Video Understanding

2026

Cited

0

Towards a Dynamic and Fixed-budget Memory Bank for Efficient Streaming Video Understanding

2026

Cited

0

Towards Fast and Effective Long Video Understanding of Multimodal Large Language Models via Adaptive Quasi-Gaussian Sampling

2026

Cited

0

Scaling the Long Video Understanding of Multimodal Large Language Models via Visual Memory Mechanism

2026

Cited

0

ForestPrune: High-ratio Visual Token Compression for Video Multimodal Large Language Models via Spatial-Temporal Forest Modeling

2026

Cited

0

DeepInv: A Novel Self-supervised Learning Approach for Fast and Accurate Diffusion Inversion

arXiv.org · 2026

Cited

0

Towards Effective and Efficient Long Video Understanding of Multimodal Large Language Models via One-shot Clip Retrieval

2025

Cited

0

Omni-Referring Image Segmentation

2025

Cited

0

Resume (English only)

Co-authors

8 total

Rongrong Ji 纪荣嵘

Professor, Xiamen University

Xiaoshuai Sun 孙晓帅

Professor, Xiamen University

Shanghai AI Laboratory

Xiamen University

Xiamen University

CSE, Hong Kong University of Science and Technology

Unknown affiliation

Chia-Wen Lin (林嘉文)

Distinguished Professor of Electrical Engineering, National Tsing Hua University, Taiwan