Scholar
Zuyan Liu
Google Scholar ID: 7npgHqAAAAAJ
Tsinghua University
multi-modal
computer vision
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
1,462
H-index
9
i10-index
9
Publications
13
Co-authors
12
list available
Contact
No contact links provided.
Publications
9 items
PerceptionComp: A Video Benchmark for Complex Perception-Centric Reasoning
2026
Cited
0
Insight-V++: Towards Advanced Long-Chain Visual Reasoning with Multimodal Large Language Models
2026
Cited
0
GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization
2025
Cited
0
RealUnify: Do Unified Models Truly Benefit from Unification? A Comprehensive Benchmark
2025
Cited
0
Vision Generalist Model: A Survey
2025
Cited
0
SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs
2025
Cited
0
Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment
2025
Cited
0
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
arXiv.org · 2024
Cited
16
Load more
Resume (English only)
Co-authors
12 total
Yongming Rao
Tencent Hunyuan
Jiwen Lu (鲁继文)
Professor, Department of Automation, Tsinghua University
Jie Zhou
Professor, Department of Automation, Tsinghua University
Yuhao Dong
Tsinghua University, Nanyang Technological University
Wenliang Zhao
Tsinghua University
Ziyi Wang
Tsinghua University
Ziwei Liu
Associate Professor, Nanyang Technological University
Xumin Yu
Tencent Hunyuan
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up