Scholar
Boyuan Sun
Google Scholar ID: GvTWUAEAAAAJ
Nankai University
Computer Vision
Multi-Modal Large Language Model
Semantic Segmentation
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
254
H-index
5
i10-index
3
Publications
9
Co-authors
9
list available
Contact
GitHub
Open ↗
Publications
7 items
GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics
2026
Cited
0
Depth Anything at Any Condition
2025
Cited
0
LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
2025
Cited
0
HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context
2025
Cited
0
HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
2025
Cited
0
Facial Dynamics in Video: Instruction Tuning for Improved Facial Expression Perception and Contextual Awareness
2025
Cited
0
LLaVA-Octopus: Unlocking Instruction-Driven Adaptive Projector Fusion for Video Understanding
2025
Cited
0
Resume (English only)
Co-authors
9 total
Qibin Hou
Nankai University
Ming-Ming Cheng
Professor of Computer Science, Nankai University
Bo-Wen Yin 尹博文
Nankai University
Jia-Xing Zhao
Master, Nankai University
Luc Van Gool
professor computer vision INSAIT Sofia University, em. KU Leuven, em. ETHZ, Toyota Lab TRACE
Yuqi Yang
Nankai University
Zhong-Yu Li
Nankai University
Jingren Zhou
Alibaba Group, Microsoft
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up