Scholar

Boyuan Sun

Google Scholar ID: GvTWUAEAAAAJ

Nankai University

Computer Vision, Multi-Modal Large Language Model, Semantic Segmentation

Citations & Impact

Citations (all-time): 254

H-index: 5

i10-index: 3

Publications: 9

Co-authors: 9

Publications

10 items

AgentSociety 2: An Integrated Research Environment for Executable Social Science
2026, cited by 0

MLEvolve: A Self-Evolving Framework for Automated Machine Learning Algorithm Discovery
2026, cited by 0

See What I Mean: Aligning Vision and Language Representations for Video Fine-grained Object Understanding
2026, cited by 0

GeoAgent: Learning to Geolocate Everywhere with Reinforced Geographic Characteristics
2026, cited by 0

Depth Anything at Any Condition
2025, cited by 0

LLaVA-Scissor: Token Compression with Semantic Connected Components for Video LLMs
2025, cited by 0

HumanOmniV2: From Understanding to Omni-Modal Reasoning with Context
2025, cited by 0

HumanOmni: A Large Vision-Speech Language Model for Human-Centric Video Understanding
2025, cited by 0

Co-authors

9 total

Nankai University

Ming-Ming Cheng

Professor of Computer Science, Nankai University

Bo-Wen Yin 尹博文

Nankai University

Master, Nankai University

Professor of computer vision, INSAIT Sofia University; emeritus, KU Leuven; emeritus, ETH Zurich; Toyota Lab TRACE

Nankai University

Nankai University

Alibaba Group, Microsoft