Scholar
Ganlin Yang
Google Scholar ID: 321C4TQAAAAJ
University of Science and Technology of China && Shanghai AI Laboratory
Computer vision
3D vision
Multimodal models
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
17
H-index
3
i10-index
0
Publications
7
Co-authors
6
list available
Contact
No contact links provided.
Publications
6 items
ScaleEdit-12M: Scaling Open-Source Image Editing Data Generation via Multi-Agent Framework
2026
Cited
0
InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing
2026
Cited
0
ACE-Brain-0: Spatial Intelligence as a Shared Scaffold for Universal Embodiments
2026
Cited
0
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning
2025
Cited
0
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
2025
Cited
0
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces
2025
Cited
0
Resume (English only)
Co-authors
6 total
Dong Liu
University of Science and Technology of China
Co-author 2
Zhizheng Zhang (张直政)
Co-founder & VP of Large Models at Galbot << Microsoft Research
Guoqiang Wei (魏国强)
ByteDance Seed
Yan Lu
Microsoft Research Asia
Jingjing Fu
MS
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up