Scholar
Shuai Bai
Google Scholar ID: ylhI1JsAAAAJ
Qwen Team, Alibaba Group
Multi-Modal Learning
Visual Generation
Citations & Impact
All-time
Citations: 17,689
H-index: 23
i10-index: 25
Publications: 20
Co-authors: 18 (list available)
Contact
No contact links provided.
Publications (16 items)
GenMask: Adapting DiT for Segmentation via Direct Mask. 2026. Cited: 0
Learning Transferable Temporal Primitives for Video Reasoning via Synthetic Videos. 2026. Cited: 0
Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents. 2026. Cited: 0
Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking. arXiv.org, 2026. Cited: 9
VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models. arXiv.org, 2026. Cited: 1
VLCache: Computing 2% Vision Tokens and Reusing 98% for Vision-Language Inference. 2025. Cited: 0
Qwen3-VL Technical Report. 2025. Cited: 0
Soft Adaptive Policy Optimization. 2025. Cited: 0
Co-authors
18 total
Junyang Lin
Qwen Team, Alibaba Group & Peking University
Jingren Zhou
Alibaba Group, Microsoft
Hongxia Yang
Professor, HK Polytechnic University
Dayiheng Liu (刘大一恒)
Qwen Team, Alibaba Group
Rui Men
Qwen Team, Alibaba Group & Peking University