Scholar
Yali Wang
Google Scholar ID: hD948dkAAAAJ
Professor, Shenzhen Institutes of Advanced Technology,Chinese Academy of Sciences
Video Understanding
Multi-Modal Learning
Computer Vision
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
10,078
H-index
44
i10-index
72
Publications
20
Co-authors
10
list available
Contact
No contact links provided.
Publications
7 items
WinTok: A Win-Win Hybrid Tokenizer via Decomposing Visual Understanding and Generation with Transferable Tokens
2026
Cited
0
Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners
2026
Cited
0
BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion
2026
Cited
0
What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion
2026
Cited
0
UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model
2026
Cited
0
MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation
2026
Cited
0
Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning
2026
Cited
0
Resume (English only)
Co-authors
10 total
Yu Qiao
Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS
Limin Wang
Nanjing University
Kunchang Li
ByteDance Seed
Jianzhuang Liu
Shenzhen Institutes of Advanced Technology, University of Chinese Academy of Sciences
Zhifeng Li
Tencent
Marcus A Brubaker
Research Scientist, Google DeepMind; Associate Prof, York University; Affiliate, Vector Institute
Raquel Urtasun
Founder & CEO Waabi, Professor, University of Toronto.
David Barber
Department of Computer Science, University College London
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up