Scholar

Yali Wang

Google Scholar ID: hD948dkAAAAJ

Professor, Shenzhen Institutes of Advanced Technology，Chinese Academy of Sciences

Video UnderstandingMulti-Modal LearningComputer Vision

Google Scholar↗

Citations & Impact

All-time

Citations

10,078

H-index

44

i10-index

72

Publications

20

Co-authors

10

list available

Contact

No contact links provided.

Publications

7 items

WinTok: A Win-Win Hybrid Tokenizer via Decomposing Visual Understanding and Generation with Transferable Tokens

2026

Cited

0

Breaking Dual Bottlenecks: Evolving Unified Multimodal Models into Self-Adaptive Interleaved Visual Reasoners

2026

Cited

0

BitLM: Unlocking Multi-Token Language Generation with Bitwise Continuous Diffusion

2026

Cited

0

What Matters for Diffusion-Friendly Latent Manifold? Prior-Aligned Autoencoders for Latent Diffusion

2026

Cited

0

UniWeTok: An Unified Binary Tokenizer with Codebook Size $\mathit{2^{128}}$ for Unified Multimodal Large Language Model

2026

Cited

0

MotionWeaver: Holistic 4D-Anchored Framework for Multi-Humanoid Image Animation

2026

Cited

0

Video-o3: Native Interleaved Clue Seeking for Long Video Multi-Hop Reasoning

2026

Cited

0

Resume (English only)

Co-authors

10 total

Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS

Nanjing University

Shenzhen Institutes of Advanced Technology, University of Chinese Academy of Sciences

Marcus A Brubaker

Research Scientist, Google DeepMind; Associate Prof, York University; Affiliate, Vector Institute

Founder & CEO Waabi, Professor, University of Toronto.

Department of Computer Science, University College London