Scholar
Zechen Bai
Google Scholar ID: aIdQ8GwAAAAJ
National University of Singapore
Multimodal
Computer Vision
Virtual Reality
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,339
H-index
14
i10-index
16
Publications
20
Co-authors
7
list available
Contact
No contact links provided.
Publications
8 items
Kiwi-Edit: Versatile Video Editing via Instruction and Reference Guidance
2026
Cited
0
SIGMA: Selective-Interleaved Generation with Multi-Attribute Tokens
2026
Cited
0
World-VLA-Loop: Closed-Loop Learning of Video World Model and VLA Policy
2026
Cited
0
EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models
2025
Cited
0
Impossible Videos
2025
Cited
0
Show-o: One Single Transformer to Unify Multimodal Understanding and Generation
International Conference on Learning Representations · 2024
Cited
292
Bridging Information Asymmetry in Text-video Retrieval: A Data-centric Approach
2024
Cited
0
Hallucination of Multimodal Large Language Models: A Survey
arXiv.org · 2024
Cited
113
Resume (English only)
Co-authors
7 total
Mike Z. SHOU
National U. of Singapore; Facebook AI; Columbia University
Tong He
Senior Applied Scientist @ AWS
Tianjun Xiao
Tesla Autopilot
Pichao WANG
Amazon AGI
Difei Gao
National U. of Singapore; Institute of Computing Technology, Chinese Academy of Sciences
Kevin Qinghong Lin
University of Oxford; National U. of Singapore
Thomas Brox
University of Freiburg
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up