Scholar
Kaiwen Zhu
Google Scholar ID: O8lP5XMAAAAJ
Shanghai Jiao Tong University
Multi-Modal Generation
Computer Vision
Follow
Google Scholar
↗
Citations & Impact
All-time
Citations
55
H-index
3
i10-index
2
Publications
6
Co-authors
7
list available
Contact
No contact links provided.
Publications
10 items
Accelerating Masked Image Generation by Learning Latent Controlled Dynamics
2026
Cited
0
UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
2025
Cited
0
dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models
2025
Cited
0
PICABench: How Far Are We from Physically Realistic Image Editing?
2025
Cited
0
Lumina-DiMOO: An Omni Diffusion Large Language Model for Multi-Modal Generation and Understanding
2025
Cited
0
Exploring Scalable Unified Modeling for General Low-Level Vision
2025
Cited
0
ArtiMuse: Fine-Grained Image Aesthetics Assessment with Joint Scoring and Expert-Level Understanding
2025
Cited
0
Lumina-OmniLV: A Unified Multimodal Framework for General Low-Level Vision
2025
Cited
0
Load more
Resume (English only)
Co-authors
7 total
Chao Dong
Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
Zhiyuan You
MMLab, The Chinese University of Hong Kong
Jinjin GU
Tenure-Track Faculty Member, INSAIT, Sofia University
Tianfan Xue
Information Engineering Department, The Chinese University of Hong Kong
Yuandong Pu
SJTU,Shanghai AI Laboratory
Yu Qiao
Professor of Shanghai AI Laboratory; Shenzhen Institutes of Advanced Technology, CAS
Yihao Liu
Shanghai Artificial Intelligence Laboratory
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up