Scholar
Jianzong Wu
Google Scholar ID: Q_fbCwkAAAAJ
PhD Student in School of Intelligence Science and Technology, Peking University
Computer Vision
Multi Modal Learning
Generative AI
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
433
H-index
6
i10-index
5
Publications
12
Co-authors
5
list available
Contact
No contact links provided.
Publications
8 items
Does Hearing Help Seeing? Investigating Audio-Video Joint Denoising for Video Generation
2025
Cited
0
VMoBA: Mixture-of-Block Attention for Video Diffusion Models
2025
Cited
0
Muddit: Liberating Generation Beyond Text-to-Image with a Unified Discrete Diffusion Model
2025
Cited
0
An Empirical Study of GPT-4o Image Generation Capabilities
2025
Cited
0
Decouple and Track: Benchmarking and Improving Video Diffusion Transformers for Motion Transfer
2025
Cited
0
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation
arXiv.org · 2024
Cited
2
DreamRelation: Bridging Customization and Relation Generation
2024
Cited
7
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language
arXiv.org · 2024
Cited
3
Resume (English only)
Co-authors
5 total
Yunhai Tong
Peking University
Xiangtai Li
Research Scientist, Tiktok, SG; MMLab@NTU
Henghui Ding
Fudan University
Xia Li
ETH Zurich
Dacheng Tao
Nanyang Technological University
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up