Scholar
Yatian Pang
Google Scholar ID: AZQyNWkAAAAJ
National University of Singapore
Multi-modal understanding
Multi-modal generation
Unified models
Follow
Homepage
↗
Google Scholar
↗
Citations & Impact
All-time
Citations
1,674
H-index
10
i10-index
10
Publications
14
Co-authors
6
list available
Contact
Email
yatian_pang@u.nus.edu
GitHub
Open ↗
LinkedIn
Open ↗
Publications
6 items
E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras
2025
Cited
0
UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation
2025
Cited
0
VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
2025
Cited
0
SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video
2025
Cited
0
Next Patch Prediction for Autoregressive Visual Generation
2024
Cited
1
VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention
2024
Cited
5
Resume (English only)
Academic Achievements
Achieved state-of-the-art results on various benchmarks in Qwen3-VL video understanding
Proposed UniWorld, a unified framework connecting frozen VLMs with diffusion generators via a novel semantic encoder
Key contributor to Open-Sora-Plan, releasing high-quality video generation architecture and data
“Video Sparse Attention for Streaming Long Video Understanding” (2025, under submission)
“Unified Autoregressive Pretraining for Image Generation and Representation Learning” (2025, under submission)
“Next Patch Prediction for Autoregressive Visual Generation” (arXiv, 2024)
“DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses” (ICCV 2025)
“Envision3D: One Image to 3D with Anchor Views Interpolation” (arXiv, 2024)
“Masked autoencoders for point cloud self-supervised learning” (ECCV, 2022)
Co-authored “MoE-LLaVA: Mixture of Experts for Large Vision-Language Models” (IEEE TMM, 2024)
Co-authored “LanguageBind: Extending Video-Language Pretraining to N-Modality by Language-Based Semantic Alignment” (ICLR 2024)
Co-authors
6 total
Li Yuan, 袁粒
Peking University, Shenzhen Graduate School, School of ECE
Bin Lin (林彬)
Ph.D. candidate (expected: 2028), Peking University
Bin Zhu
Peking University
Eng Hock Francis Tay
Associate Professor of Mechanical Engineering, National University of Singapore
Peng Jin
PhD student, Peking University
Co-author 6
×
Welcome back
Sign in to Agora
Welcome back! Please sign in to continue.
Email address
Password
Forgot password?
Continue
Do not have an account?
Sign up