Scholar

Yatian Pang

Google Scholar ID: AZQyNWkAAAAJ

National University of Singapore

Multi-modal understandingMulti-modal generationUnified models

Homepage↗Google Scholar↗

Citations & Impact

All-time

Citations

1,674

H-index

i10-index

Publications

Co-authors

list available

Contact

Emailyatian_pang@u.nus.edu GitHubOpen ↗LinkedInOpen ↗

Publications

6 items

E-4DGS: High-Fidelity Dynamic Reconstruction from the Multi-view Event Cameras

2025

Cited

UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation

2025

Cited

VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

2025

Cited

SwapAnyone: Consistent and Realistic Video Synthesis for Swapping Any Person into Any Video

2025

Cited

Next Patch Prediction for Autoregressive Visual Generation

2024

Cited

VideoGen-of-Thought: Step-by-step generating multi-shot video with minimal manual intervention

2024

Cited

Resume (English only)

Academic Achievements

Achieved state-of-the-art results on various benchmarks in Qwen3-VL video understanding
Proposed UniWorld, a unified framework connecting frozen VLMs with diffusion generators via a novel semantic encoder
Key contributor to Open-Sora-Plan, releasing high-quality video generation architecture and data
“Video Sparse Attention for Streaming Long Video Understanding” (2025, under submission)
“Unified Autoregressive Pretraining for Image Generation and Representation Learning” (2025, under submission)
“Next Patch Prediction for Autoregressive Visual Generation” (arXiv, 2024)
“DreamDance: Animating Human Images by Enriching 3D Geometry Cues from 2D Poses” (ICCV 2025)
“Envision3D: One Image to 3D with Anchor Views Interpolation” (arXiv, 2024)
“Masked autoencoders for point cloud self-supervised learning” (ECCV, 2022)
Co-authored “MoE-LLaVA: Mixture of Experts for Large Vision-Language Models” (IEEE TMM, 2024)
Co-authored “LanguageBind: Extending Video-Language Pretraining to N-Modality by Language-Based Semantic Alignment” (ICLR 2024)

Co-authors

6 total

Li Yuan, 袁粒

Peking University, Shenzhen Graduate School, School of ECE

Bin Lin (林彬)

Ph.D. candidate (expected: 2028), Peking University

Bin Zhu

Peking University

Eng Hock Francis Tay

Associate Professor of Mechanical Engineering, National University of Singapore

Peng Jin

PhD student, Peking University

Co-author 6