- ICCV 2025: X-Dancer: Expressive music to human dance video generation, DepR: Depth Guided Single-view Scene Reconstruction with Instance-level Diffusion, YOLO-Count: Differentiable Object Counting for Text-to-Image Generation
- CVPR 2025: X-dyna: Expressive dynamic human image animation
- ECCV 2024: Dolfin: Diffusion Layout Transformers without Autoencoder
- CVPR 2024: Bayesian Diffusion Models for 3D Shape Reconstruction
- AAAI 2024: Bliva: A Simple Multimodal LLM for Better Handling of Text-rich Visual Questions
- ICCV 2023: Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction
- University of Illinois Urbana-Champaign: Jun. 2021 - Aug. 2022, Advisor: Shenlong Wang
- University of California, San Diego: Apr. 2021 - Nov. 2021, Advisor: Xiaolong Wang
- University of Texas at Austin, VITA Group: Jun. 2020 - Mar. 2021, Advisor: Zhangyang (Atlas) Wang
Education
Ph.D.: University of California, San Diego, advised by Prof. Zhuowen Tu; B.S.: University of Science and Technology of China, major in Data Science and Big Data Technology.
Background
Research interests include generative models, particularly in controllable image, video, and 3D generation. Interned at Adobe Firefly, working with Dr. Yuanjun Xiong and Dr. Kai Zhang on building video generation models.
Miscellany
Huge fan of Overwatch and a Tracer One Trick player