Publications
- Lumina-Next: Making Lumina-T2X Stronger and Faster with Next-DiT
- Lumina-T2X: Transforming Text into Any Modality, Resolution, and Duration via Flow-based Large Diffusion Transformers
- SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models
- SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models
- LLaMA-Adapter: Efficient Fine-tuning of Language Models with Zero-init Attention
- A Simple Romance Between Multi-Exit Vision Transformer and Token Reduction
- Function-Consistent Feature Distillation
Research Experience
- Currently a Ph.D. student at MMLab, involved in multiple research projects, including Lumina-Video, Lumina-mGPT, and Lumina-Next.
- During master's studies at the VIPL lab, participated in various research projects.
Education
- 2024.09 - Present: MMLab, The Chinese University of Hong Kong (Ph.D.), Supervisor: Prof. Hongsheng Li
- 2021.09 - 2024.06: VIPL Lab, Institute of Computing Technology, Chinese Academy of Sciences (Master), Supervisors: Prof. Shiguang Shan and Prof. Meina Kan
- 2017.09 - 2021.06: School of Software Engineering, Tongji University (Bachelor)
Background
- Research Interests: Multimodal understanding and generation
- Background: Currently a first-year Ph.D. student at MMLab, CUHK, supervised by Prof. Hongsheng Li. Previously obtained a master's degree from the VIPL lab at the Institute of Computing Technology, Chinese Academy of Sciences, supervised by Prof. Shiguang Shan and Prof. Meina Kan.