- November 2025, one paper accepted by AAAI 2026 (oral).
- January 2025, one paper accepted by CVPR 2025.
Projects:
- UniDiffusion: A Diffusion training toolbox based on diffusers and existing SOTA methods.
- Awesome Controllable T2I Diffusion Models: A collection of resources on controllable generation with text-to-image diffusion models.
- GAN Inverter: A GAN inversion toolbox based on the PyTorch library.
Research Experience
Currently a fourth-year Ph.D. student at Beijing University of Posts and Telecommunications, working on research related to Visual Synthesis and Multimodal Large-language Model.
Education
PhD in Artificial Intelligence, 2022-present, Beijing University of Posts and Telecommunications, supervised by Prof. Qing Song and Dr. Lu Yang; BSc in Information and Computational Science, 2018, University of Science and Technology Beijing.
Background
Research interests include Image Synthesis, Multimodal Large Language Models, Visual Representation, Image Detection/Segmentation, and Computer Vision. Currently a Ph.D. student in Artificial Intelligence, focusing on Visual Synthesis and Multimodal Large-language Model.