- Published 'Latent Space Energy-based Neural ODEs' in TMLR 2025
- Published 'TripletCLIP: Improving Compositional Reasoning of CLIP via Vision-Language Negatives' in NeurIPS 2024
- Published 'Precision or Recall? An Analysis of Image Captions for Training Text-to-Image Generation Model' in EMNLP 2024, Findings
- Published 'Revising Text-to-Image Prior for Improved Text Conditioned Image Generations' in CVPR 2024
- Published 'WOUAF: Weight Modulation for User Attribution and Fingerprinting in Text-to-Image Diffusion Models' in CVPR 2024
- Published 'Self-supervised Learning to Discover Physical Objects and Predict Their Interactions from Raw Videos' in NeurIPS AI4Science workshop, 2023
- Published 'Adversarial Bayesian Augmentation for Single-Source Domain Generalization' in ICCV 2023
- Published 'SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks' in CVPR Sketch workshop, 2022
- Published 'Data-Driven Learning of Three-Point Correlation Functions as Microstructure Representations' in Acta Materialia, 2022
- Published 'Evaluating the Robustness of Bayesian Neural Networks Against Different Types of Attacks' in CVPR Adversarial Machine Learning workshop, 2021
Research Experience
- Amazon AGI, Feb 2025 - Present, working on image/video generation
- Amazon AWS, Fall 2024, mitigating hallucinations in MLLMs using hard negative samples
- Ph.D., Arizona State University, Advisor: Yezhou Yang
- M.Eng., University of Illinois at Urbana-Champaign, Advisor: Ruoyu Sun
- B.S., Huazhong University of Science and Technology
Background
Research interests include image/video generation, multimodal LLM, vision & language (particularly in Text-to-Image generation), domain generalization & robustness, and AI in Science.