Published "Learning Compositionality from Multifaceted Synthetic Data for Language-based Object Detection" in IJCV 2025.
Published "CaptionSmiths: Flexibly Controlling Language Pattern in Image Captioning" in ICCV 2025 (Highlighted Paper).
Published "Hierarchical Entailment Representations for Linguistic Compositionality in Language-based Object Detection" at ICCV 2025 Workshop on "What is Next in Multimodal Foundation Models?".
Published "A Multimodal Chain of Tools for Described Object Detection" at NeurIPS 2024 Workshop on "Compositional Learning".
Published "KOALA: Empirical Lessons Toward Memory-Efficient and Fast Diffusion Models for Text-to-Image Synthesis" at NeurIPS 2024, also presented at CVPR 2024 Workshop on "Generative Models for Computer Vision", covered by YTN, Yonhap News, AI Times, and other Korean media.
Published "Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection" at ECCV 2024, also presented at CVPR 2024 Workshop on "Generative Models for Computer Vision".