GO-MLVTON: Garment Occlusion-Aware Multi-Layer Virtual Try-On with Diffusion Models

๐Ÿ“… 2026-01-20
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing virtual try-on methods struggle to model occlusion relationships among multiple clothing layers, often resulting in unnatural layering effects. This work proposes the first framework specifically designed for multi-layer virtual try-on, explicitly modeling inter-layer occlusions and achieving realistic garment fitting through a novel occlusion learning module and a Stable Diffusionโ€“based deformation alignment mechanism. The study introduces the first multi-layer garment dataset, MLG, and a new evaluation metric, Layered Appearance Consistency Distance (LACD). Extensive experiments on MLG demonstrate that the proposed method significantly outperforms existing approaches in both visual realism and layering consistency, establishing a new state of the art for multi-layer virtual try-on.

Technology Category

Application Category

๐Ÿ“ Abstract
Existing image-based virtual try-on (VTON) methods primarily focus on single-layer or multi-garment VTON, neglecting multi-layer VTON (ML-VTON), which involves dressing multiple layers of garments onto the human body with realistic deformation and layering to generate visually plausible outcomes. The main challenge lies in accurately modeling occlusion relationships between inner and outer garments to reduce interference from redundant inner garment features. To address this, we propose GO-MLVTON, the first multi-layer VTON method, introducing the Garment Occlusion Learning module to learn occlusion relationships and the StableDiffusion-based Garment Morphing&Fitting module to deform and fit garments onto the human body, producing high-quality multi-layer try-on results. Additionally, we present the MLG dataset for this task and propose a new metric named Layered Appearance Coherence Difference (LACD) for evaluation. Extensive experiments demonstrate the state-of-the-art performance of GO-MLVTON. Project page: https://upyuyang.github.io/go-mlvton/.
Problem

Research questions and friction points this paper is trying to address.

multi-layer virtual try-on
garment occlusion
image-based virtual try-on
layered clothing
occlusion modeling
Innovation

Methods, ideas, or system contributions that make the work stand out.

multi-layer virtual try-on
garment occlusion
diffusion models
virtual try-on dataset
appearance coherence
๐Ÿ”Ž Similar Papers
No similar papers found.
Y
Yang Yu
Huazhong University of Science and Technology, Wuhan, China
Y
Yunze Deng
Huazhong University of Science and Technology, Wuhan, China
Y
Yige Zhang
Huazhong University of Science and Technology, Wuhan, China
Y
Yanjie Xiao
Huazhong University of Science and Technology, Wuhan, China
Y
Youkun Ou
Huazhong University of Science and Technology, Wuhan, China
W
Wenhao Hu
Huazhong University of Science and Technology, Wuhan, China
M
Mingchao Li
Huazhong University of Science and Technology, Wuhan, China
Bin Feng
Bin Feng
Huazhong University of Sci. & Tech.
Wenyu Liu
Wenyu Liu
Huazhong University of Science and Technology
Compuetr visionArtificial intelligence
D
Dandan Zheng
Ant Group, Beijing, China
Jingdong Chen
Jingdong Chen
Senior Staff Algorithm Engineer, Ant Group
Computer VisionMultimodal