BridgeDiff: Bridging Human Observations and Flat-Garment Synthesis for Virtual Try-Off

πŸ“… 2026-03-10
πŸ“ˆ Citations: 0
✨ Influential: 0
πŸ“„ PDF
πŸ€– AI Summary
This work addresses the challenges of inconsistent details and structural instability in recovering standardized flat-layout garments from dressed human images in virtual try-on, which arise from neglecting the discrepancy between on-body appearance and flat garment geometry. To tackle this, we propose a novel diffusion-based framework that incorporates a Garment Condition Bridging Module (GCBM) to model global appearance semantics and a Flat-layout Structural Constraint Module (FSCM) equipped with a planar-constrained attention mechanism to inject explicit geometric priors during the diffusion denoising process. Experimental results on the VTOFF benchmark demonstrate that our method significantly outperforms existing approaches, achieving state-of-the-art performance in both structural integrity and detail consistency of reconstructed flat garments.

Technology Category

Application Category

πŸ“ Abstract
Virtual try-off (VTOFF) aims to recover canonical flat-garment representations from images of dressed persons for standardized display and downstream virtual try-on. Prior methods often treat VTOFF as direct image translation driven by local masks or text-only prompts, overlooking the gap between on-body appearances and flat layouts. This gap frequently leads to inconsistent completion in unobserved regions and unstable garment structure. We propose BridgeDiff, a diffusion-based framework that explicitly bridges human-centric observations and flat-garment synthesis through two complementary components. First, the Garment Condition Bridge Module (GCBM) builds a garment-cue representation that captures global appearance and semantic identity, enabling robust inference of continuous details under partial visibility. Second, the Flat Structure Constraint Module (FSCM) injects explicit flat-garment structural priors via Flat-Constraint Attention (FC-Attention) at selected denoising stages, improving structural stability beyond text-only conditioning. Extensive experiments on standard VTOFF benchmarks show that BridgeDiff achieves state-of-the-art performance, producing higher-quality flat-garment reconstructions while preserving fine-grained appearance and structural integrity.
Problem

Research questions and friction points this paper is trying to address.

Virtual try-off
Flat-garment synthesis
Garment structure
Image-to-garment translation
Appearance-structure gap
Innovation

Methods, ideas, or system contributions that make the work stand out.

BridgeDiff
Garment Condition Bridge Module
Flat Structure Constraint Module
diffusion model
virtual try-off
πŸ”Ž Similar Papers
No similar papers found.
Shuang Liu
Shuang Liu
China University of Geosciences (Wuhan)
Applied geophysicsGravity and magnetic methodsInversion.
A
Ao Yu
School of Integrated Circuits, Anhui University, Hefei, China
L
Linkang Cheng
School of Integrated Circuits, Anhui University, Hefei, China
X
Xiwen Huang
School of Integrated Circuits, Anhui University, Hefei, China
L
Li Zhao
School of Integrated Circuits, Anhui University, Hefei, China
J
Junhui Liu
School of Astronautics, Northwestern Polytechnical University, Xi’an, China
Z
Zhiting Lin
School of Integrated Circuits, Anhui University, Hefei, China; Anhui Provincial High-performance Integrated Circuit Engineering Research Center
Yu Liu
Yu Liu
Professor, Hefei University of Technology
Image FusionImage RestorationMedical Image ProcessingComputer Vision