Thinking inside the Convolution for Image Inpainting: Reconstructing Texture via Structure under Global and Local Side

๐Ÿ“… 2026-02-03
๐Ÿ“ˆ Citations: 0
โœจ Influential: 0
๐Ÿ“„ PDF
๐Ÿค– AI Summary
Existing image inpainting methods often suffer from the loss of structural and textural details during convolutional downsampling, which limits reconstruction quality. This work presents the first systematic investigation into the complementary relationship between structural and textural features in the downsampling stage and introduces a mutual guidance mechanism based on statistical normalization and denormalization. Integrated within an encoderโ€“decoder framework, this approach enables effective multi-scale feature reconstruction by preserving and recovering critical details. The proposed method consistently outperforms current state-of-the-art techniques on both 256ร—256 and 512ร—512 resolution images. Furthermore, replacing the standard encoder with the proposed module yields significant performance gains, demonstrating its effectiveness and generalizability.

Technology Category

Application Category

๐Ÿ“ Abstract
Image inpainting has earned substantial progress, owing to the encoder-and-decoder pipeline, which is benefited from the Convolutional Neural Networks (CNNs) with convolutional downsampling to inpaint the masked regions semantically from the known regions within the encoder, coupled with an upsampling process from the decoder for final inpainting output. Recent studies intuitively identify the high-frequency structure and low-frequency texture to be extracted by CNNs from the encoder, and subsequently for a desirable upsampling recovery. However, the existing arts inevitably overlook the information loss for both structure and texture feature maps during the convolutional downsampling process, hence suffer from a non-ideal upsampling output. In this paper, we systematically answer whether and how the structure and texture feature map can mutually help to alleviate the information loss during the convolutional downsampling. Given the structure and texture feature maps, we adopt the statistical normalization and denormalization strategy for the reconstruction guidance during the convolutional downsampling process. The extensive experimental results validate its advantages to the state-of-the-arts over the images from low-to-high resolutions including 256*256 and 512*512, especially holds by substituting all the encoders by ours. Our code is available at https://github.com/htyjers/ConvInpaint-TSGL
Problem

Research questions and friction points this paper is trying to address.

image inpainting
convolutional downsampling
structure and texture
information loss
feature reconstruction
Innovation

Methods, ideas, or system contributions that make the work stand out.

image inpainting
convolutional downsampling
structure-texture interaction
statistical normalization
feature reconstruction
๐Ÿ”Ž Similar Papers
No similar papers found.
Haipeng Liu
Haipeng Liu
Hefei University of Technology
Computer Vision
Y
Yang Wang
School of Computer Science and Information Engineering, Hefei University of Technology, China
Biao Qian
Biao Qian
Tsinghua University
Knowledge distillationGenerative modelModel compressionComputer vision
Y
Yong Rui
Lenovo Research, China
M
Meng Wang
School of Computer Science and Information Engineering, Hefei University of Technology, China