🤖 AI Summary
This paper addresses instance-level image retrieval: finding images that contain the same object as a query, despite significant variations in scale, pose, and appearance. We propose Patchify, a fine-tuning-free framework that partitions database images into structured local patches and performs cross-granularity matching between global query features and patch-level features, enabling spatially interpretable retrieval. We introduce LocScore, a novel localization-aware evaluation metric, and reveal the critical role of information preservation during feature compression; to this end, we integrate Product Quantization for efficient compression. Experiments demonstrate that Patchify consistently outperforms global-feature baselines across multiple benchmarks and backbone architectures, and significantly improves re-ranking accuracy. The method supports real-time retrieval over databases of up to ten million images, achieving a favorable trade-off among accuracy, spatial interpretability, and scalability.
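The cross-granularity matching idea above can be sketched in a few lines: pool a database image's feature map into a small grid of patch descriptors, then score the image by the best cosine similarity between the global query descriptor and any patch. This is a minimal illustration, not the paper's implementation; the grid pooling, the helper names, and the max-similarity scoring rule are assumptions for exposition.

```python
import numpy as np

def patchify_descriptors(feat_map: np.ndarray, grid: int = 3) -> np.ndarray:
    """Pool an (H, W, D) feature map into grid*grid L2-normalized patch
    descriptors via average pooling over a regular grid.
    (Hypothetical helper: the paper's exact patch layout is not shown here.)"""
    H, W, D = feat_map.shape
    patches = []
    for i in range(grid):
        for j in range(grid):
            cell = feat_map[i * H // grid:(i + 1) * H // grid,
                            j * W // grid:(j + 1) * W // grid]
            v = cell.mean(axis=(0, 1))
            patches.append(v / (np.linalg.norm(v) + 1e-12))
    return np.stack(patches)  # shape: (grid*grid, D)

def patch_score(query_global: np.ndarray, patch_descs: np.ndarray) -> float:
    """Score one database image: best cosine similarity between the global
    query descriptor and any of the image's patch descriptors."""
    q = query_global / (np.linalg.norm(query_global) + 1e-12)
    return float((patch_descs @ q).max())
```

Because each patch descriptor carries its grid position, the arg-max patch also gives a coarse localization of the matched object, which is what a localization-aware metric like LocScore can evaluate.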
📝 Abstract
Instance-level image retrieval aims to find images containing the same object as a given query, despite variations in size, position, or appearance. To address this challenging task, we propose Patchify, a simple yet effective patch-wise retrieval framework that offers high performance, scalability, and interpretability without requiring fine-tuning. Patchify divides each database image into a small number of structured patches and performs retrieval by comparing these local features with a global query descriptor, enabling accurate and spatially grounded matching. To assess not just retrieval accuracy but also spatial correctness, we introduce LocScore, a localization-aware metric that quantifies whether the retrieved region aligns with the target object. This makes LocScore a valuable diagnostic tool for understanding and improving retrieval behavior. We conduct extensive experiments across multiple benchmarks, backbones, and region selection strategies, showing that Patchify outperforms global methods and complements state-of-the-art reranking pipelines. Furthermore, we apply Product Quantization for efficient large-scale retrieval and highlight the importance of using informative features during compression, which significantly boosts performance. Project website: https://wons20k.github.io/PatchwiseRetrieval/
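For scale, the abstract applies Product Quantization (PQ) to compress patch descriptors. A minimal from-scratch sketch of PQ is shown below: split each D-dimensional vector into m subvectors, learn a small k-means codebook per subspace, and store only the m codeword indices per vector. The function names and parameters here are illustrative assumptions; production systems typically use a library such as Faiss rather than this toy trainer.

```python
import numpy as np

def pq_train(X: np.ndarray, m: int = 4, k: int = 16, iters: int = 10, seed: int = 0):
    """Learn one k-word codebook per subspace with a few k-means iterations.
    X: (n, D) training vectors, D divisible by m. (Illustrative sketch.)"""
    rng = np.random.default_rng(seed)
    n, D = X.shape
    d = D // m
    codebooks = []
    for s in range(m):
        sub = X[:, s * d:(s + 1) * d]
        C = sub[rng.choice(n, size=k, replace=False)].copy()
        for _ in range(iters):
            assign = np.argmin(((sub[:, None] - C[None]) ** 2).sum(-1), axis=1)
            for c in range(k):
                pts = sub[assign == c]
                if len(pts):  # keep old centroid if a cluster goes empty
                    C[c] = pts.mean(axis=0)
        codebooks.append(C)
    return codebooks

def pq_encode(X: np.ndarray, codebooks) -> np.ndarray:
    """Replace each subvector by its nearest codeword index -> (n, m) codes."""
    d = codebooks[0].shape[1]
    codes = [np.argmin(((X[:, s * d:(s + 1) * d][:, None] - C[None]) ** 2).sum(-1), axis=1)
             for s, C in enumerate(codebooks)]
    return np.stack(codes, axis=1)

def pq_decode(codes: np.ndarray, codebooks) -> np.ndarray:
    """Approximate reconstruction by concatenating the looked-up codewords."""
    return np.concatenate([C[codes[:, s]] for s, C in enumerate(codebooks)], axis=1)
```

With m subspaces of k codewords each, a vector costs only m small integers instead of D floats, which is what makes ten-million-image databases tractable; the abstract's point is that the quality of the features fed into this compression step strongly affects final retrieval accuracy.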