MMCL: Correcting Content Query Distributions for Improved Anti-Overlapping X-Ray Object Detection

📅 2024-06-05
📈 Citations: 6
Influential: 0
📄 PDF
🤖 AI Summary
X-ray images suffer from feature entanglement due to depth superposition and object semi-transparency, severely hindering detection of small prohibited items. Existing DETR-based methods struggle with uneven content query distribution and inadequate representation of overlapping object hypotheses. To address these challenges, we propose a novel Multi-class Minimum-margin Contrastive Learning (MMCL) framework: (i) a multi-class repulsion loss to enhance inter-class separability; (ii) a minimum-margin clustering loss to improve intra-class diversity; and (iii) a self-supervised query distribution rectification mechanism. Evaluated on three major X-ray benchmarks—PIXray, OPIXray, and PIDray—MMCL achieves state-of-the-art performance, significantly improving detection accuracy in overlapping and semi-transparent scenarios, particularly boosting recall for small objects.

Technology Category

Application Category

📝 Abstract
Unlike natural images with occlusion-based overlap, X-ray images exhibit depth-induced superimposition and semi-transparent appearances, where objects at different depths overlap and their features blend together. These characteristics demand specialized mechanisms to disentangle mixed representations between target objects (e.g., prohibited items) and irrelevant backgrounds. While recent studies have explored adapting detection transformers (DETR) for anti-overlapping object detection, the importance of well-distributed content queries that represent object hypotheses remains underexplored. In this paper, we introduce a multi-class min-margin contrastive learning (MMCL) framework to correct the distribution of content queries, achieving balanced intra-class diversity and inter-class separability. The framework first groups content queries by object category and then applies two proposed complementary loss components: a multi-class exclusion loss to enhance inter-class separability, and a min-margin clustering loss to encourage intra-class diversity. We evaluate the proposed method on three widely used X-ray prohibited-item detection datasets, PIXray, OPIXray, and PIDray, using two backbone networks and four DETR variants. Experimental results demonstrate that MMCL effectively enhances anti-overlapping object detection and achieves state-of-the-art performance on both datasets. Code will be made publicly available on GitHub.
Problem

Research questions and friction points this paper is trying to address.

Correcting content query distributions for anti-overlapping X-ray object detection
Disentangling mixed representations between target objects and irrelevant backgrounds
Achieving balanced intra-class diversity and inter-class separability
Innovation

Methods, ideas, or system contributions that make the work stand out.

MMCL framework corrects content query distributions
Uses multi-class exclusion loss for inter-class separability
Applies min-margin clustering loss for intra-class diversity
M
Mingyuan Li
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
Tong Jia
Tong Jia
Peking University
AIOpsAnomaly DetectionLog AnalysisAI for Medical Research
Hui Lu
Hui Lu
Department of Computer Science and Engineering (CSE), the University of Texas at Arlington (UTA)
Cloud ComputingVirtualizationFile and Storage SystemsComputer NetworksComputer Systems
H
Hao Wang
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
Bowen Ma
Bowen Ma
Senior Research Associate, The University of Hong Kong; PhD, UT-Austin; BSc, USTC
Condensed Matter Theory
S
Shiyi Guo
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
S
Shuyang Lin
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
D
Dongyue Chen
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
H
Haoran Wang
College of Information Science and Engineering, Northeastern University, Shenyang, 110819, China
Baosheng Yu
Baosheng Yu
Assistant Professor, Nanyang Technological University
Machine LearningDeep LearningComputer VisionAI for Medicine