Open World Object Detection: A Survey

📅 2024-10-15
🏛️ IEEE transactions on circuits and systems for video technology (Print)
📈 Citations: 1
Influential: 0
📄 PDF
🤖 AI Summary
Open-World Object Detection (OWOD) aims to enable models to simultaneously detect unknown classes, mitigate catastrophic forgetting, and dynamically expand their knowledge boundaries while continuously learning novel categories. This paper presents the first comprehensive, panoramic survey of OWOD, establishing a unified problem formulation, evaluation protocol, and benchmark dataset suite, while rigorously delineating its theoretical distinctions from open-set recognition and incremental learning. We systematically analyze over 100 state-of-the-art works, covering paradigms including class-incremental OWOD, pseudo-labeling, and memory replay, as well as architectural adaptations for detectors such as Faster R-CNN and DETR. Furthermore, we release the first unified codebase and benchmarking platform. Quantitative evaluation reveals persistent bottlenecks across methods in unknown-class detection recall, known-class retention stability, and cross-stage generalization performance—providing a standardized foundation and principled guidance for future research.

Technology Category

Application Category

📝 Abstract
Exploring new knowledge is a fundamental human ability that can be mirrored in the development of deep neural networks, especially in the field of object detection. Open world object detection (OWOD) is an emerging area of research that adapts this principle to explore new knowledge. It focuses on recognizing and learning from objects absent from initial training sets, thereby incrementally expanding its knowledge base when new class labels are introduced. This survey paper offers a thorough review of the OWOD domain, covering essential aspects, including problem definitions, benchmark datasets, source codes, evaluation metrics, and a comparative study of existing methods. Additionally, we investigate related areas like open set recognition (OSR) and incremental learning (IL), underlining their relevance to OWOD. Finally, the paper concludes by addressing the limitations and challenges faced by current OWOD algorithms and proposes directions for future research. To our knowledge, this is the first comprehensive survey of the emerging OWOD field with over one hundred references, marking a significant step forward for object detection technology. A comprehensive source code and benchmarks are archived and concluded at https://github.com/ArminLee/OWOD_Review.
Problem

Research questions and friction points this paper is trying to address.

Recognizing and learning from unseen object classes
Incrementally expanding knowledge with new class labels
Reviewing benchmarks and methods for open world detection
Innovation

Methods, ideas, or system contributions that make the work stand out.

Recognizes and learns from unknown objects
Incrementally expands knowledge with new classes
Integrates open set recognition and incremental learning
🔎 Similar Papers
No similar papers found.
Y
Yiming Li
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore
Y
Yi Wang
Department of Electrical and Electronic Engineering, the Hong Kong Polytechnic University, Hong Kong SAR
Wenqian Wang
Wenqian Wang
Singapore University of Technology and Design; Nanyang Technology University; Shandong University
Computer VisionAnomaly DetectionAction RecognitionAction Prediction
Dan Lin
Dan Lin
NANYANG TECHNOLOGICAL UNIVERSITY (NTU)
Data Mining and Machine LearningComputer Vision
B
Bingbing Li
Continental Automotive Singapore Pte. Ltd.
K
Kim-Hui Yap
School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore