Partial Weakly-Supervised Oriented Object Detection

📅 2025-07-03
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the high annotation cost of oriented object detection (OOD) under weak supervision, this paper proposes PWOOD, a semi-weakly supervised framework that leverages only partial weak annotations—such as axis-aligned bounding boxes or single points—together with abundant unlabeled data. Methodologically, PWOOD introduces an OS-Student model that explicitly encodes orientation and scale information, and incorporates a class-agnostic pseudo-label filtering (CPF) strategy to eliminate reliance on fixed confidence thresholds. It adopts a student–teacher paradigm integrating consistency regularization and dynamic pseudo-label refinement. On DOTA and DIOR benchmarks, PWOOD achieves performance comparable to or surpassing state-of-the-art semi-supervised methods, while requiring significantly less annotation effort than existing weakly supervised approaches. To our knowledge, PWOOD is the first work to systematically establish a partial weak supervision paradigm for OOD, bridging the gap between semi-supervised learning and practical annotation constraints in remote sensing and aerial imagery analysis.

Technology Category

Application Category

📝 Abstract
The growing demand for oriented object detection (OOD) across various domains has driven significant research in this area. However, the high cost of dataset annotation remains a major concern. Current mainstream OOD algorithms can be mainly categorized into three types: (1) fully supervised methods using complete oriented bounding box (OBB) annotations, (2) semi-supervised methods using partial OBB annotations, and (3) weakly supervised methods using weak annotations such as horizontal boxes or points. However, these algorithms inevitably increase the cost of models in terms of annotation speed or annotation cost. To address this issue, we propose:(1) the first Partial Weakly-Supervised Oriented Object Detection (PWOOD) framework based on partially weak annotations (horizontal boxes or single points), which can efficiently leverage large amounts of unlabeled data, significantly outperforming weakly supervised algorithms trained with partially weak annotations, also offers a lower cost solution; (2) Orientation-and-Scale-aware Student (OS-Student) model capable of learning orientation and scale information with only a small amount of orientation-agnostic or scale-agnostic weak annotations; and (3) Class-Agnostic Pseudo-Label Filtering strategy (CPF) to reduce the model's sensitivity to static filtering thresholds. Comprehensive experiments on DOTA-v1.0/v1.5/v2.0 and DIOR datasets demonstrate that our PWOOD framework performs comparably to, or even surpasses, traditional semi-supervised algorithms.
Problem

Research questions and friction points this paper is trying to address.

Reducing annotation cost in oriented object detection
Leveraging weak annotations for efficient model training
Improving detection accuracy with partial weak supervision
Innovation

Methods, ideas, or system contributions that make the work stand out.

Partial Weakly-Supervised OOD framework
OS-Student model for weak annotations
Class-Agnostic Pseudo-Label Filtering strategy
M
Mingxin Liu
Shanghai Jiao Tong University
P
Peiyuan Zhang
Wuhan University
Y
Yuan Liu
Shanghai Jiao Tong University
W
Wei Zhang
Shanghai Jiao Tong University
Y
Yue Zhou
East China Normal University
Ning Liao
Ning Liao
Shanghai Jiao Tong University
LLMMLLMMoE
Ziyang Gong
Ziyang Gong
SJTU, THU, Shanghai AI Lab (OpenGVLab), SYSU
Embodied Spatial Intelligence
Junwei Luo
Junwei Luo
Wuhan University
Vision-Language ModelOriented Object DetectionRemote Sensing
Zhirui Wang
Zhirui Wang
Aerospace Information Research Institute, Chinese Academy of Sciences
Remote sensing image interpretationtarget detectiontarget recognition
Y
Yi Yu
Southeast University
X
Xue Yang
Shanghai Jiao Tong University