FastInstShadow: A Simple Query-Based Model for Instance Shadow Detection

📅 2025-03-10
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the limitations of existing instance shadow detection methods—namely, separate detection of shadows and objects and inaccurate post-hoc pairing—this paper proposes the first query-driven end-to-end framework. Methodologically, we design a dual-path associative Transformer decoder that jointly models geometric and semantic relationships between shadows and objects in a single forward pass, enabling unified learning of detection and precise shadow-object association. Crucially, we introduce a learnable, end-to-end association mechanism, eliminating error-prone heuristic matching during post-processing. On the SOBA benchmark, our method achieves a 4.2% mAP improvement over prior state-of-the-art (e.g., SSISv2). Moreover, it supports real-time inference at medium resolution (1024×512) with 32 FPS, striking an effective balance between accuracy and efficiency.

Technology Category

Application Category

📝 Abstract
Instance shadow detection is the task of detecting pairs of shadows and objects, where existing methods first detect shadows and objects independently, then associate them. This paper introduces FastInstShadow, a method that enhances detection accuracy through a query-based architecture featuring an association transformer decoder with two dual-path transformer decoders to assess relationships between shadows and objects during detection. Experimental results using the SOBA dataset showed that the proposed method outperforms all existing methods across all criteria. This method makes real-time processing feasible for moderate-resolution images with better accuracy than SSISv2, the most accurate existing method. Our code is available at https://github.com/wlotkr/FastInstShadow.
Problem

Research questions and friction points this paper is trying to address.

Detects shadow-object pairs using query-based architecture.
Improves accuracy with dual-path transformer decoders.
Enables real-time processing for moderate-resolution images.
Innovation

Methods, ideas, or system contributions that make the work stand out.

Query-based architecture for shadow-object detection
Dual-path transformer decoders enhance accuracy
Real-time processing with improved detection accuracy
Takeru Inoue
Takeru Inoue
University of Yamanashi
Design and control of communication networks
R
Ryusuke Miyamoto
Department of Computer Science, School of Science and Technology, Meiji University, Japan