🤖 AI Summary
Addressing key challenges in unsupervised video object segmentation (UVOS)—including difficulty in discovering the initial salient object, frequent omission of small objects, and temporal inconsistency induced by motion deformation—this paper proposes a two-stage post-processing framework. Methodologically, the first stage introduces a frequency-based criterion to refine salient object selection, mitigating uncertainty in initial proposals. The second stage employs a three-phase temporal mask completion mechanism that jointly integrates saliency detection, temporal ensemble, and region completion to explicitly enforce inter-frame consistency. Our core contribution lies in unifying frequency-aware modeling with structured temporal completion, substantially enhancing segmentation robustness under complex motion and for small objects. Evaluated on mainstream UVOS benchmarks, the method achieves state-of-the-art performance, outperforming existing approaches in both segmentation accuracy and temporal stability.
📝 Abstract
Predicting and tracking objects in real-world scenarios is a critical challenge in Video Object Segmentation (VOS) tasks. Unsupervised VOS (UVOS) has the additional challenge of finding an initial segmentation of salient objects, which affects the entire process and keeps a permanent uncertainty about the object proposals. Moreover, deformation and fast motion can lead to temporal inconsistencies. To address these problems, we propose Frequent Temporally Integrated Objects (FTIO), a post-processing framework with two key components. First, we introduce a combined criterion to improve object selection, mitigating failures common in UVOS--particularly when objects are small or structurally complex--by extracting frequently appearing salient objects. Second, we present a three-stage method to correct temporal inconsistencies by integrating missing object mask regions. Experimental results demonstrate that FTIO achieves state-of-the-art performance in multi-object UVOS. Code is available at: https://github.com/MohammadMohammadzadehKalati/FTIO