🤖 AI Summary
This work addresses the problem of estimating skew angles in document images by proposing an adaptive radial projection method based on the magnitude spectrum of the two-dimensional discrete Fourier transform. The core innovation lies in an adaptive radial projection strategy that robustly and accurately captures dominant directional features in the frequency domain. To enable systematic evaluation, the authors construct DISE-2021, a high-quality benchmark dataset. Experimental results demonstrate that the proposed method significantly outperforms existing approaches in both accuracy and robustness, thereby providing a reliable preprocessing step for downstream document analysis tasks.
📝 Abstract
Skew estimation is one of the vital tasks in document processing systems, especially for scanned document images, because its performance impacts subsequent steps directly. Over the years, an enormous number of researches focus on this challenging problem in the rise of digitization age. In this research, we first propose a novel skew estimation method that extracts the dominant skew angle of the given document image by applying an Adaptive Radial Projection on the 2D Discrete Fourier Magnitude spectrum. Second, we introduce a high quality skew estimation dataset DISE-2021 to assess the performance of different estimators. Finally, we provide comprehensive analyses that focus on multiple improvement aspects of Fourier-based methods. Our results show that the proposed method is robust, reliable, and outperforms all compared methods. The source code is available at github.com/phamquiluan/jdeskew.