Predicting Region of Interest in Human Visual Search Based on Statistical Texture and Gabor Features

📅 2026-01-12
🏛️ arXiv.org
🤖 AI Summary
This study addresses the challenge of predicting early human fixation regions in visual search tasks where the target location is unknown. It proposes two feature-combination pipelines that integrate structure-oriented Gabor filter responses with statistical texture features derived from the gray-level co-occurrence matrix (GLCM). The pipelines are evaluated on simulated digital breast tomosynthesis images, where their fixation candidates agree qualitatively with those of a threshold-based model observer, and GLCM mean correlates strongly with Gabor feature responses (r=0.765). Eye-tracking data from human observers suggest that the predicted regions are consistent with early-stage gaze behavior. These findings point to the value of combining structural and texture-based features and support the development of perceptually informed observer models.

📝 Abstract
Understanding human visual search behavior is a fundamental problem in vision science and computer vision, with direct implications for modeling how observers allocate attention in location-unknown search tasks. In this study, we investigate the relationship between Gabor-based features and gray-level co-occurrence matrix (GLCM)–based texture features in modeling early-stage visual search behavior. Two feature-combination pipelines are proposed to integrate Gabor and GLCM features for narrowing the region of possible human fixations. The pipelines are evaluated using simulated digital breast tomosynthesis images. Results show qualitative agreement among fixation candidates predicted by the proposed pipelines and a threshold-based model observer. A strong correlation (r=0.765) is observed between GLCM mean and Gabor feature responses, indicating that these features encode related image information despite their different formulations. Eye-tracking data from human observers further suggest consistency between predicted fixation regions and early-stage gaze behavior. These findings highlight the value of combining structural and texture-based features for modeling visual search and support the development of perceptually informed observer models.
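The relationship between a Gabor response and the GLCM mean reported in the abstract can be illustrated with a small, self-contained sketch. This is not the authors' pipeline: the kernel parameters, the 8-level quantization, the horizontal one-pixel offset, the synthetic patches, and all function names (`gabor_kernel`, `glcm_mean`, `demo_correlation`, and so on) are assumptions chosen only for illustration, and the resulting correlation is not expected to match the reported r=0.765.

```python
import math
import random


def gabor_kernel(size=7, sigma=2.0, theta=0.0, wavelength=4.0):
    """Real (cosine-phase) Gabor kernel, returned as a list of rows."""
    half = size // 2
    kernel = []
    for y in range(-half, half + 1):
        row = []
        for x in range(-half, half + 1):
            # Rotate coordinates so the carrier wave runs along angle theta.
            xr = x * math.cos(theta) + y * math.sin(theta)
            yr = -x * math.sin(theta) + y * math.cos(theta)
            envelope = math.exp(-(xr * xr + yr * yr) / (2.0 * sigma * sigma))
            row.append(envelope * math.cos(2.0 * math.pi * xr / wavelength))
        kernel.append(row)
    return kernel


def gabor_response(patch, kernel):
    """Magnitude of the inner product between a patch and a same-sized kernel."""
    total = 0.0
    for patch_row, kernel_row in zip(patch, kernel):
        for p, k in zip(patch_row, kernel_row):
            total += p * k
    return abs(total)


def quantize(patch, levels=8):
    """Map intensities in [0, 1] to integer gray levels 0..levels-1."""
    return [[min(int(v * levels), levels - 1) for v in row] for row in patch]


def glcm_mean(gray, levels=8, dx=1, dy=0):
    """GLCM mean: sum over i, j of i * P(i, j) for one pixel offset (dx, dy)."""
    height, width = len(gray), len(gray[0])
    counts = [[0] * levels for _ in range(levels)]
    pairs = 0
    for y in range(height - dy):
        for x in range(width - dx):
            counts[gray[y][x]][gray[y + dy][x + dx]] += 1
            pairs += 1
    weighted = sum(i * counts[i][j] for i in range(levels) for j in range(levels))
    return weighted / pairs


def pearson(a, b):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / math.sqrt(var_a * var_b)


def demo_correlation(n_patches=50, size=7, seed=0):
    """Correlate the two features over synthetic patches (hypothetical data)."""
    rng = random.Random(seed)
    kernel = gabor_kernel(size=size)
    gabor_vals, glcm_vals = [], []
    for _ in range(n_patches):
        base = rng.uniform(0.1, 0.9)  # patch brightness, an assumption
        patch = [[min(max(base + rng.uniform(-0.1, 0.1), 0.0), 1.0)
                  for _ in range(size)] for _ in range(size)]
        gabor_vals.append(gabor_response(patch, kernel))
        glcm_vals.append(glcm_mean(quantize(patch)))
    return pearson(gabor_vals, glcm_vals)
```

Calling `demo_correlation()` returns a single Pearson coefficient for the synthetic patches. In practice one would likely compute both features over sliding windows of the actual image, and a library implementation (for example scikit-image) would be a more natural choice than this pure-Python version.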
Problem

Research questions and friction points this paper is trying to address.

visual search
region of interest
Gabor features
texture features
fixation prediction
Innovation

Methods, ideas, or system contributions that make the work stand out.

Gabor features
GLCM texture features
visual search modeling
fixation prediction
feature fusion
Hongwei Lin
Department of Biomedical Engineering, University of Houston, 3517 Cullen Blvd, Houston, TX 77204, USA
Diego Andrade
University of A Coruña
computer architecture, high performance computing
Mini Das
Department of Biomedical Engineering, University of Houston, 3517 Cullen Blvd, Houston, TX 77204, USA
Howard C. Gifford
Associate Professor of Biomedical Engineering, University of Houston