🤖 AI Summary
Gastric cancer is highly prevalent worldwide and associated with poor prognosis (approximately 20% five-year survival rate); current histopathological diagnosis relies heavily on manual assessment, resulting in low efficiency and high inter-observer variability—highlighting an urgent need for lightweight, robust automated decision-support tools. This study systematically evaluates a fine-tuning-free feature engineering paradigm for binary classification of gastric cancer tissue histopathology images on the GasHisSDB dataset. We compare handcrafted features—including LBP, GLCM, and HOG—with deep features extracted from pre-trained VGG and AlexNet models, coupled with shallow classifiers such as Random Forest (RF) and SVM. To our knowledge, this is the first comprehensive investigation into the generalizability and robustness of diverse feature–classifier combinations in gastric cancer histopathology. RF with handcrafted features achieves an F1-score of 93.4%, significantly outperforming existing fine-tuning-free methods and demonstrating the clinical viability and practical utility of lightweight feature engineering for deployable辅助 diagnostic systems.
📝 Abstract
Gastric cancer ranks as the fifth most common and fourth most lethal cancer globally, with a dismal 5-year survival rate of approximately 20%. Despite extensive research on its pathobiology, the prognostic predictability remains inadequate, compounded by pathologists' high workload and potential diagnostic errors. Thus, automated, accurate histopathological diagnosis tools are crucial. This study employs Machine Learning and Deep Learning techniques to classify histopathological images into healthy and cancerous categories. Using handcrafted and deep features with shallow learning classifiers on the GasHisSDB dataset, we offer a comparative analysis and insights into the most robust and high-performing combinations of features and classifiers for distinguishing between normal and abnormal histopathological images without fine-tuning strategies. With the RF classifier, our approach can reach F1 of 93.4%, demonstrating its validity.