AUTOMATED DIAGNOSIS OF LUNG DISEASES USING VISION TRANSFORMER: A COMPARATIVE STUDY ON CHEST X-RAY CLASSIFICATION

📅 2024-10-10
🏛️ Journal of Population Therapeutics and Clinical Pharmacology
📈 Citations: 0
Influential: 0
🤖 AI Summary
This study addresses the heavy reliance on manual interpretation when diagnosing lung diseases (e.g., pneumonia and lung opacity) from chest X-rays, a particular concern in children and elderly patients. It proposes an automated Vision Transformer (ViT)-based classifier covering three classes: normal, lung opacity, and viral pneumonia. ViT and Swin Transformer are compared against five deep learning baselines (CNN, ResNet50, DenseNet, CheXNet, and U-Net) on a limited-scale dataset of 3,475 chest X-ray images. ViT reaches 99% accuracy for binary classification (normal vs. viral pneumonia) and 95.25% for three-class classification. The results suggest that ViT is well suited to fine-grained chest X-ray classification and that automated classification can reduce reliance on human interpretation.

📝 Abstract
Background: Lung disease is a significant health issue, particularly in children and elderly individuals. It often results from lung infections and is one of the leading causes of mortality in children. Globally, lung-related diseases claim many lives each year, making early and accurate diagnosis crucial. Radiographs are valuable tools for diagnosing such conditions. The most prevalent lung diseases, including pneumonia, asthma, allergies, chronic obstructive pulmonary disease (COPD), bronchitis, emphysema, and lung cancer, represent significant public health challenges. Early prediction of these conditions is critical, as it allows risk factors to be identified and preventive measures to be implemented to reduce the likelihood of disease onset.
Methods: In this study, we used a dataset of 3,475 chest X-ray images sourced from Mendeley Data, provided by Talukder, M. A. (2023) [14], and categorized into three classes: normal, lung opacity, and pneumonia. To classify these images, we applied five pre-trained deep learning models (CNN, ResNet50, DenseNet, CheXNet, and U-Net) and two Transformer-based transfer learning models, Vision Transformer (ViT) and Shifted Window (Swin) Transformer. This approach aims to address diagnostic issues in lung abnormalities by reducing reliance on human intervention through automated classification systems. Our analysis was conducted in both binary and multiclass settings: the binary setting distinguished between normal and viral pneumonia cases, whereas the multiclass setting included all three classes (normal, lung opacity, and viral pneumonia).
Results: Our proposed methodology (ViT) achieved accuracy rates of 99% for binary classification and 95.25% for multiclass classification.
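The binary and multiclass settings described in the abstract can be derived from the same labelled image set. The following is a minimal sketch of that idea in plain Python, not the authors' code: the class identifiers, the helper names (to_binary_subset, accuracy, per_class_recall), and the sample data are all hypothetical and chosen only for illustration. Accuracy is computed as the fraction of correct predictions, which is the metric the abstract reports.

```python
from collections import Counter

# Class names follow the abstract; the identifiers themselves are assumptions.
CLASSES = ("normal", "lung_opacity", "viral_pneumonia")


def to_binary_subset(samples):
    """Keep only normal and viral-pneumonia samples (the binary setting)."""
    keep = {"normal", "viral_pneumonia"}
    return [(image_id, label) for image_id, label in samples if label in keep]


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("label lists must have the same length")
    if not y_true:
        raise ValueError("cannot score an empty label list")
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def per_class_recall(y_true, y_pred, classes=CLASSES):
    """Recall for each class; None when a class never occurs in y_true."""
    totals = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    return {c: (hits[c] / totals[c] if totals[c] else None) for c in classes}


if __name__ == "__main__":
    # Hypothetical (image_id, label) pairs; these are not data from the paper.
    samples = [
        ("img0", "normal"),
        ("img1", "lung_opacity"),
        ("img2", "viral_pneumonia"),
        ("img3", "normal"),
    ]
    binary = to_binary_subset(samples)
    print(len(samples), "multiclass samples,", len(binary), "binary samples")
```

Reporting per-class recall alongside overall accuracy is a common precaution when class sizes differ, since a high overall accuracy can hide weak performance on a smaller class. Whether the paper reports such a breakdown is not stated in the abstract.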
Problem

Research questions and friction points this paper is trying to address.

Automated classification of lung diseases using chest X-rays
Comparative study of deep learning models for diagnosis
Early detection of pneumonia and lung opacity via ViT
Innovation

Methods, ideas, or system contributions that make the work stand out.

Used Vision Transformer for lung disease diagnosis
Compared multiple deep learning models on X-rays
Achieved high accuracy in binary and multiclass classification
Muhammad Ahmad
King Fahd University of Petroleum and Minerals
Machine Learning, Computer Vision, Hyperspectral Imaging
Sardar Usman
Department of Computer Science, Grand Asian University of Sialkot, Pakistan
Ildar Batyrshin
Instituto Politecnico Nacional
Muhammad Muzammil
Department of Computer Science, The Islamia University of Bahawalpur, Pakistan
K Sajid
College of Computer Science and Technology, Zhejiang Normal University, Jinhua (321004), China
M. Hasnain
Department of Computer Science, Leads University Lahore
Muhammad Jalal
Department of Computer Science, The Islamia University of Bahawalpur, Pakistan
Grigori Sidorov
Professor of Computational Linguistics, Instituto Politécnico Nacional (IPN), Mexico
Computational Linguistics, Natural Language Processing, Artificial Intelligence, Machine Learning