Enhancing Classification with Semi-Supervised Deep Learning Using Distance-Based Sample Weights

📅 2025-05-20
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
This paper addresses semi-supervised classification under three challenging conditions: scarce labeled data, label noise, and class imbalance. To this end, we propose a distance-driven dynamic sample weighting framework. Methodologically, we introduce— for the first time—the distance from each test sample to unlabeled samples as a principled weighting criterion, enabling test-oriented local similarity modeling. We further integrate uncertainty-aware consistency regularization with graph-structure-guided label propagation to jointly enhance model robustness and generalization. Extensive experiments across 12 benchmark datasets demonstrate significant improvements in accuracy, precision, and recall. Notably, our method substantially outperforms state-of-the-art approaches—including FixMatch, UDA, and Mean Teacher—under extremely low labeling rates (≤1%) and high noise levels (≥40%). These results empirically validate the effectiveness and broad applicability of our distance-based weighting mechanism.

Technology Category

Application Category

📝 Abstract
Recent advancements in semi-supervised deep learning have introduced effective strategies for leveraging both labeled and unlabeled data to improve classification performance. This work proposes a semi-supervised framework that utilizes a distance-based weighting mechanism to prioritize critical training samples based on their proximity to test data. By focusing on the most informative examples, the method enhances model generalization and robustness, particularly in challenging scenarios with noisy or imbalanced datasets. Building on techniques such as uncertainty consistency and graph-based representations, the approach addresses key challenges of limited labeled data while maintaining scalability. Experiments on twelve benchmark datasets demonstrate significant improvements across key metrics, including accuracy, precision, and recall, consistently outperforming existing methods. This framework provides a robust and practical solution for semi-supervised learning, with potential applications in domains such as healthcare and security where data limitations pose significant challenges.
Problem

Research questions and friction points this paper is trying to address.

Improves classification using distance-based sample weights
Addresses limited labeled data with semi-supervised learning
Enhances model robustness in noisy or imbalanced datasets
Innovation

Methods, ideas, or system contributions that make the work stand out.

Distance-based weighting for critical samples
Uncertainty consistency and graph representations
Improved accuracy with semi-supervised learning
🔎 Similar Papers
No similar papers found.
A
Aydin Abedinia
Dept. of Computer Science, Islamic Azad University, South-Tehran Branch, Tehran, Iran
S
Shima Tabakhi
School of Electrical, Computer and Biomedical Engineering, Southern Illinois University, Carbondale, United States
Vahid Seydi
Vahid Seydi
University of the West of England Bristol
Machine LearningArtificial IntelligenceData Science