Approach to Finding a Robust Deep Learning Model

📅 2025-05-22
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
Unsupervised large-scale model training suffers from low prediction reliability and difficulty in robustness assessment. Method: This paper proposes a fully automated, task-agnostic framework for quantifying deep learning model robustness, accompanied by a meta-level model selection algorithm. Leveraging a lightweight CNN-FC hybrid architecture, we systematically investigate the coupled effects of sample size, weight initialization, and inductive bias on robustness. The meta-selection algorithm is agnostic to both model architecture and downstream tasks. Results: Empirical analysis reveals that initialization strategy and data scale are primary determinants of robustness. Our approach substantially improves prediction reliability and enables efficient, reproducible robustness ranking—even for lightweight models—thereby establishing a new paradigm for automated model evaluation and deployment.

Technology Category

Application Category

📝 Abstract
The rapid development of machine learning (ML) and artificial intelligence (AI) applications requires the training of large numbers of models. This growing demand highlights the importance of training models without human supervision, while ensuring that their predictions are reliable. In response to this need, we propose a novel approach for determining model robustness. This approach, supplemented with a proposed model selection algorithm designed as a meta-algorithm, is versatile and applicable to any machine learning model, provided that it is appropriate for the task at hand. This study demonstrates the application of our approach to evaluate the robustness of deep learning models. To this end, we study small models composed of a few convolutional and fully connected layers, using common optimizers due to their ease of interpretation and computational efficiency. Within this framework, we address the influence of training sample size, model weight initialization, and inductive bias on the robustness of deep learning models.
Problem

Research questions and friction points this paper is trying to address.

Ensuring reliable predictions in unsupervised deep learning models
Developing a versatile robustness evaluation approach for ML models
Assessing impact of training size and initialization on model robustness
Innovation

Methods, ideas, or system contributions that make the work stand out.

Meta-algorithm for model selection
Unsupervised robust model training
Evaluates robustness via sample size
🔎 Similar Papers
No similar papers found.
A
Alexey Boldyrev
Laboratory of Methods for Big Data Analysis, HSE University, 101000 Moscow, Russia
Fedor Ratnikov
Fedor Ratnikov
National Research University Higher School of Economics
High Energy PhysicsMachine Learning
A
Andrey Shevelev
Laboratory of Methods for Big Data Analysis, HSE University, 101000 Moscow, Russia