🤖 AI Summary
Hyperparameter optimization (HPO) for deep learning suffers from high computational cost and inefficient search, particularly in high-dimensional spaces where Bayesian optimization remains limited. To address this, we propose a group-sequential HPO framework grounded in hyperparameter importance assessment (HIA), introducing a novel "Sequential Grouping" strategy. The approach ranks hyperparameters by prior importance weights and optimizes them in stages: the hyperparameter space is partitioned into subsets, and HIA, Bayesian optimization, and sequential search are combined in a single pipeline. The method significantly improves search efficiency without compromising model performance. Experiments across six additional image classification datasets demonstrate an average 31.9% reduction in optimization time while preserving classification accuracy. Our work establishes a new paradigm for efficient, interpretable, and resource-conscious hyperparameter tuning in deep learning.
📝 Abstract
Hyperparameter optimization (HPO) is a critical component of machine learning pipelines, significantly affecting model robustness, stability, and generalization. However, HPO is often time-consuming and computationally intensive. Traditional methods such as grid search and random search suffer from inefficiency, and Bayesian optimization, while more sample-efficient, still struggles in high-dimensional search spaces. In this paper, we explore how insights gained from hyperparameter importance assessment (HIA) can be leveraged to accelerate HPO, reducing both time and computational resources. Building on prior work that quantified the importance weights of 10 hyperparameters by evaluating their influence on CNN performance across 10 common image classification datasets, we implement a novel HPO strategy called 'Sequential Grouping.' Our experiments, validated across six additional image classification datasets, demonstrate that incorporating HIA can significantly accelerate HPO without compromising model performance, reducing optimization time by an average of 31.9% compared to the conventional simultaneous strategy.
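To make the staged idea concrete, below is a minimal sketch of how a group-sequential search could be organized: hyperparameters are ranked by prior importance weights, split into small groups, and each group is tuned in turn while previously tuned values are frozen. It uses Optuna's default TPE sampler as the Bayesian-style optimizer; the importance weights, group size of two, default values, search ranges, and the `train_and_evaluate` stand-in are all illustrative assumptions, not values or code from the paper.

```python
# Illustrative sketch of a "Sequential Grouping" HPO loop (not the paper's implementation).
import optuna

# Hypothetical importance weights from a prior HIA study (placeholder values).
importance = {
    "learning_rate": 0.40,
    "batch_size": 0.25,
    "weight_decay": 0.15,
    "dropout": 0.10,
    "momentum": 0.10,
}

# Placeholder search ranges and default values for hyperparameters not yet tuned.
search_space = {
    "learning_rate": ("float_log", 1e-5, 1e-1),
    "batch_size": ("categorical", [32, 64, 128, 256]),
    "weight_decay": ("float_log", 1e-6, 1e-2),
    "dropout": ("float", 0.0, 0.5),
    "momentum": ("float", 0.5, 0.99),
}
defaults = {"learning_rate": 1e-3, "batch_size": 128,
            "weight_decay": 1e-4, "dropout": 0.1, "momentum": 0.9}

def train_and_evaluate(config):
    """Stand-in for training a CNN and returning validation accuracy."""
    # Replace with a real training/validation loop; this toy objective only
    # rewards learning rates near 1e-2 so the sketch runs end to end.
    return -((config["learning_rate"] - 1e-2) ** 2)

def suggest(trial, name):
    """Draw a value for one hyperparameter from its declared range."""
    kind, *args = search_space[name]
    if kind == "float_log":
        return trial.suggest_float(name, args[0], args[1], log=True)
    if kind == "float":
        return trial.suggest_float(name, args[0], args[1])
    return trial.suggest_categorical(name, args[0])

# Rank hyperparameters by importance and split into groups (here: two per group).
ranked = sorted(importance, key=importance.get, reverse=True)
groups = [ranked[i:i + 2] for i in range(0, len(ranked), 2)]

fixed = dict(defaults)
for group in groups:
    def objective(trial, group=group):
        config = dict(fixed)
        for name in group:              # only the current group is searched
            config[name] = suggest(trial, name)
        return train_and_evaluate(config)

    study = optuna.create_study(direction="maximize")
    study.optimize(objective, n_trials=20)
    fixed.update(study.best_params)     # freeze tuned values for later stages

print("Final configuration:", fixed)
```

Because each stage searches only a low-dimensional subset, the surrogate model has far fewer dimensions to cover per stage, which is the intuition behind the reported reduction in optimization time.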