A High-Dimensional Feature Selection Algorithm Based on Multiobjective Differential Evolution

📅 2025-05-09
📈 Citations: 0
Influential: 0
📄 PDF
🤖 AI Summary
To address the inefficiency of multi-objective feature selection in high-dimensional data caused by feature redundancy and interdependence, this paper proposes a novel Multi-Objective Differential Evolution for Feature Selection (MO-DEFS). MO-DEFS introduces a pioneering four-subpopulation initialization strategy that jointly incorporates feature weighting and redundancy assessment; it further designs a weight-guided mutation mechanism and an adaptive grid-based deduplication strategy to synergistically enhance solution-set diversity and convergence quality. Experimental evaluation on 11 UCI benchmark datasets demonstrates that MO-DEFS achieves a superior Pareto front in optimizing the conflicting objectives of minimizing feature count and classification error rate: the selected feature subsets are, on average, 12.7% more compact and yield 1.9% higher classification accuracy, while exhibiting significantly greater robustness than mainstream algorithms including NSGA-II and MOEA/D.

Technology Category

Application Category

📝 Abstract
Multiobjective feature selection seeks to determine the most discriminative feature subset by simultaneously optimizing two conflicting objectives: minimizing the number of selected features and the classification error rate. The goal is to enhance the model's predictive performance and computational efficiency. However, feature redundancy and interdependence in high-dimensional data present considerable obstacles to the search efficiency of optimization algorithms and the quality of the resulting solutions. To tackle these issues, we propose a high-dimensional feature selection algorithm based on multiobjective differential evolution. First, a population initialization strategy is designed by integrating feature weights and redundancy indices, where the population is divided into four subpopulations to improve the diversity and uniformity of the initial population. Then, a multiobjective selection mechanism is developed, in which feature weights guide the mutation process. The solution quality is further enhanced through nondominated sorting, with preference given to solutions with lower classification error, effectively balancing global exploration and local exploitation. Finally, an adaptive grid mechanism is applied in the objective space to identify densely populated regions and detect duplicated solutions. Experimental results on 11 UCI datasets of varying difficulty demonstrate that the proposed method significantly outperforms several state-of-the-art multiobjective feature selection approaches regarding feature selection performance.
Problem

Research questions and friction points this paper is trying to address.

Optimizing feature selection to minimize redundancy and classification errors
Enhancing algorithm efficiency in high-dimensional data feature selection
Balancing global exploration and local exploitation in multiobjective optimization
Innovation

Methods, ideas, or system contributions that make the work stand out.

Multiobjective differential evolution for feature selection
Population initialization with feature weights and redundancy
Adaptive grid mechanism for solution diversity
🔎 Similar Papers
No similar papers found.
Zhenxing Zhang
Zhenxing Zhang
School of computing, Dublin City University
machine learningcomputer visioninformation retrieval
Q
Qianxiang An
Department of Computer Science, Qufu Normal University, Rizhao, 276826, Shandong, China
Yilei Wang
Yilei Wang
Alibaba Cloud
Chenfeng Wu
Chenfeng Wu
Department of Information and Electrical Engineering, Ludong University, Yantai, 264025, Shandong, China
B
Baoling Dong
Department of Computer Science, Qufu Normal University, Rizhao, 276826, Shandong, China
C
Chunjie Zhou
Department of Information and Electrical Engineering, Ludong University, Yantai, 264025, Shandong, China