Abstract
Feature selection in data exploration and analysis problems for multiclass data sets seeks to select a subset of variables or features that define the data to acquire the most appropriate and concise representation of the available information. This issue can be solved using feature selection, which removes extra and irrelevant data. Computation time, training accuracy, and understanding of training models and resources are all decreased as a result. The previous method Enhanced Cuckoo Search with Ant Colony Optimization algorithm (ECS–ACO), shows slow search speed and low collection accuracy of data features. The improved Binary Grey Wolf algorithm (EBGWO–ACO), ant colony-based optimization, which is the basis of the proposed feature selection method for multi-class datasets, is used to determine the minimal feature set selection. Data is gathered from the standard repository in the first step and start with data pre-processing is based on a filtering process to reduce the missing values or irrelevant values from the dataset. Then, cooperative features are selected from the clustering method for the grouping feature dataset. To identify the cluster of different data points into group or dataset values of help to similar data features. Third step is feature selection is based on the Enhanced Binary Grey Wolf algorithm with Ant Colony Optimization (EBGWO–ACO) is for using analysis the minimum subset of samples and the grey wolf algorithm is used to identify the best features with ACO is based on the similar feature location. By using k5-fold validation against the optimal classifier model, the KNN classification accuracies of the investigated approaches were confirmed, and in terms of classification success, the classification. The findings validate that when the number of characteristics is decreased, the approach is appropriate and classically performs improved.
Similar content being viewed by others
Data Availability
The dataset produced and scrutinized in this study are accessible from the corresponding author upon reasonable request.
References
Qureshi MNI, Min B, Park H-J, Cho D, Choi W, Lee B. Multiclass classification of word imagination speech with hybrid connectivity features. IEEE Trans Biomed Eng. 2018;65(10):2168–77. https://doi.org/10.1109/TBME.2017.2786251.
Li Y, Pan Y, Liu Z. Multiclass nonnegative matrix factorization for comprehensive feature pattern discovery. IEEE Trans Neural Netw Learn Syst. 2019;30(2):615–29. https://doi.org/10.1109/TNNLS.2018.2849932.
Pei Z, Wang H, Bezerianos A, Li J. IEEG-based multiclass workload identification using feature fusion and selection. IEEE Trans Instrum Meas. 2021;70:1–8. https://doi.org/10.1109/TIM.2020.3019849. (Art no. 4001108).
Wang P, Xue B, Liang J, Zhang M. Differential evolution-based feature selection: a niching-based multiobjective approach. IEEE Trans Evol Comput. 2023;27(2):296–310. https://doi.org/10.1109/TEVC.2022.3168052.
Kuang H, Chen L, Chan LLH, Cheung RCC, Yan H. Feature selection based on tensor decomposition and object proposal for night-time multiclass vehicle detection. IEEE Trans Syst Man Cybern Syst. 2019;49(1):71–80. https://doi.org/10.1109/TSMC.2018.2872891.
Bruzzone L, Roli F, Serpico SB. An extension of the Jeffreys–Matusita distance to multiclass cases for feature selection. IEEE Trans Geosci Remote Sens. 1995;33(6):1318–21. https://doi.org/10.1109/36.477187.
Zhu X, Suk H-I, Lee S-W, Shen D. Subspace regularized sparse multitask learning for multiclass neurodegenerative disease identification. IEEE Trans Biomed Eng. 2016;63(3):607–18. https://doi.org/10.1109/TBME.2015.2466616.
Fan M, Zhang X, Hu J, Gu N, Tao D. Adaptive data structure regularized multiclass discriminative feature selection. IEEE Trans Neural Netw Learn Syst. 2022;33(10):5859–72. https://doi.org/10.1109/TNNLS.2021.3071603.
Xu J, Han J, Nie F, Li X. Multi-view scaling support vector machines for classification and feature selection. IEEE Trans Knowl Data Eng. 2020;32(7):1419–30. https://doi.org/10.1109/TKDE.2019.2904256.
Kalakoti R, Nõmm S, Bahsi H. In-depth feature selection for the statistical machine learning-based botnet detection in IoT networks. IEEE Access. 2022;10:94518–35. https://doi.org/10.1109/ACCESS.2022.3204001.
Vidyarthi A, Agarwal R, Gupta D, Sharma R, Draheim D, Tiwari P. Machine learning assisted methodology for multiclass classification of malignant brain tumors. IEEE Access. 2022;10:50624–40. https://doi.org/10.1109/ACCESS.2022.3172303.
Peng, Wu X, Yuan W, Zhang X, Zhang Y, Li Y. MGRFE: multilayer recursive feature elimination based on an embedded genetic algorithm for cancer classification. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(2):621–32. https://doi.org/10.1109/TCBB.2019.2921961.
Wu J, Guo P, Cheng Y, Zhu H, Wang X-B, Shao X. Ensemble generalized multiclass support-vector-machine-based health evaluation of complex degradation systems. IEEE/ASME Trans Mechatron. 2020;25(5):2230–40. https://doi.org/10.1109/TMECH.2020.3009449.
Bakro M, et al. An improved design for a cloud intrusion detection system using hybrid features selection approach with ML classifier. IEEE Access. 2023;11:64228–47. https://doi.org/10.1109/ACCESS.2023.3289405.
Nahiduzzaman M, Islam MR, Islam SMR, Goni MOF, Anower MS, Kwak K-S. Hybrid CNN-SVD based prominent feature extraction and selection for grading diabetic retinopathy using extreme learning machine algorithm. IEEE Access. 2021;9:152261–74. https://doi.org/10.1109/ACCESS.2021.3125791.
Jung D. Distributed feature selection for multi-class classification using ADMM. IEEE Control Syst Lett. 2021;5(3):821–6. https://doi.org/10.1109/LCSYS.2020.3006428.
Abramovich F, Grinshtein V, Levy T. Multiclass classification by sparse multinomial logistic regression. IEEE Trans Inf Theory. 2021;67(7):4637–46. https://doi.org/10.1109/TIT.2021.3075137.
Deepa N, Khan MZ, Prabadevi B, Vincent P M DR, Maddikunta PKR, Gadekallu TR. Multiclass model for agriculture development using multivariate statistical method. IEEE Access. 2020;8:183749–58. https://doi.org/10.1109/ACCESS.2020.3028595.
Tsai C-F, Lin W-C. Feature selection and ensemble learning techniques in one-class classifiers: an empirical study of two-class imbalanced datasets. IEEE Access. 2021;9:13717–26. https://doi.org/10.1109/ACCESS.2021.3051969.
Bradde T, Fracastoro G, Calafiore GC. Multiclass sparse centroids with application to fast time series classification. IEEE Trans Neural Netw Learn Syst. 2023;34(8):5206–11. https://doi.org/10.1109/TNNLS.2021.3124300.
Rafiuddin N, Khan YU, Farooq O. A novel wavelet approach for multiclass iEEG signal classification in automated diagnosis of epilepsy. IEEE Trans Instrum Meas. 2022;71:1–10. https://doi.org/10.1109/TIM.2022.3207799. (Art no. 4009010).
Kumar A, Kaur A, Singh P, Driss M, Boulila W. Efficient multiclass classification using feature selection in high-dimensional datasets. Electronics. 2023;12:2290. https://doi.org/10.3390/electronics12102290.
Agarwal R, Shekhawat NS, Kumar S, Nayyar A, Qureshi B. Improved feature selection method for the identification of soil images using Oscillating Spider Monkey Optimization. IEEE Access. 2021;9:167128–39.
Wang Z, Wang C, Wei J, Liu J. Multi-class feature selection by exploring reliable class correlation. Knowl-Based Syst. 2021;230: 107377. https://doi.org/10.1016/j.knosys.2021.107377.
Benkessirat, Benblidia N. Fundamentals of feature selection: an overview and comparison. In: 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA), Abu Dhabi, United Arab Emirates, 2019, p. 1–6. https://doi.org/10.1109/AICCSA47632.2019.9035281.
Wang Y, Feng L. A new hybrid feature selection based on multi-filter weights and multi-feature weights. Appl Intell. 2019;49:4033–57. https://doi.org/10.1007/s10489-019-01470-z.
Cateni, Sivia & Colla, Valentina & Vannucci, Marco. A hybrid feature selection method for classification purposes. In: Proceedings—UKSim-AMSS 8th European Modelling Symposium on Computer Modelling and Simulation, EMS 2014. 2014. https://doi.org/10.1109/EMS.2014.44.
Gong L, Xie S, Zhang Y, Wang M, Wang X. Hybrid feature selection method based on feature subset and factor analysis. IEEE Access. 2022;10:120792–803. https://doi.org/10.1109/ACCESS.2022.3222812.
Bashiri Mosavi SA. Applying cross-permutation-based quad-hybrid feature selection algorithm on transient univariates to select optimal features for transient analysis. IEEE Access. 2022;10:41131–51. https://doi.org/10.1109/ACCESS.2022.3166917.
Thejas GS, Garg R, Iyengar SS, Sunitha NR, Badrinath P, Chennupati S. Metric and accuracy ranked feature inclusion: hybrids of filter and wrapper feature selection approaches. IEEE Access. 2021;9:128687–701. https://doi.org/10.1109/ACCESS.2021.3112169.
Acknowledgements
The authors acknowledged the Thanthai Periyar Government Arts & Science College (Autonomous), Affiliated to Bharathidasan University, Tiruchirappalli, Tamilnadu, India for supporting the research work by providing the facilities.
Funding
No funding involved in the research work.
Author information
Authors and Affiliations
Contributions
The research resulted from a collective effort, with all authors contributing collaboratively to its accomplishment.
Corresponding author
Ethics declarations
Conflict of interest
Authors have no conflict of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Selvi, R.S., Bibi, K.F. Feature Selection of Multi-class Data Sets Based on Enhanced Binary Gray Wolf Algorithm and Ant Colony Optimization Algorithm. SN COMPUT. SCI. 5, 1076 (2024). https://doi.org/10.1007/s42979-024-03402-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s42979-024-03402-2