Skip to main content
Log in

Feature Selection of Multi-class Data Sets Based on Enhanced Binary Gray Wolf Algorithm and Ant Colony Optimization Algorithm

  • Original Research
  • Published:
SN Computer Science Aims and scope Submit manuscript

Abstract

Feature selection in data exploration and analysis problems for multiclass data sets seeks to select a subset of variables or features that define the data to acquire the most appropriate and concise representation of the available information. This issue can be solved using feature selection, which removes extra and irrelevant data. Computation time, training accuracy, and understanding of training models and resources are all decreased as a result. The previous method Enhanced Cuckoo Search with Ant Colony Optimization algorithm (ECS–ACO), shows slow search speed and low collection accuracy of data features. The improved Binary Grey Wolf algorithm (EBGWO–ACO), ant colony-based optimization, which is the basis of the proposed feature selection method for multi-class datasets, is used to determine the minimal feature set selection. Data is gathered from the standard repository in the first step and start with data pre-processing is based on a filtering process to reduce the missing values or irrelevant values from the dataset. Then, cooperative features are selected from the clustering method for the grouping feature dataset. To identify the cluster of different data points into group or dataset values of help to similar data features. Third step is feature selection is based on the Enhanced Binary Grey Wolf algorithm with Ant Colony Optimization (EBGWO–ACO) is for using analysis the minimum subset of samples and the grey wolf algorithm is used to identify the best features with ACO is based on the similar feature location. By using k5-fold validation against the optimal classifier model, the KNN classification accuracies of the investigated approaches were confirmed, and in terms of classification success, the classification. The findings validate that when the number of characteristics is decreased, the approach is appropriate and classically performs improved.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10

Similar content being viewed by others

Data Availability

The dataset produced and scrutinized in this study are accessible from the corresponding author upon reasonable request.

References

  1. Qureshi MNI, Min B, Park H-J, Cho D, Choi W, Lee B. Multiclass classification of word imagination speech with hybrid connectivity features. IEEE Trans Biomed Eng. 2018;65(10):2168–77. https://doi.org/10.1109/TBME.2017.2786251.

    Article  Google Scholar 

  2. Li Y, Pan Y, Liu Z. Multiclass nonnegative matrix factorization for comprehensive feature pattern discovery. IEEE Trans Neural Netw Learn Syst. 2019;30(2):615–29. https://doi.org/10.1109/TNNLS.2018.2849932.

    Article  MathSciNet  Google Scholar 

  3. Pei Z, Wang H, Bezerianos A, Li J. IEEG-based multiclass workload identification using feature fusion and selection. IEEE Trans Instrum Meas. 2021;70:1–8. https://doi.org/10.1109/TIM.2020.3019849. (Art no. 4001108).

    Article  Google Scholar 

  4. Wang P, Xue B, Liang J, Zhang M. Differential evolution-based feature selection: a niching-based multiobjective approach. IEEE Trans Evol Comput. 2023;27(2):296–310. https://doi.org/10.1109/TEVC.2022.3168052.

    Article  Google Scholar 

  5. Kuang H, Chen L, Chan LLH, Cheung RCC, Yan H. Feature selection based on tensor decomposition and object proposal for night-time multiclass vehicle detection. IEEE Trans Syst Man Cybern Syst. 2019;49(1):71–80. https://doi.org/10.1109/TSMC.2018.2872891.

    Article  Google Scholar 

  6. Bruzzone L, Roli F, Serpico SB. An extension of the Jeffreys–Matusita distance to multiclass cases for feature selection. IEEE Trans Geosci Remote Sens. 1995;33(6):1318–21. https://doi.org/10.1109/36.477187.

    Article  Google Scholar 

  7. Zhu X, Suk H-I, Lee S-W, Shen D. Subspace regularized sparse multitask learning for multiclass neurodegenerative disease identification. IEEE Trans Biomed Eng. 2016;63(3):607–18. https://doi.org/10.1109/TBME.2015.2466616.

    Article  Google Scholar 

  8. Fan M, Zhang X, Hu J, Gu N, Tao D. Adaptive data structure regularized multiclass discriminative feature selection. IEEE Trans Neural Netw Learn Syst. 2022;33(10):5859–72. https://doi.org/10.1109/TNNLS.2021.3071603.

    Article  MathSciNet  Google Scholar 

  9. Xu J, Han J, Nie F, Li X. Multi-view scaling support vector machines for classification and feature selection. IEEE Trans Knowl Data Eng. 2020;32(7):1419–30. https://doi.org/10.1109/TKDE.2019.2904256.

    Article  Google Scholar 

  10. Kalakoti R, Nõmm S, Bahsi H. In-depth feature selection for the statistical machine learning-based botnet detection in IoT networks. IEEE Access. 2022;10:94518–35. https://doi.org/10.1109/ACCESS.2022.3204001.

    Article  Google Scholar 

  11. Vidyarthi A, Agarwal R, Gupta D, Sharma R, Draheim D, Tiwari P. Machine learning assisted methodology for multiclass classification of malignant brain tumors. IEEE Access. 2022;10:50624–40. https://doi.org/10.1109/ACCESS.2022.3172303.

    Article  Google Scholar 

  12. Peng, Wu X, Yuan W, Zhang X, Zhang Y, Li Y. MGRFE: multilayer recursive feature elimination based on an embedded genetic algorithm for cancer classification. IEEE/ACM Trans Comput Biol Bioinform. 2021;18(2):621–32. https://doi.org/10.1109/TCBB.2019.2921961.

    Article  Google Scholar 

  13. Wu J, Guo P, Cheng Y, Zhu H, Wang X-B, Shao X. Ensemble generalized multiclass support-vector-machine-based health evaluation of complex degradation systems. IEEE/ASME Trans Mechatron. 2020;25(5):2230–40. https://doi.org/10.1109/TMECH.2020.3009449.

    Article  Google Scholar 

  14. Bakro M, et al. An improved design for a cloud intrusion detection system using hybrid features selection approach with ML classifier. IEEE Access. 2023;11:64228–47. https://doi.org/10.1109/ACCESS.2023.3289405.

    Article  Google Scholar 

  15. Nahiduzzaman M, Islam MR, Islam SMR, Goni MOF, Anower MS, Kwak K-S. Hybrid CNN-SVD based prominent feature extraction and selection for grading diabetic retinopathy using extreme learning machine algorithm. IEEE Access. 2021;9:152261–74. https://doi.org/10.1109/ACCESS.2021.3125791.

    Article  Google Scholar 

  16. Jung D. Distributed feature selection for multi-class classification using ADMM. IEEE Control Syst Lett. 2021;5(3):821–6. https://doi.org/10.1109/LCSYS.2020.3006428.

    Article  MathSciNet  Google Scholar 

  17. Abramovich F, Grinshtein V, Levy T. Multiclass classification by sparse multinomial logistic regression. IEEE Trans Inf Theory. 2021;67(7):4637–46. https://doi.org/10.1109/TIT.2021.3075137.

    Article  MathSciNet  Google Scholar 

  18. Deepa N, Khan MZ, Prabadevi B, Vincent P M DR, Maddikunta PKR, Gadekallu TR. Multiclass model for agriculture development using multivariate statistical method. IEEE Access. 2020;8:183749–58. https://doi.org/10.1109/ACCESS.2020.3028595.

    Article  Google Scholar 

  19. Tsai C-F, Lin W-C. Feature selection and ensemble learning techniques in one-class classifiers: an empirical study of two-class imbalanced datasets. IEEE Access. 2021;9:13717–26. https://doi.org/10.1109/ACCESS.2021.3051969.

    Article  Google Scholar 

  20. Bradde T, Fracastoro G, Calafiore GC. Multiclass sparse centroids with application to fast time series classification. IEEE Trans Neural Netw Learn Syst. 2023;34(8):5206–11. https://doi.org/10.1109/TNNLS.2021.3124300.

    Article  Google Scholar 

  21. Rafiuddin N, Khan YU, Farooq O. A novel wavelet approach for multiclass iEEG signal classification in automated diagnosis of epilepsy. IEEE Trans Instrum Meas. 2022;71:1–10. https://doi.org/10.1109/TIM.2022.3207799. (Art no. 4009010).

    Article  Google Scholar 

  22. Kumar A, Kaur A, Singh P, Driss M, Boulila W. Efficient multiclass classification using feature selection in high-dimensional datasets. Electronics. 2023;12:2290. https://doi.org/10.3390/electronics12102290.

    Article  Google Scholar 

  23. Agarwal R, Shekhawat NS, Kumar S, Nayyar A, Qureshi B. Improved feature selection method for the identification of soil images using Oscillating Spider Monkey Optimization. IEEE Access. 2021;9:167128–39.

    Article  Google Scholar 

  24. Wang Z, Wang C, Wei J, Liu J. Multi-class feature selection by exploring reliable class correlation. Knowl-Based Syst. 2021;230: 107377. https://doi.org/10.1016/j.knosys.2021.107377.

    Article  Google Scholar 

  25. Benkessirat, Benblidia N. Fundamentals of feature selection: an overview and comparison. In: 2019 IEEE/ACS 16th International Conference on Computer Systems and Applications (AICCSA), Abu Dhabi, United Arab Emirates, 2019, p. 1–6. https://doi.org/10.1109/AICCSA47632.2019.9035281.

  26. Wang Y, Feng L. A new hybrid feature selection based on multi-filter weights and multi-feature weights. Appl Intell. 2019;49:4033–57. https://doi.org/10.1007/s10489-019-01470-z.

    Article  Google Scholar 

  27. Cateni, Sivia & Colla, Valentina & Vannucci, Marco. A hybrid feature selection method for classification purposes. In: Proceedings—UKSim-AMSS 8th European Modelling Symposium on Computer Modelling and Simulation, EMS 2014. 2014. https://doi.org/10.1109/EMS.2014.44.

  28. Gong L, Xie S, Zhang Y, Wang M, Wang X. Hybrid feature selection method based on feature subset and factor analysis. IEEE Access. 2022;10:120792–803. https://doi.org/10.1109/ACCESS.2022.3222812.

    Article  Google Scholar 

  29. Bashiri Mosavi SA. Applying cross-permutation-based quad-hybrid feature selection algorithm on transient univariates to select optimal features for transient analysis. IEEE Access. 2022;10:41131–51. https://doi.org/10.1109/ACCESS.2022.3166917.

    Article  Google Scholar 

  30. Thejas GS, Garg R, Iyengar SS, Sunitha NR, Badrinath P, Chennupati S. Metric and accuracy ranked feature inclusion: hybrids of filter and wrapper feature selection approaches. IEEE Access. 2021;9:128687–701. https://doi.org/10.1109/ACCESS.2021.3112169.

    Article  Google Scholar 

Download references

Acknowledgements

The authors acknowledged the Thanthai Periyar Government Arts & Science College (Autonomous), Affiliated to Bharathidasan University, Tiruchirappalli, Tamilnadu, India for supporting the research work by providing the facilities.

Funding

No funding involved in the research work.

Author information

Authors and Affiliations

Authors

Contributions

The research resulted from a collective effort, with all authors contributing collaboratively to its accomplishment.

Corresponding author

Correspondence to R. Senthamil Selvi.

Ethics declarations

Conflict of interest

Authors have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Selvi, R.S., Bibi, K.F. Feature Selection of Multi-class Data Sets Based on Enhanced Binary Gray Wolf Algorithm and Ant Colony Optimization Algorithm. SN COMPUT. SCI. 5, 1076 (2024). https://doi.org/10.1007/s42979-024-03402-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s42979-024-03402-2

Keywords

Navigation