skip to main content
10.1145/3608164.3608172acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicbbtConference Proceedingsconference-collections
research-article

A Bald Eagle Search Optimization Based Weighted Rank Aggregation Method for Microarray Data Classification

Published: 07 November 2023 Publication History

Abstract

The rapid development of microarray technology has generated a large amount of microarray data, and the classification of these data is meaningful for cancer diagnosis, treatment and prognosis. The classification of high-dimensional microarray data with small samples is a challenging problem, which usually requires feature selection methods to reduce the data dimensionality first. However, different feature selection methods usually generate different feature lists for the same data. Researchers need to choose among many feature selection methods, which reduces the research efficiency. Therefore, rank aggregation method is used to generate a optimal list by aggregating all ordered feature lists generated by different feature selection methods. It can combine the advantages of multiple feature selection methods and does not prefer a particular method, so it is more robust to outliers, noises and errors. In this paper, we propose a weighted rank aggregation method based on the Bald Eagle Search optimization. A positional weight is designed to emphasize the importance of the top features in the list, so that the distance between lists can be measured more accurately. In addition, we improve the Bald Eagle Search algorithm for optimizing rank aggregation method to obtain a optimal ordered list. The experimental results on six public microarray datasets indicate that the features selected by our method can significantly improve the classification performance.

References

[1]
F Adacd, E Ata, D Smb, and F Rmc. 2020. Gene selection and classification of microarray data method based on mutual information and moth flame algorithm. Expert Systems with Applications 166 (2020).
[2]
Juan A Aledo, Jose A Gámez, and David Molina. 2013. Tackling the rank aggregation problem with evolutionary algorithms. Appl. Math. Comput. 222 (2013), 632–644.
[3]
N. Almugren and H. M. Alshamlan. 2019. New Bio-Marker Gene Discovery Algorithms for Cancer Gene Expression Profile. IEEE Access PP, 99 (2019), 1–1.
[4]
HA Alsattar, AA Zaidan, and BB Zaidan. 2020. Novel meta-heuristic bald eagle search optimisation algorithm. Artificial Intelligence Review 53, 3 (2020), 2237–2264.
[5]
A. Chaudhuri and T. P. Sahu. 2022. Multi-objective feature selection based on quasi-oppositional based Jaya algorithm for microarray data. Knowledge-Based Systems 236 (2022), 107804–.
[6]
Robert P DeConde, Sarah Hawley, Seth Falcon, Nigel Clegg, Beatrice Knudsen, and Ruth Etzioni. 2006. Combining results of microarray experiments: a rank aggregation approach. Statistical applications in genetics and molecular biology 5, 1 (2006).
[7]
Chris Ding and Hanchuan Peng. 2005. Minimum redundancy feature selection from microarray gene expression data. Journal of bioinformatics and computational biology 3, 02 (2005), 185–205.
[8]
Yunpeng Gao and Ke Xu. 2019. pRankAggreg: A fast clustering based partial rank aggregation. Information Sciences 478 (2019), 408–421.
[9]
Jie Huang, Li Zhang, Changyu He, Ying Qu, Jianfang Li, Jianian Zhang, Tao Du, Xuehua Chen, Yingyan Yu, Bingya Liu, 2015. Claudin-1 enhances tumor proliferation and metastasis by regulating cell anoikis in gastric cancer. Oncotarget 6, 3 (2015), 1652.
[10]
A. Jain and D. Zongker. 1997. Feature selection: evaluation, application, and small sample performance. IEEE Trans.pattern Anal.mach.intell 19, 2 (1997), 153–158.
[11]
Indu Jain, Vinod Kumar Jain, and Renu Jain. 2018. Correlation feature selection based improved-binary particle swarm optimization for gene selection and cancer classification. Applied Soft Computing 62 (2018), 203–215.
[12]
X. Jiang, L. H. Lim, Y. Yao, and Y. Ye. 2011. Statistical ranking and combinatorial Hodge theory. Mathematical Programming 127, 1 (2011), 203–244.
[13]
Raivo Kolde, Sven Laur, Priit Adler, and Jaak Vilo. 2012. Robust rank aggregation for gene list integration and meta-analysis. Bioinformatics 28, 4 (2012), 573–580.
[14]
Liang Liang, Jin-hui Zhu, Gang Chen, Xin-gan Qin, and Jun-qiang Chen. 2020. Prognostic values for the mRNA expression of the ADAMTS family of genes in gastric cancer. Journal of Oncology 2020 (2020).
[15]
Shili Lin. 2010. Rank aggregation methods. Wiley Interdisciplinary Reviews: Computational Statistics 2, 5 (2010), 555–570.
[16]
H. Lu, J. Chen, K. Yan, Q. Jin, Y. Xue, and Z. Gao. 2016. A hybrid feature selection algorithm for gene expression data classification. Neurocomputing 256, sep.20 (2016), 56–62.
[17]
M. Maghsoudloo, S. A. Jamalkandi, A. Najafi, and A. Masoudi-Nejad. 2020. An efficient hybrid feature selection method to identify potential biomarkers in common chronic lung inflammatory diseases. Genomics5 (2020), 112.
[18]
M. Mandal and A. Mukhopadhyay. 2017. Multiobjective PSO-based rank aggregation: Application in gene ranking from microarray data. Information Sciences s 385–386 (2017), 55–75.
[19]
M. E. Matheny, F. S. Resnic, N. Arora, and L. Ohno-Machado. 2007. Effects of SVM parameter optimization on discrimination and calibration for post-procedural PCI mortality. Journal of Biomedical Informatics 40, 6 (2007), 688–697.
[20]
Alberto Maydeu Olivares. 2006. Corrections to Classical Procedures for Estimating Thurstone’s Case V Model for Ranking Data. Instituto de Empresa Business School Working Paper No. WP06-25 (2006).
[21]
Trevelyan R Menheniott, Louise O’Connor, Yok Teng Chionh, Jan Däbritz, Michelle Scurr, Benjamin N Rollo, Garrett Z Ng, Shelley Jacobs, Angelique Catubig, Bayzar Kurklu, 2016. Loss of gastrokine-2 drives premalignant gastric inflammation and tumor progression. The Journal of clinical investigation 126, 4 (2016), 1383–1400.
[22]
Mohammadreza Momenzadeh, Mohammadreza Sehhati, and Hossein Rabbani. 2019. A novel feature selection method for microarray data classification based on hidden Markov model. Journal of biomedical informatics 95 (2019), 103213.
[23]
Y. Nakayama, K. Yata, and M. Aoshima. 2021. Clustering by principal component analysis with Gaussian kernel in high-dimension, low-sample-size settings. Journal of Multivariate Analysis 185 (2021).
[24]
Somnath Panja, Samiran Bag, Feng Hao, and Bimal Roy. 2020. A smart contract system for decentralized borda count voting. IEEE Transactions on Engineering Management 67, 4 (2020), 1323–1339.
[25]
Vasyl Pihur, Susmita Datta, and Somnath Datta. 2009. RankAggreg, an R package for weighted rank aggregation. BMC bioinformatics 10, 1 (2009), 1–10.
[26]
S. Salesi, G. Cosma, and M. Mavrocouniotis. 2021. TAGA: Tabu Asexual Genetic Algorithm Embedded in a Filter/Filter Feature Selection Approach for High-dimensional Data. Information Sciences (2021).
[27]
Gehad Ismail Sayed, Mona M Soliman, and Aboul Ella Hassanien. 2021. A novel melanoma prediction model for imbalanced data using optimized SqueezeNet by bald eagle search optimization. Computers in Biology and Medicine 136 (2021), 104712.
[28]
Jingwei Too and Abdul Rahim Abdullah. 2021. A new and fast rival genetic algorithm for feature selection. The Journal of Supercomputing 77, 3 (2021), 2844–2874.
[29]
Wanchuang Zhu, Yingkai Jiang, Jun S Liu, and Ke Deng. 2021. Partition-Mallows Model and Its Inference for Rank Aggregation. J. Amer. Statist. Assoc.just-accepted (2021), 1–41.

Cited By

View all
  • (2024)Bald eagle search algorithm: a comprehensive review with its variants and applicationsSystems Science & Control Engineering10.1080/21642583.2024.238531012:1Online publication date: Aug-2024

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICBBT '23: Proceedings of the 2023 15th International Conference on Bioinformatics and Biomedical Technology
May 2023
313 pages
ISBN:9798400700385
DOI:10.1145/3608164
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2023

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. feature selection
  2. microarray data
  3. rank aggregation
  4. stochastic optimization

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICBBT 2023

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)21
  • Downloads (Last 6 weeks)1
Reflects downloads up to 17 Jan 2025

Other Metrics

Citations

Cited By

View all
  • (2024)Bald eagle search algorithm: a comprehensive review with its variants and applicationsSystems Science & Control Engineering10.1080/21642583.2024.238531012:1Online publication date: Aug-2024

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media