Prediction of malignant and benign breast cancer: A data mining approach in healthcare applications
Advances in Data Science and Management: Proceedings of ICDSM 2019, 2020•Springer
As much as data science is playing a pivotal role everywhere, health care also finds its
prominent application. Breast Cancer is the top-rated type of cancer amongst women; which
alone took away 627,000 lives. This high mortality rate due to breast cancer does need
attention, for early detection so that prevention can be done in time. As a potential
contributor to state-of-the-art technology development, data mining finds a multi-fold
application in predicting Brest cancer. This work focuses on different classification …
prominent application. Breast Cancer is the top-rated type of cancer amongst women; which
alone took away 627,000 lives. This high mortality rate due to breast cancer does need
attention, for early detection so that prevention can be done in time. As a potential
contributor to state-of-the-art technology development, data mining finds a multi-fold
application in predicting Brest cancer. This work focuses on different classification …
Abstract
As much as data science is playing a pivotal role everywhere, health care also finds its prominent application. Breast Cancer is the top-rated type of cancer amongst women; which alone took away 627,000 lives. This high mortality rate due to breast cancer does need attention, for early detection so that prevention can be done in time. As a potential contributor to state-of-the-art technology development, data mining finds a multi-fold application in predicting Brest cancer. This work focuses on different classification techniques implementation for data mining in predicting malignant and benign breast cancer. Breast Cancer Wisconsin data set from the UCI repository has been used as an experimental dataset while attribute clump thickness being used as an evaluation class. The performances of these twelve algorithms: Ada Boost M1, Decision Table, J-Rip, J48, Lazy IBK, Lazy K-star, Logistics Regression, Multiclass Classifier, Multilayer–Perceptron, Naïve Bayes, Random Forest, and Random Tree is analyzed on this data set.
Springer