BP 2

Uploaded by

kandeharikabai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

36 views6 pages

BP 2

Uploaded by

kandeharikabai

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

A SURVEY ON TECHNIQUE FOR PREDICTION OF

DISEASE IN MEDICAL DATA

Ahelam Tikotikar Mallikarjun Kodabagi
PhD Scholar Professor
School of Computing & IT School of Computing & IT
REVA UNIVERSITY REVA UNIVERSITY
Rukmini Knowledge Park Rukmini Knowledge Park
Kattigenahalli Yelahanka Kattigenahalli Yelahanka
Bangalore-560064 Bangalore-560064

Abstract—In today’s era data mining plays important role for attribute to generate fuzzy rules which are weighted based on
prediction of diseases in medical field. With the growing research the frequency in the learning database. Then these fuzzy rules
on disease predicting system, it has become important to discover are used to built the decision support system. The proposed
hidden patterns and relationships from medical databases. In system is tested on three types of dataset those are cleveland
classical clinical diagnosis, it requires lots of tests which could
dataset collected from V.A medical center which contains total
complicate the disease prediction. Hence the data mining
techniques can help medical expertise to take the decision about instance of 303 records in which 202 are training data and 101
the disease using computer aided decision support system. In this are used as test data. Hungarian dataset collected from
paper comprehensive survey on various data mining techniques Hungarian Institute of Cardiology, Budapest includes 196 of
used for disease prediction is presented. training data, 98 of test data which has total instance of 294
records. And total instance of 123 records are taken in which
Keywords—Data mining, disease prediction, breast cancer, 41 are test data and 82 are training data from Switzerland
heart, feature selection. dataset collected from University Hospital, Zurich,
I. Introduction Switzerland. This method when compared to neural-network
based system has achieved highest accuracy. The method
Data mining is a novel approach for extracting knowledge reported accuracy of 57.851% for Cleveland data. Hungarian
from databases. One of the most active research of data dataset, gives the accuracy of 50.583%. Switzerland dataset
mining is healthcare industry. As healthcare companies are reported the accuracy which is 20%, higher then neural-
making efforts to gather patient’s records. Estimation shows network based system. [1]
that there is approximately 1,099,511,627,776 bytes of data,
which is thus increasing day by day. This data has to be mined S. Apte et al. (2012) proposes data mining classification
to extract useful information. As sometimes patients fails to technique, for prediction of heart disease. In this approach data
explain, their symptoms correctly and laboratory reports preprocessing technique is applied, to remove missing values
outcomes may be with some degrees of error. The doctors find and this missing value has been replaced with mean mode
difficulties in taking decision about the disease as they may method. Later the multi-layer perceptron neural network is
not have expertise in all fields. Thus to solve this problems, used for mapping the data. Hence data mining classification
there is a need for development of decision prediction system techniques namely: naïve Bayes, neural network and decision
that combines knowledge of medical expertise with automated trees are analyzed on Heart disease database. Here the data
system to achieve best results and can serve the society. mining tool used is Weka 3.6.6. The method collected 303
II. Description of data mining techniques for disease records from Cleveland heart disease database which is used
prediction as training set & 270 records from Statlog Heart Disease
database which is used as test set. The data set consists of 3
In this section significant amount of work has gone in to the types of attributes: Predictable attribute, Input and Key.
research related to data mining technique for disease Totally 573 records are used to detect the disease. Prior 13
prediction. input attributes are used and further two more attributes are
P. K. Anooj (2012) proposed a clinical decision support added, those attributes are smoking and obesity, as these
system using weighted fuzzy rules for risk level prediction of attributes are considered as important factor for disease like
heart disease. In this method, first data preprocessing for heart. Thus, the result obtained, is that the accuracy of neural
eliminating missing value is applied. Further they carry out network reported to be 100%, decision tree gives accuracy of
generation of weighted fuzzy and developed a fuzzy rule- 99.62% and naïve bayes reported the accuracy of 90.74%.
based decision support system. The method selects suitable And after comparing these three classification techniques, the

978-1-5386-0569-1$31.00 2017
c IEEE 550
result derived was, as compared to decision trees and naïve classification of cerebrovascular accident attack. This method
bayes, the accuracy of neural network was highest. [2] first predicts the variable which is dependent. The model with
hidden layer 10 nodes and output layer consist of one node is
D.I. Kotsia et al. (2008) proposed an automatically generated used. The dataset contains 100 records in which 40 were
system for diagnosis of coronary artery disease using data females and 60 were males from federal medical centre, Owo,
mining and fuzzy modeling. It contains various steps such as: Nigeria. The neural network was trained with 150 epoch and
induction of a decision tree from data, extraction of rules, MSE of 0.0698843. Thus, simulation result achieved by this
formulation of crisp model, transformation of crisp model to approach is that the model was capable of producing a
fuzzy model and finally optimization. The data for testing this reasonable forecasting accuracy. [8]
method is collected from Invasive Cardiology, department of
the university, hospital of Ioannina. The dataset consists of I.H.Elhajj et al. (2010) proposed an anticipated decision
199 subjects, each one characterized by 19 features. The support system to detect agitation transition. The model uses
method reports a sensitivity accuracy of 80% and specificity decision confidence measure and two new support vector
accuracy of 65%, after fuzzification and optimization. [3] machine architectures namely confidence-based SVM and
confidence-based multilevel SVM for detecting agitation
T. Turner et al. (2012) proposed a method by integrating k- transition. The dataset is obtained using sensors, placed
means clustering and decision tree for diagnosing heart around body. Then the patient undergoes trait scale state-trait
disease Patient. The method employs different centroid anxiety inventory (T-STAI), which is used to measure anxiety
selection methods for k-means clustering algorithm and in adults. By this 240 samples are collected. Thus, an accuracy
decision tree for determining the clusters. The dataset is of 91.4% was achieved as compared to conventional support
collected from Cleveland Clinic Foundation Heart Disease. vector machine which had an accuracy of 90.9%. [9]
The dataset contains 13 different attributes. The combination
of K-means Clustering and decision tree has achieved great S. Pal et al. (2013) describe a model for predicting heart
results when compared to traditional decision tree. Thus, the disease using data mining technique. The proposed
integrated algorithm reported the accuracy of 83.9%. [4] methodology surveyed on three different classifiers namely:
ID3 (Iterative Dichotomized 3), Decision tree, and CART
R. Stocker et al. (2012) proposed a method for diagnosing (Classification and Regression tree). The dataset is collected
heart disease patient, by integrating k- means clustering and from Cleveland Clinic Foundation. Thus, the observation and
naïve bayes with different centroid selection. The dataset is comparison showed that Classification and Regression tree
obtained from Cleveland Clinic Foundation. Thus, by (CART) achieved the accuracy of 83.49% which was
integrating k- means clustering and naïve bayes, the accuracy comparatively better then ID3 (Iterative Dichotomized 3) and
reported is 84.5%, when compared to individual algorithm. [5] Decision tree. The average error reported by CART was 0.3.
The time taken to build CART model is 0.23 seconds. [10]
R. Subramanian et al. (2007) proposed a method for predicting
intelligent heart disease. The method is implemented by K. Chandra Shekar et al. (2011) proposed a method for
integrating three models namely neural network, coactive classification of heart attack patients. Here in this approach,
neuro-fuzzy inference system (CANFIS) for discovering firstly the dataset is preprocessed, then modified equal width
nonlinear relationship maps between different attribute model bining interval approach is applied. Further numeric attributes
and genetic algorithm. The simulation is performed on are converted in to categorical form and frequent patterns
NeuroSolution Software. The dataset is obtained from UCI. applicable to heart disease are mined, using pruning-
Hence CANFIS reported the mean square error of 0.000842. classification association rule (PCAR) algorithm from the data
[6] extracted. Thus, the model uses only selected class label for
effective prediction. The dataset is obtained from UCI. Hence
Montazer et al. (2010) proposed a model to detect coronary the model is capable of predicting the heart attack effectively.
heart disease risk assessment using fuzzy-evidential hybrid [11]
inference engine. In this method, first fuzzy set rules are
applied for the information which is not clear and then extract S. Soni et al. (2011) proposed a model for heart attack
fuzzy rule set. This result is considered as basic belief. And prediction using weighted associative classifier (WAC). The
from this belief, plausibility functions are positioned. This is dataset is obtained from University of California Irvine (UCI)
called as decision making uncertainty and hence information machine learning repository. In this method instead of using 5
fusion takes place from various sources. The dataset is class label i,e 4 for four types of Heart Disease and 1 for no
obtained from Hungarian institute of cardiology’s heart heart Disease. The method considers only 2 class labels 1 for
disease dataset in the university of California, Irvine’s “Heart Disease” and another for “No Heart Disease” as the
machine learning repository. The dataset consists of 294 data set is having less number of records for different types of
samples. Hence the accuracy achieved is 91.58%. [7] Heart Disease. Using 25% of support value and 80% of
confidence value. The model achieved the accuracy of
Olabode et al. (2012) proposes multilayer feed forward 81.51%. Thus, weighted associative classifier is the best
artificial neural network with back propagation error for

2017 International Conference On Smart Technology for Smart Nation 551

approach to obtain significant pattern from the dataset of heart P.K. Anooj (2012) 57.85%
disease. [12]

A.Govrdhan et al. (2011) proposed a data mining application S. Apte et al. (2012) 100%
in medical industry for predicting heart attacks. The method
uses one dependency augmented naïve bayes classifier
(ODANB) and naive creedal classifier 2 (NCC2) for data D.I. Kotsia et al. (2008) 80%
preprocessing. The application uses three data mining
algorithms namely: decision list, naïve bayes and K-NN for
predicting heart attacks. The dataset used here is plain text
T. Turner etal. (2012) 83.9%
format ARFF files and also dataset from the University of
California Irvine (UCI) machine learning repository. The
method has been validated on 3000 instances of dataset with
14 different attributes. The method performed validations on R. Stocker et al. (2012) 84.5%
both training and test data, in which 70% data is used as
training and remaining 30% as test data. The decision list
reported the accuracy of 52%, naïve bayes gives the accuracy R. Subramanian et (2007) MSE -0.000842
of 52.33% and K-NN has achieved the accuracy of 45.67%. al.
The comparison was made among these algorithms. The
results were judged on the basis of accuracy and time taken in
diagnosing the heart disease. Navie bayes was chosen as the Montazer etal. (2010) 91.58%
best classification algorithm as time taken by navie bayes was
comparatively less when compared to decision list and K-NN
algorithm i.e 609ms. [13] Olabode et al. (2012) MSE-0.0698843
E. Anupriya et al. (2010) employ a model to predict heart
disease. First genetic algorithm is used to determine
significant attribute. Then new population is constructed using I.H.Elhajj et al. (2010) 91.4%
survival of fittest. Further the model uses three classification
techniques namely: decision tree, classification via clustering
and naive bayes for predicting disease. The dataset consists of S. Pal et al. (2013) 83.49%
909 records. Initially with 13 attributes, which were reduced to
6 attributes with 0.6 cross over probability and 0.033 mutation
probability. The Decision tree reported highest accuracy of
99.2%, when compared to other classification technique. [14] K. Chandra Shekar (2011) Predicts
et al. effectively
C. Ardil et al. (2013) Proposed a method using data mining
technique for predicting acute coronary syndrome. First the S. Soni et al. (2011) 81.51%
model uses data reduction technique to reduce the dimensions.
After applying principal component analysis on the ten
independent numeric variables, the model founds that the first
eight principle components cover more than 98% of the total A.Govrdhan et al. (2011) 52.33%
variability of the continuous data space. The model uses data
sets from two different cardiac hospitals of Karachi and
Pakistan. After data reduction, the 14 independent variables E. Anupriya et al. (2010) 99.2%
are hypertension, gender, fasting blood sugar, cholesterol,
pulse rate, heart rate, smoke, age, blood pressure (diastolic),
family history, hypertension, diabetics mellitus, streptokinase,
blood pressure (systolic). Thus, the observation showed that C. Ardil et al. (2013) Smoking is
smoking is the most significant factor or risk for acute significant factor.
coronary syndrome, when compared to other factors. [15]

The comparative study of heart attack disease

Authors Year Accuracy Survey on Breast Cancer Diagnosis

552 2017 International Conference On Smart Technology for Smart Nation

Some of the other data mining works reported on breast cancer proposed method uses three data mining technique namely:
prediction are summarized below. back propagated neural network, naïve bayes and C4.5. The
dataset is obtained from the Surveillance, Epidemiology, and
M. Yaghoobi et al. (2014) report a fuzzy system for End Results (SEER) program of the national cancer
distinguishing between benign and malignant breast cancer. institute (NCI) is a source of epidemiologic information on
The method introduced chaos in to the hierarchical cluster- the incidence and survival rates of cancer in the united states.
based multispecies partical swarm optimization, which leads The dataset consists of 151,886 records, with 16 fields. The
to development of chaotic hierarchical cluster-based model reports the accuracy of approximately 87%. This
multispecies particle swarm optimization (CHCMSPSO). The accuracy was implemented using weka toolkit. [19]
CHCMSPSO helps for distinguishing the type of breast cancer
and for optimizing the fuzzy system. Here the model also R. Ceylan et al. (2013) proposed a model for identifying
learns about takagi-sugeno-kang type fuzzy rules with high biomedical pattern classification using artificial neural
accuracy. The dataset is obtained from University of network based on rotation forest (RF-ANN). The model uses
California Irvine (UCI) machine learning repository. Thus, the multilayer perceptron neural network as the base classifier. RF
method uses eleven chaotic maps for global search ability. algorithm was used as ensemble classifier. Different feature
Among those maps, sinusoidal chaotic map achieved accuracy sets are obtained from original data set using principal
of 99%, because it coordinated well with the problem component analysis. The dataset is obtained from Wisconsin
condition. The accuracy of model to distinguish between breast cancer data from the University of California Irvine
benign and malignant breast cancer is reported to be more than (UCI) machine learning repository. Thus, the accuracy
90%. [16] reported by RF-ANN is 98.05%. [20]

Burke B.H et al. (1999) proposed a model for evaluating the Behnam H et al. (2005) proposes a model to predict the
accuracy of ANN in predicting 5, 10 and 15 years breast disease and assist the radiologists for diagnosing breast cancer.
cancer specific survival. The eight input variables entered in The model integrates multiwavelet based sub band image
this model are nuclear pleomorphism, tumor necrosis, tubule decomposition and artificial neural network (ANN). The
formation, age, axillary nodal status, mitotic count, method is tested on mammographic image analysis
histological and tumor size. The dataset is obtained from City society(MIAS) mammographic database. Among different
Hospital of Turku and Turku University Central Hospital. The multiwavelet, performance of biorthogonal geronimo, hardin
dataset consists of 951 instances. Further divided in to training and massopust multiwavelet with length 2 (BiGHM2) was
set of 651 and a validation set of 300 patients. Here in this best. Thus, BiGHM2 achieved accuracy with areas ranging
model, the results of artificial neural network and logistic around 0.96 under receiver operating characteristic curve. [21]
regression is compared. The accuracy for 5 years survival
reported to be 0.909, 0.086 for 10 years and 0.883 for 15 S.Nahavandi et al. (2015) proposes an automated medical data
years. Thus, the observation and comparison showed that classification method using interval type-2 fuzzy logic system
artificial neural network reports consistently high accuracy (IT2FLS) and wavelets. The model deals with uncertainity and
over time when compared with logistic regression. [17] high dimensionality data challenge. This implementation is
carried out on two different medical datasets: Cleveland heart
G.Walker et al. (2005) implemented a method for comparing disease and Wisconsin breast cancer from UCI repository for
three data mining technique for predicting breast cancer machine learning. The result demonstrates that, advantage of
survivability. The data mining technique includes logistic IT2FLS is better when compared to other machine learning
regression, decision tree and artificial neural network. The method. [22]
model uses a large dataset with more than 200,000 cases.
Thus, the results obtained, is that the accuracy of logistic S. Sulong et al. (2012) proposes a method to detect cervical
regression reported to be 89.2%, decision tree (C5) gives the cancer using confounding effects like age, marital status and
accuracy of 93.6% and artificial neural network reported the treatment among Malaysian women. The cervical cancer
accuracy of 91.2%. To test the data 10-fold cross-validation is patient records are taken from databank of department,
performed, to measure the unbiased estimate, for prediction of university Kebangsaan Malaysia(UKM) medical center. The
three techniques. Thus, the comparative study concludes that model considers four stages, with 444 patient records, who are
the decision tree (C5), is the best predictor for predicting suffering from cervical cancer, and found out the treatment for
breast cancer survivability, as compared to artificial neural women according to their age, and marital status. Thus, found
network and logistic regression. [18] that the women at the age of 46 years have more chances of
cervical cancer. So Malaysian women are suggested to take
E. Gauven et al. (2006) proposed a model for prediction of test before the age of 45 years and it also discovers that
breast cancer survivability using data mining techniques. In married and Chinese women less the 57 years old are more
this approach, first pre-classification process is performed by likely to diagnose in the early stage of cervical cancer either
considering three fields namely: vital status recode, cause of by operation or by both combined treatment of radiotherapy
death, and survival time recode. For classification, the and operations compared to any other treatment. [23]

2017 International Conference On Smart Technology for Smart Nation 553

E. Gauven et al. (2006) 87%
P. Lim et al. (2014) presents a medical data classification
technique for prediction of cancer using hybrid intelligent
classification. The model consists of classification trees and
R. Ceylan et al. (2013) 98.05%
regression tree (CART), random forest and fuzzy min-max
neural network. The random forest (RF) method is applied to
form an ensemble of classification and regression tree (CART)
model. Fuzzy min-max is used for learning. The CART is Behnam H et al. (2005) ROC-0.96
used for rule extraction. The database is collected from Liver
Disorders, Wisconsin breast cancer (WBC), Pima Indians
diabetes from UCI. Thus, accuracy achieved by this hybrid
S.Nahavandi et al. (2015) 97.40%
model for prediction of cancer is 98.84%. [24]

D. Chen et al. (2001) proposed a novel system using data

mining with decision tree for classification of breast tumor in Sulong et al. (2012) Test at 45 years
medical ultrasonic images. Breast tumor is deadly disease
among women. The model monitored US images. In this
model C5.0 algorithm is used as the decision tree. The data P. Lim et al. (2014) 98.84%
mining with decision tree has helped for classification of
breast tumor with the sensitivity accuracy of 93.33% and
specificity accuracy of 96.67%. [25]
D. Chen et al. (2001) 96.67%

Choi Chul Sang et al. (2001) proposed a model using

Hierarchical Classification Structures with rough set. Here Choi Chul Sang et (2001)
upper and lower approximations of rough set are used to al.
classify the interested objects in to similarity classes, so as to
reason uncertain concepts. Rough set has three valued
membership function those are PERHAPS, YES and NO. The
model uses hierarchical granulation structure to find the
III. RESEARCH GAP
classification rules and thus proposed a rule discovery. The
method is validated on dataset obtained from wisconsin breast
cancer data (WBC). The system when feeded with simple The papers discussed here reviews about data mining
rules and short conditionals, still it yields good performance. techniques, classification techniques, intelligent techniques
Thus, the method was successful for reducing the number of and feature selection for prediction of disease. As feature
attribute by generating minimal classification rules. The model selection helps us to eliminate unwanted data, and high
helps to analyze the information system in an easier way. [26] dimensional data has to be compressed without the loss of
information, by which performance of classifier is increased.
But the complexity of feature subset selection is high which is
The comparative study of breast cancer challenging task because it contains complex interdependency
on a variety of factor. In the future, we could integrate rules
Authors Year Accuracy and feature selection in the classifiers for better performance.
Further, new feature selection technique like ant colony
optimization etc, can be tested to improve the quality, and you
M. Yaghoobi et al. (2014) More than 90% may try to experiment algorithm potential for most medical
dataset which includes different characteristics like noisy data,
sparsity, missing value etc, to improve the reliability of model.
Burke B.H et al. (1999) 5yrs-0.909,
IV. CONCLUSION
10yrs-0.086
&for 15yrs-
0.883 The main focus of this paper is to discuss about decision
G.Walker et al. (2005) 93.6% parameter, attribute, and features used for predicting the
disease. The method also throws lights on importance of
different classification methods for prediction of disease in
medical dataset. The dataset considered in so many existing
techniques that we have discussed are related to heart and
breast cancer. The various data mining techniques are used as
classifier, to build a cost effective model for disease

554 2017 International Conference On Smart Technology for Smart Nation

prediction. Hence it is well understood by the exhaustive partical swarm optimization”, Iranian Conference on Intelligent
Systems, pp 1-6, feb 2014, DOI: 10.1109/IranianCIS.2014.6802524.
survey that mining the required information from the medical
data help us to support well informed diagnosis and decisions. [17] Lundin M., Lundin J., BurkeB.H.,Toikkanen S., Pylkkänen L. and
Joensuu H. , “Artificial Neural Networks Applied to Survival Prediction
in Breast Cancer”, Oncology International Journal for Cancer Resaerch
REFERENCES and Treatment, vol. 57, 1999.
[1] P. K. Anooj, “Clinical Decision Support system: Risk level prediction of [18] Delen D, Walker G, Kadam A, “Predicting breast cancer survivability: a
heart disease using weighted fuzzy rules”, Journal of King Saud comparision of three data mining methods”, Artif Intell Med. 2005
University- Computer and Information Sciences (2012) 24, 27-40. Jun;34(2):113-27. PMID:15894176 DOI:10.1016/j.artmed.2004.07.002
[2] Chaitrali S. Dangare and Sulabha S. Apte, “Improved Study of Heart [19] Bellaachia Abdelghani and Erhan Guven, "Predicting Breast Cancer
Disease Prediction System using Data Mining Classification Survivability using Data Mining Techniques," Ninth Workshop on
Techniques”, International Journal of Computer Applications (0975 – Mining Scientific and Engineering Datasets in conjunction with the
888) Volume 47– No.10, June 2012. Sixth SIAM International Conference on Data Mining,” 2006.
[3] Tsipouras M.G., Exarchos T.P., Fotiadis D.I., Kotsia A.P., Vakalis K.V., [20] H. Koyuncu and R. Ceylan, “Artificial neural network based on rotation
Naka K.K., Michalis L.K., Automated diagnosis of coronary artery forest for biomedical pattern classification,” in Telecom-munications
disease based on data mining and fuzzy modeling, IEEE T. Inf. Technol. and Signal Processing (TSP), 2013 36th International Conference on.
B., 12(4), 447–458, 2008. IEEE, 2013, pp. 581–585.
[4] Mai Shouman, Tim Turner and Rob Stocker, “Integrating Decision Tree [21] Jamarani S. M. h., Behnam H. and Rezairad G. A., “Multiwavelet Based
and K-Means Clustering with Different Initial Centroid Selection NeuralNetwork for Breast Cancer Diagnosis”, GVIP 05 Conference,
Methods in the Diagnosis of Heart Disease Patients”, Proceedings of the 2005, pp. 19-21.
International Conference on Data Mining, 2012.
[22] T. Nguyen, A. Khosravi, D. Creighton, and S. Nahavandi, “Medi-cal
[5] M. Shouman, T. Turner and R. Stocker, “Integrating naïve bayes and K- data classification using interval type-2 fuzzy logic system and
Means clustering with different initial centroid selection methods in the wavelets,” Applied Soft Computing, vol. 30, pp. 812–822, 2015.
diagnosis of heart disease patients” ICAITA, 2012.
[23] Z. Mahmud and S. Sulong, “Confounding effects of age, marital status
[6] Latha Parthiban and R. Subramanian, “Intelligent Heart Disease and treatment on cervical cancer stages among malaysian women”,
Prediction System using CANFIS and Genetic Algorithm”, International Statistics in Science, Business, and Engineering, 2012.
Journal of Biological and Life Science, Vol. 15, pp. 157 – 160, 2007.
[24] M. Seera and C. P. Lim, “A hybrid intelligent system for medical data
[7] Khatibi V., Montazer G.A., A fuzzy-evidential hybrid inference engine classification,” Expert Systems with Applications, vol. 41, no. 5, pp.
for coronary heart disease risk assessment, Expert Sys. Appl., 37(12), 2239–2249,2014.
8536–8542, 2010.
[25] W. Kuo, R. Chang, D. Chen and C. C. Lee, “Data Mining with Decision
[8] Bola Titilayo Olabode and Olatubosun Olabode, “Cerebrovascular Trees for Diagnosis of Breast Tumor in Medical Ultrasonic Images”,
Accident Attack Classification Using Multilayer Feed Forward Breast Cancer Research and Treatment, Dordrecht, vol. 66, Iss. 1, Mar
Artificial Neural Network with Back Propagation Error”, Journal of 2001.
Computer Science vol. 8 , No. 1,pp. 18-25, 2012.
[26] Chul-Heui Lee, Seon-Hak Seo, Sang-Chul Choi, “Rule Discovery using
[9] G. E. Sakr, I. H. Elhajj and H. A. Huijer, “Support vector machines to hierarchical clasification structure with rough sets,” IFSA World
define and detect agitation transition," IEEE Transactions On Affective Congress and 20th NAFIPS International Conference, 2001.
Computing, vol. 1, pp. 98-108, December 2010. DOI: 10.1109/NAFIPS.2001.944294
[10] V. Chaurasia and S. Pal, “Early prediction of heart diseases using data
mining techniques”, Caribbean Journal of Science and Technology,
vol.1, pp 208-217, 2013.
[11] K. Chandra shekar and N. Deepika , “Association rule for classification
of Heart Attack Patients”, International Journal of Advanced
Engineering Science and Technologies, Vol. 11, No. 2, pp.253 - 257,
2011.
[12] Uzma Ansari, Dipesh Sharma, Jyoti Soni and Sunita Soni, “Intelligent
and Effective Heart Disease Prediction System using Weighted
Associative Classifiers”, International Journal on Computer Science and
Engineering (IJCSE), ISSN : 0975-3397 Vol. 3 No. 6 June 2011.
[13] K. Srinivas, B. Kavitha Rani and Dr. A. Govrdhan,“Application of Data
Mining Techniques in Healthcare and Prediction of Heart
Attacks”, International Journal on Computer Science and Engineering,
Vol. 02, No. 02, pp.250 - 255, 2011
[14] M. Anbarasi, E. Anupriya and N.CH.S.N. Iyenga, “Enhanced Prediction
of Heart Disease with Feature Subset Selection using Genetic
Algorithm”, International Journal of Engineering Science and
Technology, Vol. 2, No. 10, pp.5370 - 5376, 2010.
[15] Jilani T.A., Yasin H., Yasin M., Ardil C., Acute coronary syndrome
prediction using data mining techniques-an application, International
Journal of Computer, Electrical, Automation, Control and Information
Engineering Vol:7, No:1, 2013
[16] A. Yassi, M. Yaghoobi, M. Yassi, “Distinguishing and Clustering breast
cancer according to hierarchical structures based on chaotic multispecies

2017 International Conference On Smart Technology for Smart Nation 555

Diagnosis of Heart Disease Using Data Mining Algorithm
No ratings yet
Diagnosis of Heart Disease Using Data Mining Algorithm
3 pages
Full Paper SRL
No ratings yet
Full Paper SRL
9 pages
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
No ratings yet
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
9 pages
Heart Disease Prediction with ML
No ratings yet
Heart Disease Prediction with ML
5 pages
Heart Disease Prediction via Data Mining
No ratings yet
Heart Disease Prediction via Data Mining
9 pages
Predictive Data Mining For Medical Diagnosis: An Overview of Heart Disease Prediction
No ratings yet
Predictive Data Mining For Medical Diagnosis: An Overview of Heart Disease Prediction
6 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
16 pages
Review of Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques
No ratings yet
Review of Heart Disease Prediction System Using Data Mining and Hybrid Intelligent Techniques
5 pages
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
No ratings yet
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
8 pages
Prediction Heart Disease
No ratings yet
Prediction Heart Disease
11 pages
Survey of Heart Disease Prediction Based On Data Mining Algorithms Ijariie1844
No ratings yet
Survey of Heart Disease Prediction Based On Data Mining Algorithms Ijariie1844
5 pages
An Analysis of Heart Disease Prediction
No ratings yet
An Analysis of Heart Disease Prediction
4 pages
A Comparative Study For Predicting Heart Diseases Using Data Mining Classification Methods
No ratings yet
A Comparative Study For Predicting Heart Diseases Using Data Mining Classification Methods
12 pages
Farzana 2020
No ratings yet
Farzana 2020
5 pages
Prediction of Heart Disease Using A Hybrid Technique in Data Mining Classification
No ratings yet
Prediction of Heart Disease Using A Hybrid Technique in Data Mining Classification
3 pages
Decision Support in Heart Disease Prediction System Using Naive Bayes
No ratings yet
Decision Support in Heart Disease Prediction System Using Naive Bayes
7 pages
An Optimized Approach For Prediction of Heart Diseases Using Gradient Boosting Classifier
No ratings yet
An Optimized Approach For Prediction of Heart Diseases Using Gradient Boosting Classifier
7 pages
An Optimized Approach For Prediction of Heart Diseases Using Gradient Boosting Classifier
No ratings yet
An Optimized Approach For Prediction of Heart Diseases Using Gradient Boosting Classifier
7 pages
A Comparative Study of Classification Algorithms For Diseases Prediction in Medical Domain
No ratings yet
A Comparative Study of Classification Algorithms For Diseases Prediction in Medical Domain
5 pages
4 Analysis of Heart Disease
No ratings yet
4 Analysis of Heart Disease
6 pages
IJCRT2205103
No ratings yet
IJCRT2205103
10 pages
Identification of Significant Features and Data Mining Techniques in Predicting Heart Disease Telematics and Informatics
No ratings yet
Identification of Significant Features and Data Mining Techniques in Predicting Heart Disease Telematics and Informatics
34 pages
Thesis Updated
No ratings yet
Thesis Updated
151 pages
Earlier Prediction of Heart Disease Using Locality Sensitive Hashing
No ratings yet
Earlier Prediction of Heart Disease Using Locality Sensitive Hashing
10 pages
Heart Disease Prediction Using DM Techniques
No ratings yet
Heart Disease Prediction Using DM Techniques
6 pages
Heart Disease Prediction via Data Mining
No ratings yet
Heart Disease Prediction via Data Mining
4 pages
Heart Disease Prediction Using Data Mining
No ratings yet
Heart Disease Prediction Using Data Mining
3 pages
Prognosis of Cardiac Disease Using Data Mining Techniques A Comprehensive Survey
No ratings yet
Prognosis of Cardiac Disease Using Data Mining Techniques A Comprehensive Survey
5 pages
Heart Disease Prediction Using Machine Learning IJERTV9IS080128
No ratings yet
Heart Disease Prediction Using Machine Learning IJERTV9IS080128
3 pages
Heart Disease Prediction Using Data Mining Techniques IJERTV10IS020083
No ratings yet
Heart Disease Prediction Using Data Mining Techniques IJERTV10IS020083
7 pages
Feature Selection For Classification in Medical Data Mining: Volume 2, Issue 2, March - April 2013
No ratings yet
Feature Selection For Classification in Medical Data Mining: Volume 2, Issue 2, March - April 2013
6 pages
6245e19c618b73 12171037
No ratings yet
6245e19c618b73 12171037
9 pages
Heart Disease PredictionUsing
No ratings yet
Heart Disease PredictionUsing
6 pages
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
No ratings yet
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
9 pages
Disease Prediction Using Data Mining
No ratings yet
Disease Prediction Using Data Mining
5 pages
Analysis of Heart Disease Using in Data Mining Tools Orange and Weka
No ratings yet
Analysis of Heart Disease Using in Data Mining Tools Orange and Weka
7 pages
Heart Prediction
No ratings yet
Heart Prediction
15 pages
Implementation of An Incremental Deep Learning Model For Survival Prediction of Cardiovascular Patients
No ratings yet
Implementation of An Incremental Deep Learning Model For Survival Prediction of Cardiovascular Patients
9 pages
Heart Disease Prediction via Firefly-Optimized Stacked Ensemble
No ratings yet
Heart Disease Prediction via Firefly-Optimized Stacked Ensemble
14 pages
Mcmi 22 011
No ratings yet
Mcmi 22 011
11 pages
Heart Disease Prediction Using Machine Learning Te
No ratings yet
Heart Disease Prediction Using Machine Learning Te
7 pages
AI-based Smart Prediction of Clinical Disease Using Random Forest Classifier and Naive Bayes
No ratings yet
AI-based Smart Prediction of Clinical Disease Using Random Forest Classifier and Naive Bayes
22 pages
Report On Multiple Disease Prediction Using Machine Learning Algorithms
No ratings yet
Report On Multiple Disease Prediction Using Machine Learning Algorithms
14 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
9 pages
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
No ratings yet
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
6 pages
Heart Disease Python Report 1st Phase
No ratings yet
Heart Disease Python Report 1st Phase
33 pages
Heart Disease Prediction Using Machine Learning Techniques: A Survey
No ratings yet
Heart Disease Prediction Using Machine Learning Techniques: A Survey
5 pages
Docs 7754347436162e1e4b7406
No ratings yet
Docs 7754347436162e1e4b7406
18 pages
Fadnavis 2021 J. Phys. Conf. Ser. 1913 012099
No ratings yet
Fadnavis 2021 J. Phys. Conf. Ser. 1913 012099
7 pages
Heart Disease
No ratings yet
Heart Disease
6 pages
ML for Heart Disease Prediction
No ratings yet
ML for Heart Disease Prediction
4 pages
Heart Disease Predication
No ratings yet
Heart Disease Predication
40 pages
New Research New 1
No ratings yet
New Research New 1
5 pages
Heart Disease Prediction Using Machine Learning Te
No ratings yet
Heart Disease Prediction Using Machine Learning Te
5 pages
Data Mining in Healthcare Systems
No ratings yet
Data Mining in Healthcare Systems
4 pages
Jut 2
No ratings yet
Jut 2
12 pages
Prediction of Heart Disease Using Data M
No ratings yet
Prediction of Heart Disease Using Data M
6 pages
NB 1
No ratings yet
NB 1
7 pages
J Imu 2019 100203
No ratings yet
J Imu 2019 100203
18 pages
Overview en 60 025 094iv
No ratings yet
Overview en 60 025 094iv
50 pages
UV Method for Esomeprazole Validation
No ratings yet
UV Method for Esomeprazole Validation
29 pages
Psychological Tests in Counselling: September 2020
No ratings yet
Psychological Tests in Counselling: September 2020
13 pages
Demystifying Risk Assessment - Key Principles and Controversies - CCI 2017
No ratings yet
Demystifying Risk Assessment - Key Principles and Controversies - CCI 2017
30 pages
T8730 Universal Flow Tester Guide
No ratings yet
T8730 Universal Flow Tester Guide
2 pages
Malaysian Standard
No ratings yet
Malaysian Standard
20 pages
WASMARF
No ratings yet
WASMARF
55 pages
Final Research Paper Arduino 1 1
No ratings yet
Final Research Paper Arduino 1 1
50 pages
Advanced Practical Skills - Physics
No ratings yet
Advanced Practical Skills - Physics
16 pages
Klein Tools CL2000 Clamp Meter Guide
No ratings yet
Klein Tools CL2000 Clamp Meter Guide
2 pages
Ballbar Operation Manual
No ratings yet
Ballbar Operation Manual
18 pages
Spine Magnetic Resonance Image Segmentation Using Deep Learning Techniques
No ratings yet
Spine Magnetic Resonance Image Segmentation Using Deep Learning Techniques
6 pages
E 251 - 92 (2014)
No ratings yet
E 251 - 92 (2014)
20 pages
Coal Blending Compliance Strategies
No ratings yet
Coal Blending Compliance Strategies
0 pages
SIH PPT Rainfall
No ratings yet
SIH PPT Rainfall
4 pages
Advanced Boundary Detection Model
No ratings yet
Advanced Boundary Detection Model
25 pages
Bersin Ammunition Measuring Tool
100% (2)
Bersin Ammunition Measuring Tool
11 pages
ASTM D95 Water Content
100% (2)
ASTM D95 Water Content
6 pages
PSA-LA2500B User Manual
No ratings yet
PSA-LA2500B User Manual
37 pages
Predicting Accuracy of Players in The Cricket Using Machine Learning
No ratings yet
Predicting Accuracy of Players in The Cricket Using Machine Learning
11 pages
PUMA Forward Kinematics
No ratings yet
PUMA Forward Kinematics
11 pages
Levelling Report
No ratings yet
Levelling Report
8 pages
IB Math SL Paper 2 Markscheme
No ratings yet
IB Math SL Paper 2 Markscheme
20 pages
Citizen Monitoring QA/QC Guide
No ratings yet
Citizen Monitoring QA/QC Guide
2 pages
Six Axis Force / Torque Transducer FT Transducer: Assembly and Operating Manual
No ratings yet
Six Axis Force / Torque Transducer FT Transducer: Assembly and Operating Manual
176 pages
Accuracy and Precision Lab Report
No ratings yet
Accuracy and Precision Lab Report
9 pages
Business - Report-Comp-Fin - Data - Part A - Problem
No ratings yet
Business - Report-Comp-Fin - Data - Part A - Problem
17 pages
Truthfulness and Accuracy Quiz
No ratings yet
Truthfulness and Accuracy Quiz
2 pages
Critical Thinking Grid
No ratings yet
Critical Thinking Grid
3 pages
Journal Amdal PDF
No ratings yet
Journal Amdal PDF
18 pages

BP 2

Uploaded by

BP 2

Uploaded by

A SURVEY ON TECHNIQUE FOR PREDICTION OF

DISEASE IN MEDICAL DATA

2017 International Conference On Smart Technology for Smart Nation 551

The comparative study of heart attack disease

Authors Year Accuracy Survey on Breast Cancer Diagnosis

552 2017 International Conference On Smart Technology for Smart Nation

2017 International Conference On Smart Technology for Smart Nation 553

D. Chen et al. (2001) proposed a novel system using data

Choi Chul Sang et al. (2001) proposed a model using

554 2017 International Conference On Smart Technology for Smart Nation

2017 International Conference On Smart Technology for Smart Nation 555

You might also like