0% found this document useful (0 votes)

12 views4 pages

Mini Research

This document discusses the application of machine learning techniques, specifically K-Nearest Neighbor (K-NN) and Random Forest, for predicting heart disease risk based on various patient characteristics. The study evaluates the performance of these algorithms, achieving an accuracy of 79.0% for K-NN and 80.7% for Random Forest. The findings suggest that machine learning can significantly aid in early diagnosis and management of cardiovascular diseases.

Uploaded by

Moneer Ali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

12 views4 pages

Mini Research

Uploaded by

Moneer Ali

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Heart disease prediction using machine learning

techniques
Mohammed Ramadan Mohammeed
Computer Sciences (AI)
University of Benghazi
Benghazi, Libya
mohammed.ramadan@uob.edu.ly

Abstract—one of the most well-known uses of artificial II. LITERATURE REVIEW

intelligence, machine learning (ML), is revolutionizing
Through research in this area, techniques for predicting
the field of study. In this work, the use of machine cardiovascular disease using supervised machine learning
learning to determine a person's risk of heart disease is algorithms have been developed. On this subject, several
discussed. Cardiovascular diseases (CVDs) are common study articles have been prepared. A report surveying the
and can possibly be fatal for people anywhere in the globe. performance of many models based on machine learning
A person's age, cholesterol level, chest discomfort, and algorithms and methodologies has been given. [4]. One of the
other characteristics may all be taken into account using studies describes efforts to develop a Graphical User Interface
machine learning to determine if they have a (GUI) that uses a Weighted Association rule-based classifier
cardiovascular disease. Cardiovascular disease diagnosis to determine if a person has heart disease or not [5]. A novel
can be facilitated by machine learning classification method for the prediction of cardiac illness based on the
algorithms based on supervised learning. To distinguish coactive neuro-fuzzy interference system (CANFIS) has been
between individuals with and without cardiac disease, reported in another study [6]. In one of the publications [7],
algorithms such as Random Forest and K-Nearest the methods frequently used to forecast cardiac disease and
Neighbor (KNN) are utilized. This study uses two their associated difficulties are summarized. One of the studies
supervised machine learning algorithms: Random Forest [8] described a classifier strategy for the identification of heart
and K-Nearest Neighbor (K-NN). K-Nearest Neighbor disease and demonstrates the usage of Naive Bayes for
(K-NN) yielded a prediction accuracy of 79.0%, whereas classification purposes. One of the publications conducts a
the Random Forest method produced an accuracy of survey comprising several papers whereby one or more data
mining techniques have been applied to forecast heart disease
80.7%.
[9].
Keywords
III. PROPOSED METHODS
Heart Disease, Random Forest, K Nearest Neighbor (K-NN), A. K-Nearest Neighbor (K-NN)
Machine Learning
For classification tasks, a well-liked machine learning
I. INTRODUCTION algorithm is K-Nearest Neighbors (K-NN). Data points are
categorized using this non-parametric algorithm according to
Human body is made up of various organs, all of which how close they are to one another in a feature space. The K-
have their own functions. Heart is one such organ which NN algorithm counts the number of neighbors, represented by
pumps blood throughout the body and if it does not do so, the the letter k, that will be taken into consideration for
human body can have fatal circumstances. One of the main classification when a new data point is encountered and its
reasons of mortality today is having a heart disease [1]. So, it category or class is unknown. Usually, the user specifies this
becomes necessary to make sure that our cardiovascular value of k in advance or finds it through cross-validation.
system or any other system in the human body for that matter
must remain healthy. Unfortunately, people all around the Because it is based on the notion that data points belonging
world have been facing cardiovascular diseases. Any to the same class tend to be closer to one another in the feature
technology that can help diagnose these diseases before much space, this technique makes the K-NN algorithm a
damage is done will prove as helpful in saving people’s money straightforward but efficient technique for classification tasks.
and more importantly their lives. Data mining techniques can The method makes predictions for unknown data points by
be useful in predicting heart diseases. Predictive models can utilizing the local structure of the data by taking into account
be made by finding previously unknown patterns and trends the class labels of the closest neighbors.
in databases and using the obtained information [2]. To extract B. Random Forest
knowledge from vast volumes of data is to engage in data
mining [3]. One technological advancement that can assist in The Random Forest algorithm is a powerful ensemble
diagnosing cardiac disease early on before significant harm is learning method that combines multiple decision trees to make
done to an individual is machine learning. Machine learning predictions. Using the training set, it constructs a set of
is a rapidly developing subject in science and technology that decision trees, each of which independently generates a
has the ability to diagnose and categorize patients based on predicted class as an output. When it comes to classification
their risk of heart disease. tasks, the final prediction is the class that appears the most
frequently throughout all decision trees.

XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE

Random Forest uses the wisdom of the crowd to produce
predictions that are more reliable and accurate by building a
variety of decision trees and combining their forecasts. The
premise of this ensemble approach is that, despite the potential
biases and limitations of each decision tree, the process of
collective decision-making can make up for these drawbacks.
IV. EXPERIMENTAL SETUP
Getting a dataset containing the traits of a person with and
without heart disease is the first step in getting ready. The
dataset for this experiment may be retrieved from the Kaggle
website (https://www.kaggle.com). The Orange Machine
Learning software is the new program used in this experiment.
The data will now be analyzed using this application. To get a
quick overview of the data set, I used a tool called Data Info.
Fig 3 , Distributions Attribute of sex attribute

Fig 1 , Data info

To get certain statistical statistics for the data set, such the
Fig 4 , Distributions of chest pain attribute
average values of the characteristics used, Distributions
Attribute a service offered by Orange Machine Learning is After checking the data balance, the correlation between
utilized. Target is an attribute that is taken; if the patient has the data is discovered using a tool called Heat Map
heart disease, its value is 1, and if not, its value is 0.

Fig 2 , Distributions of target attribute

It is clear from the findings displayed by attribute

distributions that the data set employed in this investigation is
balanced.
Also use Distributions Attribute with different attributes of
the dataset such as the sex attribute which has values of 1
(male) and 0 (female) and the cp (chest pain) attribute which
shows the type of chest pain ranging from 0 to 3.
Fig 5 , Correlation between variables
The heat map unequivocally demonstrates the positive TABLE I. CONFIUSION MATRIX KNN
link between the desired characteristic and qualities like K-Nearest Neighbor (K-NN)
maximal heart rate reached (thalack) and chest pain (cp).
Having confirmed the association, the dataset has to be Actual
processed in order to turn categorical variables like sex, cp,
fbs, restecg, exang, sclop, ca, and thal into dummy variables. 0 1 ∑
To get the best results while training the models, we will

Predicted
change the values of these characteristics to a value between 0 83 26 109
0 and 1.
1 19 115 134
The training data, which makes up 80% of the total data
set in this study, and the testing data, which makes up the ∑ 102 141 243
remaining 20%, were carefully separated from the original
data set. This section enables a thorough assessment of the
machine learning algorithms used in the research. From the confusion matrix, the accuracy is calculated
The chosen machine learning algorithms were then which comes out to be 79.0 %.
applied to the training data once the data set was ready. These B. Random Forest
algorithms constructed heart disease prediction models by Component The value of number of trees is kept 10. The
leveraging the available features and attributes. In order to confusion matrix obtained was as follows.
make precise predictions, it was necessary to train these
models to recognize the underlying relationships and patterns TABLE II. CONFIUSION MATRIX RF
in the data.
Random Forest
A confusion matrix was used to evaluate the trained
models' performance. The comprehensive assessment of the Actual
algorithm's predictive capabilities is given by the confusion
matrix. Confusion matrix can also be shown as a matrix in the 0 1 ∑
following way:
Predicted

0 79 30 109

1 21 113 134

∑ 100 143 243

From the confusion matrix, the accuracy is calculated

which comes out to be 80.7%.
C. Results after applying each algorithm

TABLE III. RESULTS ALGORITHM

Algorithm Used TP FP TN FN Accuracy

K-NN 83 26 115 19 79.0%

Random Forest 79 30 113 21 80.7%

Fig 6 , Distributions Attribute of sex attribute

The accuracy of the algorithm can be calculated using the VI. CONCLUSION
formula: After putting different algorithms to use, it can be
Accuracy = {(TP + TN) / TP + FP + TN + FN)} * 100 concluded that machine learning is showing to be very helpful
in predicting heart disease, which is one of the biggest issues
Through an examination of the algorithmic accuracy, facing society today. There may soon be new techniques to
can determine how well the machine learning models predict make machine learning more beneficial in the healthcare
heart disease. A higher accuracy score indicates a more industry as more and more research is being done in this area.
reliable and precise algorithm, suggesting that it is capable of With the attributes at hand, the algorithms employed in this
making accurate predictions based on the given attributes. experiment have shown excellent performance. Finally, it can
be concluded that by anticipating heart disease, machine
V. RESULTS learning can lessen the harm done to a person's physical and
A. K-Nearest Neighbor (K-NN) mental health.
The value of k was taken as 5 in the Manhattan matrix, as VII. ACKNOWLEDGMENTS
5 was one of the values that gave the highest accuracy for the
algorithm. The confusion matrix obtained was as follows: Thanks and appreciation to Dr. Muhammad Salem and Dr.
Younis Al-Badri for everything you gave me in this semester,
and I hope that we will meet in future lessons.
VIII.REFERENCES [7] Chitra, R., & Seenivasagam, V. (2013). Review of heart disease
prediction system using data mining and hybrid intelligent techniques.
[1] Mohan, S., Thirumalai, C., & Srivastava, G. (2019). Effective heart ICTACT journal on soft computing, 3(04), 605-09.
disease prediction using hybrid machine learning techniques. IEEE
Access, 7, 81542-81554. [8] Medhekar, D. S., Bote, M. P., & Deshmukh, S. D. (2013). Heart disease
prediction system using naive Bayes. Int. J. Enhanced Res. Sci.
[2] Bhatla, N., & Jyoti, K. (2012). An analysis of heart disease prediction Technol. Eng, 2(3).
using different data mining techniques. International Journal of
Engineering, 1(8), 1-4. [9] Kaur, B., & Singh, W. (2014). Review on heart disease prediction
system using data mining techniques. International journal on recent
[3] Patel, J., TejalUpadhyay, D., & Patel, S. (2015). Heart disease and innovation trends in computing and communication, 2(10), 3003-
prediction using machine learning and data mining technique. Heart 3008.
Disease, 7(1), 129-137.
[4] Ramalingam, V. V., Dandapath, A., & Raja, M. K. (2018). Heart
disease prediction using machine learning techniques: a survey. IEEE conference templates contain guidance text for
International Journal of Engineering & Technology, 7(2.8), 684687. composing and formatting conference papers. Please
[5] Soni, J., Ansari, U., Sharma, D., & Soni, S. (2011). Intelligent and ensure that all template text is removed from your
effective heart disease prediction system using weighted associative conference paper prior to submission to the
classifiers. International Journal on Computer Science and
Engineering, 3(6), 2385-2392. conference. Failure to remove template text from
[6] Parthiban, L., & Subramanian, R. (2008). Intelligent heart disease your paper may result in your paper not being
prediction system using CANFIS and genetic algorithm. International published.
Journal of Biological, Biomedical and Medical Sciences, 3(3).

Garg 2021 IOP Conf. Ser. Mater. Sci. Eng. 1022 012046
No ratings yet
Garg 2021 IOP Conf. Ser. Mater. Sci. Eng. 1022 012046
10 pages
6245e19c618b73 12171037
No ratings yet
6245e19c618b73 12171037
9 pages
Heart Disease Prediction Using Machine Learning Te
No ratings yet
Heart Disease Prediction Using Machine Learning Te
7 pages
Final 1
No ratings yet
Final 1
36 pages
2023-Heart Disease Prediction Using Machine Learning
No ratings yet
2023-Heart Disease Prediction Using Machine Learning
11 pages
Jut 2
No ratings yet
Jut 2
12 pages
Heart Disease Python Report 1st Phase
No ratings yet
Heart Disease Python Report 1st Phase
33 pages
Heart Disease Prediction with ML
No ratings yet
Heart Disease Prediction with ML
5 pages
Heart Disease Prediction by Using Machine Learning Final Research Paper
No ratings yet
Heart Disease Prediction by Using Machine Learning Final Research Paper
8 pages
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
No ratings yet
View of Cardiovascular Heart Disease Prediction Using Machine Learning Classifiers With Data Mining Techniques
9 pages
Heart Disease Prediction
No ratings yet
Heart Disease Prediction
9 pages
Islamia College University Peshawar
No ratings yet
Islamia College University Peshawar
15 pages
Heart Disease Prediction Using Machine Learning
No ratings yet
Heart Disease Prediction Using Machine Learning
18 pages
Farzana 2020
No ratings yet
Farzana 2020
5 pages
Diagnosis and Prediction of Heart Disease Using Machine Learning Techniques
No ratings yet
Diagnosis and Prediction of Heart Disease Using Machine Learning Techniques
11 pages
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
No ratings yet
Heart Disease Prediction Using Machine Learning Techniques: Devansh Shah Samir Patel Santosh Kumar Bharti
6 pages
ML for Heart Disease Prediction
No ratings yet
ML for Heart Disease Prediction
4 pages
Machine Learning Techniques For Heart Disease Prediction: A. Lakshmanarao, Y.Swathi, P.Sri Sai Sundareswar
No ratings yet
Machine Learning Techniques For Heart Disease Prediction: A. Lakshmanarao, Y.Swathi, P.Sri Sai Sundareswar
4 pages
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
No ratings yet
Heart Disease Prediction Using Data Mining Techniques: Journal of Analysis and Computation (JAC)
8 pages
Heart Disease Prediction via Data Mining
No ratings yet
Heart Disease Prediction via Data Mining
9 pages
Heart Disease Prediction via ML
No ratings yet
Heart Disease Prediction via ML
16 pages
Heart Disease Prediction Using Machine Learning IJERTV9IS080128
No ratings yet
Heart Disease Prediction Using Machine Learning IJERTV9IS080128
3 pages
Application of Machine Learning For The Detection of Heart Disease
No ratings yet
Application of Machine Learning For The Detection of Heart Disease
8 pages
Earlier Prediction of Heart Disease Using Locality Sensitive Hashing
No ratings yet
Earlier Prediction of Heart Disease Using Locality Sensitive Hashing
10 pages
Synopsis (Heart Disease Prediction)
No ratings yet
Synopsis (Heart Disease Prediction)
7 pages
Evaluation of Cardiovascular Disease in Diabetic Patients Using Machine Learning Techniques
No ratings yet
Evaluation of Cardiovascular Disease in Diabetic Patients Using Machine Learning Techniques
13 pages
Prediction Heart Disease
No ratings yet
Prediction Heart Disease
11 pages
IJCRT2205103
No ratings yet
IJCRT2205103
10 pages
New Research New 1
No ratings yet
New Research New 1
5 pages
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
No ratings yet
Comparison of Various Data Mining Methods For Early Diagnosis of Human Cardiology
9 pages
Diagnosis of Heart Disease Using Data Mining Algorithm
No ratings yet
Diagnosis of Heart Disease Using Data Mining Algorithm
3 pages
Heart Disease Prediction Using Feature Selection and Ensemble Learning Techniques
No ratings yet
Heart Disease Prediction Using Feature Selection and Ensemble Learning Techniques
5 pages
Detection of Heart Failure Using Different Machine Learning Algorithms
No ratings yet
Detection of Heart Failure Using Different Machine Learning Algorithms
5 pages
Feb 25 - Vol. 23 No. 1
No ratings yet
Feb 25 - Vol. 23 No. 1
73 pages
ML Mini Project
No ratings yet
ML Mini Project
8 pages
Final Year Project
No ratings yet
Final Year Project
57 pages
Research Proposal
No ratings yet
Research Proposal
8 pages
Heart Disease Detection Using Machine Learning: Chithambaram T Logesh Kannan N Gowsalya M (Gowsalya.m@vit - Ac.in)
No ratings yet
Heart Disease Detection Using Machine Learning: Chithambaram T Logesh Kannan N Gowsalya M (Gowsalya.m@vit - Ac.in)
5 pages
AB Report Group 2
No ratings yet
AB Report Group 2
14 pages
Olayinka Babe-2
No ratings yet
Olayinka Babe-2
48 pages
Heart Failure Prediction Using Hybrid Method
No ratings yet
Heart Failure Prediction Using Hybrid Method
8 pages
IEEE Paper Format Template
No ratings yet
IEEE Paper Format Template
3 pages
Intelligent Heart Disease Prediction
No ratings yet
Intelligent Heart Disease Prediction
10 pages
Thesis Task 1
No ratings yet
Thesis Task 1
4 pages
Effective Models For Predicting Heart Disease Using Machine Learn - Information Sciences Letters - 2023
No ratings yet
Effective Models For Predicting Heart Disease Using Machine Learn - Information Sciences Letters - 2023
13 pages
Sat - 95.Pdf - Heart Disease Prediction Using Machine Learning Algorithms
No ratings yet
Sat - 95.Pdf - Heart Disease Prediction Using Machine Learning Algorithms
11 pages
Manuscript
No ratings yet
Manuscript
7 pages
Heart Disease Prediction Using Machine Learning
No ratings yet
Heart Disease Prediction Using Machine Learning
7 pages
Sharma Yash Thesis 2023
No ratings yet
Sharma Yash Thesis 2023
46 pages
B.Tech Seminar: Heart Disease Prediction
No ratings yet
B.Tech Seminar: Heart Disease Prediction
21 pages
Heart Disease Prediction via ML
No ratings yet
Heart Disease Prediction via ML
5 pages
Acstv10n7 13
No ratings yet
Acstv10n7 13
24 pages
Research Paper - IT - Group No 8
No ratings yet
Research Paper - IT - Group No 8
10 pages
Heart Disease Prediction Using KNN Algorithm-2
No ratings yet
Heart Disease Prediction Using KNN Algorithm-2
19 pages
Heart Disease
No ratings yet
Heart Disease
14 pages
Machine Learning in Heart Disease Prediction
No ratings yet
Machine Learning in Heart Disease Prediction
25 pages
J Imu 2019 100203
No ratings yet
J Imu 2019 100203
18 pages
Analysis of Heart Disease Prediction Using Various Machine Learning Techniques
No ratings yet
Analysis of Heart Disease Prediction Using Various Machine Learning Techniques
8 pages
Diagnosing Diabetes Using Binary Whale Optimization Algorithm-Based Feature Selection
No ratings yet
Diagnosing Diabetes Using Binary Whale Optimization Algorithm-Based Feature Selection
6 pages
Bresenham's Line Algorithm: (X +1, Y) (X +1, y +1)
No ratings yet
Bresenham's Line Algorithm: (X +1, Y) (X +1, y +1)
4 pages
Egusphere 2025 16
No ratings yet
Egusphere 2025 16
22 pages
Test and Score Data: 1997-98 Edition
No ratings yet
Test and Score Data: 1997-98 Edition
8 pages
Faculty VET Full
No ratings yet
Faculty VET Full
221 pages
Ore Reserve Estimation of Saprolite Nickel Using I
No ratings yet
Ore Reserve Estimation of Saprolite Nickel Using I
7 pages
3 Roll Plate Bending Machines
100% (1)
3 Roll Plate Bending Machines
8 pages
Robot Operating System
No ratings yet
Robot Operating System
4 pages
Lecture 3 App 403-Professional Practice 3
100% (1)
Lecture 3 App 403-Professional Practice 3
44 pages
604 Computer Graphics
No ratings yet
604 Computer Graphics
13 pages
Faith-Based Video Conferencing Guide
No ratings yet
Faith-Based Video Conferencing Guide
8 pages
Coca Cola: Introduction and History
No ratings yet
Coca Cola: Introduction and History
2 pages
Mathematics of Codes
No ratings yet
Mathematics of Codes
29 pages
Copperbelt University Online Registration System
No ratings yet
Copperbelt University Online Registration System
16 pages
MiCOM P132 Address Assignment Guide
No ratings yet
MiCOM P132 Address Assignment Guide
36 pages
Alcatel-Lucent Omniswitch 6350: Gigabit Ethernet Lan Switch Family
No ratings yet
Alcatel-Lucent Omniswitch 6350: Gigabit Ethernet Lan Switch Family
8 pages
Uputstvo Za Advance Steel 2019
No ratings yet
Uputstvo Za Advance Steel 2019
55 pages
Mathematics 5
No ratings yet
Mathematics 5
18 pages
Information Technology: A Group Chat Application Using Java
No ratings yet
Information Technology: A Group Chat Application Using Java
10 pages
Classic Data Structure D.Samanta
77% (35)
Classic Data Structure D.Samanta
404 pages
Black Controller User Guide
No ratings yet
Black Controller User Guide
9 pages
Manual SYCON 2702 (GB) PDF
No ratings yet
Manual SYCON 2702 (GB) PDF
64 pages
Ece 4219
No ratings yet
Ece 4219
2 pages
Missing Values Analysis & Data Imputation: Single User License. Do Not Copy or Post
No ratings yet
Missing Values Analysis & Data Imputation: Single User License. Do Not Copy or Post
26 pages
A Dynamic Star Spots Extraction Method Based On Pi - 2024 - Advances in Space Re
No ratings yet
A Dynamic Star Spots Extraction Method Based On Pi - 2024 - Advances in Space Re
12 pages
Tools For Inviting LaBuena Vida
No ratings yet
Tools For Inviting LaBuena Vida
5 pages
Course Syllabus POFT 1309 - Administrative Office Procedures I
No ratings yet
Course Syllabus POFT 1309 - Administrative Office Procedures I
6 pages
Frequently Asked Questions (FAQ) About Firepower Licensing
No ratings yet
Frequently Asked Questions (FAQ) About Firepower Licensing
15 pages
Lec # 7 Secant Method
No ratings yet
Lec # 7 Secant Method
14 pages
Data Security Perspectives Quiz Answers NSE 1 Information Security Awareness Fortinet
100% (1)
Data Security Perspectives Quiz Answers NSE 1 Information Security Awareness Fortinet
3 pages
Tab s10 Plus 5G
No ratings yet
Tab s10 Plus 5G
3 pages
Android Error Log Analysis
No ratings yet
Android Error Log Analysis
19 pages
CSC 314
No ratings yet
CSC 314
10 pages
Diccionario Historico Cronologico, Geografico y Universal de La Santa Biblia T2 - Joseph Armesto y Goyanes 1790
No ratings yet
Diccionario Historico Cronologico, Geografico y Universal de La Santa Biblia T2 - Joseph Armesto y Goyanes 1790
397 pages
SpinView Getting Started
No ratings yet
SpinView Getting Started
12 pages

Mini Research

Uploaded by

Mini Research

Uploaded by

Heart disease prediction using machine learning

Abstract—one of the most well-known uses of artificial II. LITERATURE REVIEW

XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE

Fig 1 , Data info

Fig 2 , Distributions of target attribute

It is clear from the findings displayed by attribute

∑ 100 143 243

From the confusion matrix, the accuracy is calculated

TABLE III. RESULTS ALGORITHM

Algorithm Used TP FP TN FN Accuracy

K-NN 83 26 115 19 79.0%

Random Forest 79 30 113 21 80.7%

You might also like