Ai/Ml Lab-4: Name: Pratik Jadhav PRN: 20190802050

The document implements two machine learning classifiers on the iris data set: 1) a k-nearest neighbours classifier, which reaches 96.67% accuracy on its held-out test set; 2) a Gaussian naive Bayes classifier, which reaches 100% accuracy on its held-out test set.

10/8/21, 1:09 PM 20190802050_DS_Lab4

AI/ML LAB-4
Name: Pratik Jadhav

PRN: 20190802050

AIM: To implement two algorithms on a data set and compute the accuracy score of the predictions

Q1. Write a program to implement k-Nearest Neighbour algorithm to classify the iris data
set. Print both correct and wrong predictions. Java/Python ML library classes can be used
for this problem.

In [1]:
%matplotlib inline
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

In [2]:
iris_data = pd.read_csv("Iris.csv")
iris_data.head()

Out[2]:
   Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm      Species
0   1            5.1           3.5            1.4           0.2  Iris-setosa
1   2            4.9           3.0            1.4           0.2  Iris-setosa
2   3            4.7           3.2            1.3           0.2  Iris-setosa
3   4            4.6           3.1            1.5           0.2  Iris-setosa
4   5            5.0           3.6            1.4           0.2  Iris-setosa

In [3]:
len(iris_data)

Out[3]: 150

In [4]:
iris_data.isna().sum()

Out[4]:
Id               0
SepalLengthCm    0
SepalWidthCm     0
PetalLengthCm    0
PetalWidthCm     0
Species          0
dtype: int64

localhost:8888/nbconvert/html/20190802050_DS_Lab4.ipynb?download=false 1/5

In [5]:
X = iris_data.drop("Species", axis=1)
y = iris_data["Species"]
len(X), len(y)

Out[5]: (150, 150)
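
A note on the feature matrix: `X` above still contains the `Id` column, which is a row counter rather than a flower measurement. If the CSV is ordered by species (the first five rows in Out[2] are all Iris-setosa, which is consistent with that but does not prove it), `Id` could act as a proxy for the label and distort both the distances used by k-NN and the reported accuracies. A minimal sketch of dropping it, using a tiny hypothetical stand-in for `Iris.csv` with the same column names:

```python
import pandas as pd

# Tiny stand-in for Iris.csv (hypothetical rows, same column names as Out[2]).
iris_data = pd.DataFrame({
    "Id": [1, 2, 3],
    "SepalLengthCm": [5.1, 7.0, 6.3],
    "SepalWidthCm": [3.5, 3.2, 3.3],
    "PetalLengthCm": [1.4, 4.7, 6.0],
    "PetalWidthCm": [0.2, 1.4, 2.5],
    "Species": ["Iris-setosa", "Iris-versicolor", "Iris-virginica"],
})

# Drop the label and the row counter so only the four measurements remain.
X = iris_data.drop(columns=["Id", "Species"])
y = iris_data["Species"]

print(list(X.columns))
```

Scores obtained this way would not necessarily match the ones recorded in this notebook, since those were computed with `Id` included.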

In [6]:
from sklearn.neighbors import KNeighborsClassifier
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.2,
                                                    random_state=1)
clf = KNeighborsClassifier(n_neighbors=3)
clf.fit(X_train, y_train)
clf.score(X_test, y_test)

Out[6]: 0.9666666666666667
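
To make the library call above less of a black box, here is a minimal from-scratch sketch of the same idea: Euclidean distance plus an unweighted majority vote among the k nearest training points, which is what `KNeighborsClassifier` does with its default settings. The function name `knn_predict` and the toy data are hypothetical and only for illustration; this is not the scikit-learn implementation.

```python
import numpy as np
from collections import Counter

def knn_predict(X_train, y_train, x_query, k=3):
    """Predict the label of x_query by majority vote among its k nearest training points."""
    # Euclidean distance from the query to every training row.
    distances = np.linalg.norm(X_train - x_query, axis=1)
    # Indices of the k smallest distances.
    nearest = np.argsort(distances)[:k]
    # Majority vote over the neighbours' labels.
    votes = Counter(y_train[i] for i in nearest)
    return votes.most_common(1)[0][0]

# Hypothetical toy data: petal length and petal width for six flowers.
X_toy = np.array([[1.4, 0.2], [1.3, 0.2], [4.7, 1.4],
                  [4.5, 1.5], [6.0, 2.5], [5.9, 2.1]])
y_toy = np.array(["setosa", "setosa", "versicolor",
                  "versicolor", "virginica", "virginica"])

print(knn_predict(X_toy, y_toy, np.array([1.5, 0.3])))  # two of the three neighbours are setosa
```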

In [7]:
y_preds = clf.predict(X_test)
y_preds[:10]

Out[7]:
array(['Iris-setosa', 'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',
       'Iris-virginica', 'Iris-versicolor', 'Iris-virginica',
       'Iris-setosa', 'Iris-setosa', 'Iris-virginica'], dtype=object)

In [8]:
y_preds_proba = clf.predict_proba(X_test)
y_preds_proba[:10]

Out[8]:
array([[1., 0., 0.],
       [0., 1., 0.],
       [0., 1., 0.],
       [1., 0., 0.],
       [0., 0., 1.],
       [0., 1., 0.],
       [0., 0., 1.],
       [1., 0., 0.],
       [1., 0., 0.],
       [0., 0., 1.]])
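
With the default uniform weights, each row returned by `predict_proba` is the fraction of the k neighbours that belong to each class, so rows of pure 0s and 1s like the ones above mean all three neighbours agreed. A small sketch on hypothetical one-dimensional toy data shows both a unanimous and a split vote:

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

# Hypothetical toy data: one feature, two classes.
X_toy = np.array([[0.0], [1.0], [2.0], [5.0], [6.0], [7.0]])
y_toy = np.array(["a", "a", "a", "b", "b", "b"])

toy_clf = KNeighborsClassifier(n_neighbors=3).fit(X_toy, y_toy)

# Query 1.0: neighbours are 0.0, 1.0 and 2.0, all class "a".
# Query 3.6: neighbours are 5.0 ("b"), 2.0 ("a") and 6.0 ("b").
toy_proba = toy_clf.predict_proba(np.array([[1.0], [3.6]]))
print(toy_clf.classes_)
print(toy_proba)
```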

In [9]:
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

# Note: scikit-learn metrics expect (y_true, y_pred). Predictions are passed first here,
# so in the report below precision and recall are swapped, support counts predicted
# labels, and the confusion matrix is transposed. Accuracy is unaffected.
accuracy = accuracy_score(y_preds, y_test)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%\n")
print(f"Classification Report: {classification_report(y_preds, y_test)}\n")
print(f"Confusion Matrix: \n{confusion_matrix(y_preds, y_test)}")

The accuracy of the ML model for iris data: 96.67%

Classification Report:                  precision    recall  f1-score   support

    Iris-setosa       1.00      1.00      1.00        11
Iris-versicolor       0.92      1.00      0.96        12
 Iris-virginica       1.00      0.86      0.92         7

       accuracy                           0.97        30
      macro avg       0.97      0.95      0.96        30
   weighted avg       0.97      0.97      0.97        30

Confusion Matrix:
[[11  0  0]
 [ 0 12  0]
 [ 0  1  6]]
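
The scikit-learn metric functions take the true labels first and the predictions second. The cell above passes them the other way round, which leaves accuracy unchanged but transposes the confusion matrix and swaps precision with recall. A small sketch with hypothetical labels makes the effect visible:

```python
from sklearn.metrics import confusion_matrix, precision_score, recall_score

# Hypothetical labels: one "cat" is misclassified as "dog".
y_true = ["cat", "cat", "cat", "dog"]
y_pred = ["cat", "cat", "dog", "dog"]

# Conventional order: rows are true labels, columns are predicted labels.
cm = confusion_matrix(y_true, y_pred)
# Swapped order: the same counts, transposed.
cm_swapped = confusion_matrix(y_pred, y_true)

# Precision and recall for "dog" trade places when the arguments are swapped.
precision = precision_score(y_true, y_pred, pos_label="dog")
recall = recall_score(y_true, y_pred, pos_label="dog")
precision_swapped = precision_score(y_pred, y_true, pos_label="dog")

print(cm)
print(cm_swapped)
print(precision, recall, precision_swapped)
```

Read with that in mind, the matrix above says one flower predicted as Iris-virginica was actually Iris-versicolor.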

In [10]:
from sklearn.model_selection import cross_val_score

cvs = cross_val_score(clf, X, y)
print(cvs)
print(f"Mean of each testing data set: {np.mean(cvs) * 100:.2f}%")

[0.66666667 1.         1.         1.         0.7       ]
Mean of each testing data set: 87.33%
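
When no `cv` argument is given, `cross_val_score` uses 5 folds, and for a classifier those folds are stratified but not shuffled. The sketch below makes the splitter explicit and shuffles before splitting. It assumes scikit-learn's bundled `load_iris` data in place of `Iris.csv` (so there is no `Id` column), which means its scores are not expected to reproduce the numbers above.

```python
from sklearn.datasets import load_iris
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.neighbors import KNeighborsClassifier

# Assumption: the bundled iris data stands in for Iris.csv.
X_iris, y_iris = load_iris(return_X_y=True)

# Explicit 5-fold stratified splitter; shuffling avoids depending on row order.
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=1)
scores = cross_val_score(KNeighborsClassifier(n_neighbors=3), X_iris, y_iris, cv=cv)

print(scores)
print(f"Mean accuracy over {len(scores)} folds: {scores.mean() * 100:.2f}%")
```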

In [11]:
y_testing = pd.Series(y_test).reset_index().drop("index", axis=1)
y_predictions = pd.Series(y_preds)

In [12]:
predictions_df = pd.DataFrame(data={
    "Species": y_testing["Species"],
    "Predicted Species": y_predictions
})

In [13]:
predicts = []
for index, i in enumerate(y_testing["Species"]):
    if i == y_preds[index]:
        predicts.append("Correct")
    else:
        predicts.append("Wrong")

In [14]:
predictions_df["Correct or Wrong"] = pd.Series(predicts)
predictions_df.head()

Out[14]:
           Species Predicted Species Correct or Wrong
0      Iris-setosa       Iris-setosa          Correct
1  Iris-versicolor   Iris-versicolor          Correct
2  Iris-versicolor   Iris-versicolor          Correct
3      Iris-setosa       Iris-setosa          Correct
4   Iris-virginica    Iris-virginica          Correct

In [15]:
print(f"Total Correct or Wrong Predictions:\n\
{predictions_df['Correct or Wrong'].value_counts()}")

Total Correct or Wrong Predictions:
Correct    29
Wrong       1
Name: Correct or Wrong, dtype: int64
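
The labelling loop in cells 11 to 14 can also be written without an explicit loop. The sketch below is one possible alternative, using `np.where` on hypothetical stand-ins for `y_test` and `y_preds`; it also filters out the wrong rows, since the question asks for both correct and wrong predictions to be printed.

```python
import numpy as np
import pandas as pd

# Hypothetical stand-ins for y_test and y_preds from the cells above.
y_true_demo = pd.Series(["Iris-setosa", "Iris-versicolor", "Iris-virginica"], name="Species")
y_pred_demo = np.array(["Iris-setosa", "Iris-virginica", "Iris-virginica"])

demo_df = pd.DataFrame({
    "Species": y_true_demo.to_numpy(),   # to_numpy() discards the shuffled index
    "Predicted Species": y_pred_demo,
})

# Element-wise comparison replaces the for-loop and the intermediate list.
demo_df["Correct or Wrong"] = np.where(
    demo_df["Species"] == demo_df["Predicted Species"], "Correct", "Wrong"
)

# Show only the misclassified rows.
wrong_rows = demo_df[demo_df["Correct or Wrong"] == "Wrong"]
print(wrong_rows)
```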

Q2. Write a program to implement the naïve Bayesian classifier for a sample training data
set stored as a .CSV file. Compute the accuracy of the classifier, considering few test data
sets.

In [16]:
iris_data = pd.read_csv("Iris.csv")
iris_data.head()

Out[16]:
   Id  SepalLengthCm  SepalWidthCm  PetalLengthCm  PetalWidthCm      Species
0   1            5.1           3.5            1.4           0.2  Iris-setosa
1   2            4.9           3.0            1.4           0.2  Iris-setosa
2   3            4.7           3.2            1.3           0.2  Iris-setosa
3   4            4.6           3.1            1.5           0.2  Iris-setosa
4   5            5.0           3.6            1.4           0.2  Iris-setosa

In [17]:
X = iris_data.drop("Species", axis=1)
y = iris_data["Species"]
len(X), len(y)

Out[17]: (150, 150)

In [18]:
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split

X_train, X_test, y_train, y_test = train_test_split(X, y,
                                                    test_size=0.3,
                                                    random_state=1)
gnb = GaussianNB()
gnb.fit(X_train, y_train)
gnb.score(X_test, y_test)

Out[18]: 1.0
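
For readers who want to see what a Gaussian naive Bayes classifier computes, here is a minimal from-scratch sketch: per class it estimates a prior and a mean and variance for each feature, then picks the class with the largest log prior plus summed per-feature Gaussian log densities. The function names, the toy data, and the fixed `1e-9` variance floor are assumptions made for illustration; `GaussianNB` handles variance smoothing differently.

```python
import numpy as np

def fit_gaussian_nb(X, y):
    """Estimate per-class prior, feature means, and feature variances."""
    params = {}
    for label in np.unique(y):
        rows = X[y == label]
        # Small constant keeps variances strictly positive (simplified smoothing).
        params[label] = (len(rows) / len(X), rows.mean(axis=0), rows.var(axis=0) + 1e-9)
    return params

def predict_gaussian_nb(params, x):
    """Return the class with the highest log posterior for one sample x."""
    best_label, best_score = None, -np.inf
    for label, (prior, mean, var) in params.items():
        # Features are treated as independent, so their log densities add up.
        log_likelihood = -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mean) ** 2 / var)
        score = np.log(prior) + log_likelihood
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Hypothetical toy data: petal length and petal width.
X_toy = np.array([[1.4, 0.2], [1.3, 0.2], [1.5, 0.3],
                  [6.0, 2.5], [5.9, 2.1], [5.6, 1.8]])
y_toy = np.array(["setosa", "setosa", "setosa",
                  "virginica", "virginica", "virginica"])

nb_params = fit_gaussian_nb(X_toy, y_toy)
print(predict_gaussian_nb(nb_params, np.array([1.4, 0.25])))
print(predict_gaussian_nb(nb_params, np.array([5.8, 2.2])))
```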

In [19]:
y_preds = gnb.predict(X_test)
y_preds[:10]

Out[19]:
array(['Iris-setosa', 'Iris-versicolor', 'Iris-versicolor', 'Iris-setosa',
       'Iris-virginica', 'Iris-versicolor', 'Iris-virginica',
       'Iris-setosa', 'Iris-setosa', 'Iris-virginica'], dtype='<U15')

In [20]:
from sklearn.metrics import accuracy_score

# scikit-learn metrics take the true labels first, then the predictions.
accuracy = accuracy_score(y_test, y_preds)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%")

The accuracy of the ML model for iris data: 100.00%

In [21]:
from sklearn.model_selection import cross_val_score

cvs = cross_val_score(gnb, X, y)
print(cvs)
print(f"Mean of each testing data set: {np.mean(cvs) * 100:.2f}%")

[0.96666667 1.         1.         1.         1.        ]
Mean of each testing data set: 99.33%

In [22]:
from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

accuracy = accuracy_score(y_test, y_preds)
print(f"The accuracy of the ML model for iris data: {accuracy * 100:.2f}%\n")
print(f"Classification Report: {classification_report(y_test, y_preds)}\n")
print(f"Confusion Matrix: \n{confusion_matrix(y_test, y_preds)}")

The accuracy of the ML model for iris data: 100.00%

Classification Report:                  precision    recall  f1-score   support

    Iris-setosa       1.00      1.00      1.00        14
Iris-versicolor       1.00      1.00      1.00        18
 Iris-virginica       1.00      1.00      1.00        13

       accuracy                           1.00        45
      macro avg       1.00      1.00      1.00        45
   weighted avg       1.00      1.00      1.00        45

Confusion Matrix:
[[14  0  0]
 [ 0 18  0]
 [ 0  0 13]]

Conclusion: We implemented the k-nearest neighbours and Gaussian naive Bayes algorithms on the iris data set and computed the accuracy and other evaluation metrics on the predictions. The k-nearest neighbours classifier reached an accuracy of 96.67% on the held-out test set and a mean accuracy of 87.33% across the five cross-validation folds. The naive Bayes classifier reached an accuracy of 100% on its held-out test set and a mean accuracy of 99.33% across the five cross-validation folds.
