0% found this document useful (0 votes)

25 views6 pages

Apply Linear Regression Model Techniques To Predict Data On Any Dataset

Uploaded by

sonawaneabhishek69

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views6 pages

Apply Linear Regression Model Techniques To Predict Data On Any Dataset

Uploaded by

sonawaneabhishek69

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.

ipynb

4. Apply Linear Regression Model techniques to predict data on any

dataset.

In [18]: import pandas as pd

import numpy as np
import matplotlib.pyplot as plt
import seaborn as sns

In [19]: df = pd.read_csv(r"C:\Users\ABHISHEK\Downloads\LungCapData - LungCapData.csv")

In [20]: df.head()

Out[20]: LungCap Age Height Smoke Gender Caesarean

0 6.475 6 62.1 no male no

1 10.125 18 74.7 yes female no

2 9.550 16 69.7 no female yes

3 11.125 14 71.0 no male no

4 4.800 5 56.9 no male no

In [21]: df.isnull().sum()

Out[21]: LungCap 0
Age 0
Height 0
Smoke 0
Gender 0
Caesarean 0
dtype: int64

In [22]: df.info()

<class 'pandas.core.frame.DataFrame'>
RangeIndex: 725 entries, 0 to 724
Data columns (total 6 columns):
# Column Non-Null Count Dtype
--- ------ -------------- -----
0 LungCap 725 non-null float64
1 Age 725 non-null int64
2 Height 725 non-null float64
3 Smoke 725 non-null object
4 Gender 725 non-null object
5 Caesarean 725 non-null object
dtypes: float64(2), int64(1), object(3)
memory usage: 34.1+ KB

1 of 6 30-10-2024, 22:06
4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.ipynb

In [23]: from sklearn.preprocessing import LabelEncoder

from sklearn.model_selection import train_test_split

In [24]: le = LabelEncoder()

In [25]: df.Smoke = le.fit_transform(df.Smoke)

In [26]: df.Gender = le.fit_transform(df.Gender)

In [27]: df.Caesarean = le.fit_transform(df.Caesarean)

In [28]: df.head()

Out[28]: LungCap Age Height Smoke Gender Caesarean

0 6.475 6 62.1 0 1 0

1 10.125 18 74.7 1 0 0

2 9.550 16 69.7 0 0 1

3 11.125 14 71.0 0 1 0

4 4.800 5 56.9 0 1 0

In [29]: x = df.drop(['LungCap'],axis = 1)

In [30]: x

Out[30]: Age Height Smoke Gender Caesarean

0 6 62.1 0 1 0

1 18 74.7 1 0 0

2 16 69.7 0 0 1

3 14 71.0 0 1 0

4 5 56.9 0 1 0

... ... ... ... ... ...

720 9 56.0 0 0 0

721 18 72.0 1 1 1

722 11 60.5 1 0 0

723 15 64.9 0 0 0

724 10 67.7 0 1 0

725 rows × 5 columns

In [31]: y = df['LungCap']

2 of 6 30-10-2024, 22:06
4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.ipynb

In [32]: X_test,X_train,y_test,y_train = train_test_split(x,y,test_size = 0.2,random_state

In [34]: X_train.shape,y_train.shape

Out[34]: ((145, 5), (145,))

In [35]: from sklearn.linear_model import LinearRegression

In [36]: lr = LinearRegression()

In [37]: lr.fit(X_train,y_train)

Out[37]: LinearRegression()
In a Jupyter environment, please rerun this cell to show the HTML representation or trust
the notebook.
On GitHub, the HTML representation is unable to render, please try loading this page
with nbviewer.org.

In [38]: y_pred = lr.predict(X_test)

In [39]: from sklearn.metrics import mean_squared_error,r2_score,mean_absolute_error

In [40]: mse = mean_squared_error(y_test,y_pred)

In [41]: err_train = y_test - y_pred

In [43]: mse = np.mean(np.square(err_train))

In [44]: mse

Out[44]: 1.0559850321341964

In [45]: rmse = np.sqrt(mse)

In [46]: rmse

Out[46]: 1.0276113234750754

In [47]: r2_score(y_test,y_pred)

Out[47]: 0.8511008247863296

3 of 6 30-10-2024, 22:06
4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.ipynb

In [48]: plt.plot(err_train,"*")

Out[48]: [<matplotlib.lines.Line2D at 0x165af248dc0>]

In [51]: plt.hist(err_train,bins=20,edgecolor='g')
plt.grid()

In [55]: y_test.shape,y_pred.shape

Out[55]: ((580,), (580,))

In [65]: d = {"Actual":(y_test),
"Predicted":(y_pred)}

In [66]: pred_actual_df = pd.DataFrame(d)

4 of 6 30-10-2024, 22:06
4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.ipynb

In [67]: pred_actual_df

Out[67]: Actual Predicted

446 6.300 6.251456

6 4.950 6.996499

423 7.800 9.078604

596 3.925 4.716138

411 8.675 8.229591

... ... ...

71 9.700 9.940940

106 10.875 11.602824

270 6.100 5.671011

435 11.300 10.971752

102 3.450 6.361120

580 rows × 2 columns

In [69]: sns.jointplot(x ='Actual',y = 'Predicted',data= pred_actual_df ,kind = 'reg')

plt.grid()

5 of 6 30-10-2024, 22:06
4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.ipynb

In [ ]:

6 of 6 30-10-2024, 22:06

Logistic Regression for Heart Disease
No ratings yet
Logistic Regression for Heart Disease
8 pages
Heart Disease Diagnosis Using Machine Learning
No ratings yet
Heart Disease Diagnosis Using Machine Learning
26 pages
Logistic Regression
No ratings yet
Logistic Regression
12 pages
Stroke Prediction
No ratings yet
Stroke Prediction
14 pages
Heart - Cleveland - Ipynb - Colab
No ratings yet
Heart - Cleveland - Ipynb - Colab
5 pages
Project
No ratings yet
Project
8 pages
ExNo 08ml
No ratings yet
ExNo 08ml
4 pages
Logistic - Ipynb - Colaboratory
No ratings yet
Logistic - Ipynb - Colaboratory
6 pages
AML Sessional 1 Students
No ratings yet
AML Sessional 1 Students
16 pages
Medical Cost Prediction
No ratings yet
Medical Cost Prediction
27 pages
ASSIGNMENT II - Logistic Regression (Sukanya Das - 221001001006)
No ratings yet
ASSIGNMENT II - Logistic Regression (Sukanya Das - 221001001006)
10 pages
Heart Failure Prediction
100% (1)
Heart Failure Prediction
41 pages
Assignment 1
No ratings yet
Assignment 1
10 pages
Diabetes
No ratings yet
Diabetes
7 pages
Heart Attack Prediction Model EDA
100% (1)
Heart Attack Prediction Model EDA
24 pages
Apply Logistic Regression Model Techniques To Predict Data On Any Dataset
No ratings yet
Apply Logistic Regression Model Techniques To Predict Data On Any Dataset
5 pages
ML Practical 04
No ratings yet
ML Practical 04
20 pages
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
No ratings yet
Week - 6 - SWI - MLP - LogisticRegression - Ipynb - Colaboratory
15 pages
Diabetes Prediction Model Guide
No ratings yet
Diabetes Prediction Model Guide
20 pages
Heart Attack
No ratings yet
Heart Attack
18 pages
Linear Merged Pagenumber
No ratings yet
Linear Merged Pagenumber
48 pages
Inbound 3085046103164618170
No ratings yet
Inbound 3085046103164618170
2 pages
LAB8 LogisticReg HeartDisease
No ratings yet
LAB8 LogisticReg HeartDisease
31 pages
Ml4.ipynb - Colab
No ratings yet
Ml4.ipynb - Colab
3 pages
6034 Logistic Regression
No ratings yet
6034 Logistic Regression
6 pages
AI Mini Project
No ratings yet
AI Mini Project
6 pages
Assignment 1
No ratings yet
Assignment 1
11 pages
Ass 1 Dsbda
No ratings yet
Ass 1 Dsbda
8 pages
Diabetes Prediction with Logistic Regression
No ratings yet
Diabetes Prediction with Logistic Regression
9 pages
C ML1
No ratings yet
C ML1
10 pages
Heart Health Data Analysis
No ratings yet
Heart Health Data Analysis
1 page
Diabetes Prediction 1704256341
No ratings yet
Diabetes Prediction 1704256341
17 pages
ML Practicals
No ratings yet
ML Practicals
21 pages
Heart Disease Prediction - Jupyter Notebook
100% (1)
Heart Disease Prediction - Jupyter Notebook
9 pages
Stroke Prediction
No ratings yet
Stroke Prediction
10 pages
Diabetes Prediction System
No ratings yet
Diabetes Prediction System
4 pages
Experiment 5
No ratings yet
Experiment 5
9 pages
Unit5 - Logistic Regression
No ratings yet
Unit5 - Logistic Regression
4 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
No ratings yet
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
8 pages
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
No ratings yet
Lab Manual - MachineLearningLaboratory-DR - Vaishnavi
71 pages
Diabetes - Test Report
No ratings yet
Diabetes - Test Report
62 pages
Logistic Regression
No ratings yet
Logistic Regression
28 pages
Diabetes Prediction with SVM & RF
No ratings yet
Diabetes Prediction with SVM & RF
8 pages
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
No ratings yet
Step-By-Step-Diabetes-Classification-Knn-Detailed-Copy1 - Jupyter Notebook
12 pages
Healthcare-Project-Simplilearn - Week1
No ratings yet
Healthcare-Project-Simplilearn - Week1
6 pages
Heart Failure Prediction EDA & Modeling
No ratings yet
Heart Failure Prediction EDA & Modeling
38 pages
Heart Disease Classification Using Ann Hands-On
No ratings yet
Heart Disease Classification Using Ann Hands-On
7 pages
Data Mining Lab - Ipynb - Colab
No ratings yet
Data Mining Lab - Ipynb - Colab
7 pages
Heart Disease Report With Comments and Code
No ratings yet
Heart Disease Report With Comments and Code
9 pages
Adaboost 2
No ratings yet
Adaboost 2
9 pages
Data Pre-Processing
No ratings yet
Data Pre-Processing
22 pages
ML Manual Final
No ratings yet
ML Manual Final
35 pages
Smoking Habits of Boston Youth - Solution - Jupyter Notebook
No ratings yet
Smoking Habits of Boston Youth - Solution - Jupyter Notebook
9 pages
Diabetes and Glucose Correlation - IBM Machine Learning Training Project
No ratings yet
Diabetes and Glucose Correlation - IBM Machine Learning Training Project
10 pages
REgression 1
No ratings yet
REgression 1
19 pages
Major Project - Colab
No ratings yet
Major Project - Colab
15 pages
Binary Prediction of Smoker Status Using Bio-Signals
No ratings yet
Binary Prediction of Smoker Status Using Bio-Signals
20 pages
Ai in HC - 2
No ratings yet
Ai in HC - 2
9 pages
The Brian D. Kirkpatrick - Resume
No ratings yet
The Brian D. Kirkpatrick - Resume
2 pages
Re Fix Match
No ratings yet
Re Fix Match
11 pages
ATPG Srivatsa PPT
100% (4)
ATPG Srivatsa PPT
37 pages
EGBe Series Catalog
No ratings yet
EGBe Series Catalog
6 pages
8960 - DWM Experiment 5
No ratings yet
8960 - DWM Experiment 5
6 pages
Common Machine Learning Issues
No ratings yet
Common Machine Learning Issues
2 pages
Clase 2 ESAN MARK 2024 - 2
No ratings yet
Clase 2 ESAN MARK 2024 - 2
45 pages
12 STD Homework
No ratings yet
12 STD Homework
2 pages
Software Engineering ch1 and 2
No ratings yet
Software Engineering ch1 and 2
30 pages
Image Segmentation Techniques
No ratings yet
Image Segmentation Techniques
58 pages
Chennai Water Resource Management Using GIS
No ratings yet
Chennai Water Resource Management Using GIS
7 pages
Red Black Tree
No ratings yet
Red Black Tree
15 pages
KBlaze
No ratings yet
KBlaze
1 page
Y8 Python Notes 3: Variables
No ratings yet
Y8 Python Notes 3: Variables
6 pages
DescribingDataGraphically Activity
No ratings yet
DescribingDataGraphically Activity
7 pages
Introduction To BA
No ratings yet
Introduction To BA
41 pages
Zoho Array Questions
100% (1)
Zoho Array Questions
58 pages
Some Visual Literacy Initiatives in Academic Institutions - A Literature Review From 1999 To The Present
No ratings yet
Some Visual Literacy Initiatives in Academic Institutions - A Literature Review From 1999 To The Present
35 pages
Momo Statement Report
No ratings yet
Momo Statement Report
34 pages
Cyber Security Practical File
No ratings yet
Cyber Security Practical File
21 pages
Extended Essay
No ratings yet
Extended Essay
18 pages
1.3.4 Lab - Visualizing The Black Hats
No ratings yet
1.3.4 Lab - Visualizing The Black Hats
3 pages
Result
No ratings yet
Result
48 pages
AI Based Health Monitoring System
No ratings yet
AI Based Health Monitoring System
2 pages
How To Define Build and Operationalize A Data Fabric
100% (1)
How To Define Build and Operationalize A Data Fabric
51 pages
Agriculture 13 00936
No ratings yet
Agriculture 13 00936
24 pages
Swahili Exercises
No ratings yet
Swahili Exercises
221 pages
Evolution of Internet
No ratings yet
Evolution of Internet
8 pages
Newtom Giano-Vg3 - User Manual-8a889788
No ratings yet
Newtom Giano-Vg3 - User Manual-8a889788
1,090 pages
First Hello World Program in JavaScript
No ratings yet
First Hello World Program in JavaScript
2 pages

Apply Linear Regression Model Techniques To Predict Data On Any Dataset

Uploaded by

Apply Linear Regression Model Techniques To Predict Data On Any Dataset

Uploaded by

4 - Jupyter Notebook http://localhost:8888/notebooks/Practicals_AI/4.

4. Apply Linear Regression Model techniques to predict data on any

In [18]: import pandas as pd

In [19]: df = pd.read_csv(r"C:\Users\ABHISHEK\Downloads\LungCapData - LungCapData.csv")

Out[20]: LungCap Age Height Smoke Gender Caesarean

0 6.475 6 62.1 no male no

1 10.125 18 74.7 yes female no

2 9.550 16 69.7 no female yes

3 11.125 14 71.0 no male no

4 4.800 5 56.9 no male no

In [23]: from sklearn.preprocessing import LabelEncoder

In [25]: df.Smoke = le.fit_transform(df.Smoke)

In [26]: df.Gender = le.fit_transform(df.Gender)

In [27]: df.Caesarean = le.fit_transform(df.Caesarean)

Out[28]: LungCap Age Height Smoke Gender Caesarean

Out[30]: Age Height Smoke Gender Caesarean

... ... ... ... ... ...

725 rows × 5 columns

In [32]: X_test,X_train,y_test,y_train = train_test_split(x,y,test_size = 0.2,random_state

Out[34]: ((145, 5), (145,))

In [35]: from sklearn.linear_model import LinearRegression

In [38]: y_pred = lr.predict(X_test)

In [39]: from sklearn.metrics import mean_squared_error,r2_score,mean_absolute_error

In [40]: mse = mean_squared_error(y_test,y_pred)

In [41]: err_train = y_test - y_pred

In [43]: mse = np.mean(np.square(err_train))

In [45]: rmse = np.sqrt(mse)

Out[48]: [<matplotlib.lines.Line2D at 0x165af248dc0>]

Out[55]: ((580,), (580,))

In [66]: pred_actual_df = pd.DataFrame(d)

Out[67]: Actual Predicted

446 6.300 6.251456

423 7.800 9.078604

596 3.925 4.716138

411 8.675 8.229591

... ... ...

106 10.875 11.602824

270 6.100 5.671011

435 11.300 10.971752

102 3.450 6.361120

580 rows × 2 columns

In [69]: sns.jointplot(x ='Actual',y = 'Predicted',data= pred_actual_df ,kind = 'reg')

You might also like