RRB - Unit 2 Regression

SPPU University, AI&DS Engineering, Final Year BE, Semester 7, Machine Learning, Unit 2 notes


Unit 2

Regression
Syllabus - AI&DS
● Introduction- Regression, Need of Regression,
● Difference between Regression and Correlation,
● Types of Regression: Univariate vs. Multivariate, Linear vs. Nonlinear, Simple Linear vs.
Multiple Linear,
● Bias-Variance tradeoff, Overfitting and Underfitting.
● Regression Techniques - Polynomial Regression, Stepwise Regression, Decision Tree
Regression, Random Forest Regression, Support Vector Regression, Ridge Regression, Lasso
Regression, Elastic Net Regression, Bayesian Linear Regression.
● Evaluation Metrics: Mean Squared Error (MSE), Mean Absolute Error (MAE), Root Mean
Squared Error (RMSE), R-squared, Adjusted R-squared.
Syllabus - Computer
● Bias, Variance,
● Generalization, Underfitting, Overfitting,
● Linear regression,
● Regression: Lasso regression, Ridge regression
● Gradient descent algorithm
● Evaluation Metrics: MAE, RMSE, R2
Errors in Machine Learning

Important Link: https://www.javatpoint.com/bias-and-variance-in-machine-learning
● Irreducible errors are errors which will always be present in a
machine learning model, because of unknown variables, and
whose values cannot be reduced.
● Reducible errors are those errors whose values can be
further reduced to improve a model. They arise because our
model’s output function does not match the desired output
function, and they can be optimized.
Bias
● Bias is the difference between our actual and predicted values.
● Bias reflects the simplifying assumptions that our model makes
about our data in order to be able to predict new data.
Variance
● Variance can be defined as the model’s sensitivity to
fluctuations in the data.
● A high-variance model may learn from noise.
Bias vs. Variance
Bias-Variance Tradeoff
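The tradeoff can be made concrete with a small simulation. As a sketch (the estimator, sample size, and shrinkage factor below are illustrative choices, not from the notes), we estimate the mean of a distribution with a deliberately biased "shrunk" estimator and check numerically that expected squared error = bias² + variance:

```python
import numpy as np

rng = np.random.default_rng(0)
theta, sigma, n = 2.0, 1.0, 10   # true mean, noise std, sample size
c = 0.8                          # shrinkage factor: biased but lower-variance estimator
trials = 100_000

# Each trial: draw n samples, estimate theta with the shrunk mean c * xbar.
samples = rng.normal(theta, sigma, size=(trials, n))
estimates = c * samples.mean(axis=1)

bias = estimates.mean() - theta          # theory: (c - 1) * theta = -0.4
variance = estimates.var()               # theory: c^2 * sigma^2 / n = 0.064
mse = ((estimates - theta) ** 2).mean()  # theory: bias^2 + variance = 0.224

print(bias, variance, mse)
```

The decomposition MSE = bias² + variance holds for any estimator of this kind; shrinking toward zero adds bias but reduces variance, which is the same tension a regularized regression model exploits.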
Overfitting and Underfitting
Generalization
● Generalization is a term used to describe a model’s ability to react
to new data. That is, after being trained on a training set, a model
can digest new data and make accurate predictions.
● If a model has been trained too well on training data, it will be unable
to generalize.
● It will make inaccurate predictions when given new data, making the
model useless even though it is able to make accurate predictions for
the training data. This is called overfitting.
● The inverse is also true. Underfitting happens when a model has not
been trained enough on the data. Underfitting makes the model just as
useless: it is not capable of making accurate predictions, even with
the training data.
Overfitting
Underfitting
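Both failure modes can be seen in a minimal numeric sketch (the data and polynomial degrees are illustrative assumptions, not from the notes): fit polynomials of increasing degree to noisy quadratic data and compare their training errors.

```python
import numpy as np

# Quadratic ground truth with a small alternating "noise" pattern.
x = np.arange(6, dtype=float)            # 0..5
y = x**2 + np.array([0.5, -0.5, 0.5, -0.5, 0.5, -0.5])

def train_mse(degree):
    """Fit a polynomial of the given degree and return its training MSE."""
    coeffs = np.polyfit(x, y, degree)
    pred = np.polyval(coeffs, x)
    return np.mean((y - pred) ** 2)

mse_underfit = train_mse(1)   # degree 1: too simple, misses the curvature
mse_good = train_mse(2)       # degree 2: matches the true model
mse_overfit = train_mse(5)    # degree 5: interpolates every point, noise included

print(mse_underfit, mse_good, mse_overfit)
```

The degree-5 fit drives training error to essentially zero by memorizing the noise, which is exactly why its predictions on new x values would be unreliable: low training error alone does not imply generalization.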
Regression

"Regression shows a line or curve that passes through all the datapoints on a
target-predictor graph in such a way that the vertical distance between the
data points and the regression line is minimum."
Simple Linear Regression
Linear Regression
● Regression analysis is a statistical method to model the
relationship between a dependent (target) variable and one or
more independent (predictor) variables.
● It predicts continuous/real values such as temperature, age,
salary, price, etc.
● Regression is a supervised learning technique.
● It is mainly used for prediction, forecasting, time-series
modeling, and determining the cause-effect relationship
between variables.
Linear Regression
Y = β0 + β1X
where
● Y is the dependent variable: the variable we wish to explain (also called the
endogenous variable)
● X is the independent variable: the variable used to explain the dependent variable
(also called the exogenous variable)
● β0 is the intercept: where the line cuts the Y-axis.
● β1 is the slope of the line. (The slope is important because it indicates the
change in the Y-variable when X changes by one unit.)
Linear Regression
Simple Linear Regression

Solved Example

Google Colab
Person   X (Bahubali1)   Y (Bahubali2)
P1       4               3
P2       2               4
P3       3               2
P4       5               5
P5       1               3
P6       3               1
AVG      3 (Xavg)        3 (Yavg)
X        Y        X-Xavg   Y-Yavg   (X-Xavg)(Y-Yavg)   (X-Xavg)^2
4        3         1        0        0                  1
2        4        -1        1       -1                  1
3        2         0       -1        0                  0
5        5         2        2        4                  4
1        3        -2        0        0                  4
3        1         0       -2        0                  0
avg: 3   avg: 3                      sum: 3             sum: 10
β1 = sum / sum = 3 / 10 = 0.3
β0 = Yavg - β1·Xavg = 3 - 0.3·3 = 2.1

Y = β0 + β1X  →  y = 2.1 + 0.3x

x (Bahubali1)   Actual Y (Bahubali2)   Predicted y
4               3                      3.3
2               4                      2.7
3               2                      3.0
5               5                      3.6
1               3                      2.4
3               1                      3.0
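The hand calculation above can be checked in a few lines of NumPy (the notes mention Google Colab; this sketch applies the same Σ(x-x̄)(y-ȳ) / Σ(x-x̄)² formula used in the table):

```python
import numpy as np

x = np.array([4, 2, 3, 5, 1, 3], dtype=float)  # Bahubali1 ratings
y = np.array([3, 4, 2, 5, 3, 1], dtype=float)  # Bahubali2 ratings

# Slope and intercept via the least-squares formulas from the table.
b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
b0 = y.mean() - b1 * x.mean()

y_pred = b0 + b1 * x
print(b1, b0)   # 0.3 and 2.1, matching the worked example
print(y_pred)   # predictions: 3.3, 2.7, 3.0, 3.6, 2.4, 3.0
```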
Y Actual Vs Y Predicted
How Good is the model’s prediction power?

x    Actual Y   Predicted y   Y-Ypred   (Y-Ypred)^2   Ypred-AvgY   (Ypred-AvgY)^2   Y-AvgY   (Y-AvgY)^2
4    3          3.3           -0.3       0.09           0.3          0.09             0        0
2    4          2.7            1.3       1.69          -0.3          0.09             1        1
3    2          3.0           -1.0       1.00           0.0          0.00            -1        1
5    5          3.6            1.4       1.96           0.6          0.36             2        4
1    3          2.4            0.6       0.36          -0.6          0.36             0        0
3    1          3.0           -2.0       4.00           0.0          0.00            -2        4
                               SSE =     9.1            SSR =        0.9              SST =    10

SST = SSR + SSE
10 = 0.9 + 9.1
r² = SSR / SST = 0.9 / 10 = 0.09
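The goodness-of-fit table can likewise be verified numerically (same data and fitted line as in the worked example; a small sketch, not part of the original notes):

```python
import numpy as np

x = np.array([4, 2, 3, 5, 1, 3], dtype=float)
y = np.array([3, 4, 2, 5, 3, 1], dtype=float)
y_pred = 2.1 + 0.3 * x                  # fitted line from the worked example

sse = np.sum((y - y_pred) ** 2)         # unexplained (residual) variation: 9.1
ssr = np.sum((y_pred - y.mean()) ** 2)  # variation explained by the line: 0.9
sst = np.sum((y - y.mean()) ** 2)       # total variation: 10.0

r_squared = ssr / sst                   # 0.09: the line explains 9% of the variation
print(sse, ssr, sst, r_squared)
```

SST = SSR + SSE holds by construction for least-squares fits, so r² = SSR/SST = 1 - SSE/SST.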
Correlation
The term correlation is a combination of two words: 'Co' (together) and 'relation'
(connection) between two quantities. Correlation is observed when a unit change in
one variable is accompanied by an equivalent change in another variable, directly or
indirectly, during the study of the two variables.
Correlation can be either negative or positive.
If the two variables move in the same direction, i.e. an increase in one variable results in a
corresponding increase in the other, and vice versa, then the variables are considered to be
positively correlated. For example, investment and profit.

On the contrary, if the two variables move in different directions, so that an increase in one
variable leads to a decline in the other, and vice versa, the situation is known as negative
correlation. For example, product price and demand.
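The link between correlation and the regression example above can be shown directly. For the Bahubali ratings, the Pearson correlation coefficient r is 0.3, and r² = 0.09 matches the coefficient of determination computed earlier (a small sketch, not from the notes):

```python
import numpy as np

x = np.array([4, 2, 3, 5, 1, 3], dtype=float)
y = np.array([3, 4, 2, 5, 3, 1], dtype=float)

# Pearson correlation: covariance scaled by both standard deviations.
r = np.corrcoef(x, y)[0, 1]
print(r, r**2)   # r = 0.3, and r^2 = 0.09 equals the regression r-squared
```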
Polynomial Regression

The problem of non-linear regression can be solved by two methods:

1. Transformation of the non-linear data to linear data, so that
linear regression can handle the data
2. Using polynomial regression
Let's see an example:

x    y
1    1
2    4
3    9
4    15

y = -0.75 + 0.95x + 0.75x²


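The quadratic coefficients in the example can be recovered with an ordinary least-squares polynomial fit (a sketch using NumPy's `polyfit`; the notes do not specify the tool used):

```python
import numpy as np

x = np.array([1, 2, 3, 4], dtype=float)
y = np.array([1, 4, 9, 15], dtype=float)

# Fit y = c2*x^2 + c1*x + c0; polyfit returns the highest-degree coefficient first.
c2, c1, c0 = np.polyfit(x, y, 2)
print(c0, c1, c2)   # -0.75, 0.95, 0.75, matching y = -0.75 + 0.95x + 0.75x^2
```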
Stepwise Regression

Read up to the diagram only:

https://quantifyinghealth.com/stepwise-selection/
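The linked article covers forward stepwise selection. A minimal NumPy-only sketch of the idea (the toy data, stopping threshold, and helper names are illustrative assumptions, not from the article): greedily add the single feature that most reduces the residual sum of squares, and stop when no candidate improves the fit meaningfully.

```python
import numpy as np

# Toy data: y depends only on features 0 and 1; feature 2 is irrelevant.
X = np.array([[1, 2,  1],
              [2, 1, -1],
              [3, 4,  1],
              [4, 3, -1],
              [5, 6,  1],
              [6, 5, -1]], dtype=float)
y = 2 * X[:, 0] + 3 * X[:, 1]

def sse_of(cols):
    """Residual sum of squares of an OLS fit (with intercept) on the given columns."""
    A = np.column_stack([np.ones(len(y))] + [X[:, c] for c in cols])
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)
    return np.sum((y - A @ beta) ** 2)

selected, current_sse = [], np.sum((y - y.mean()) ** 2)
while len(selected) < X.shape[1]:
    remaining = [c for c in range(X.shape[1]) if c not in selected]
    best = min(remaining, key=lambda c: sse_of(selected + [c]))
    if current_sse - sse_of(selected + [best]) < 1e-8:   # no real improvement: stop
        break
    selected.append(best)
    current_sse = sse_of(selected)

print(selected)   # the two informative features, 0 and 1, in some order
```

Real stepwise procedures use a statistical criterion (p-values, AIC/BIC) rather than a raw SSE threshold, and bidirectional variants can also remove features; this sketch shows only the forward-selection skeleton.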
Regression
A statistical technique based on the average mathematical relationship between two
or more variables is known as regression; it is used to estimate the change in a metric
dependent variable due to a change in one or more independent variables.

It plays an important role in many human activities, since it is a powerful and flexible
tool used to forecast past, present, or future events on the basis of past or present
data. For example, the future profit of a business can be estimated on the basis of
past records.

There are two variables, x and y, in simple linear regression, wherein y depends on x,
or we say y is influenced by x. Here y is called the dependent variable (or criterion),
and x the independent variable (or predictor).
Types of Regression: Univariate vs. Multivariate, Linear vs. Nonlinear, Simple Linear
vs. Multiple Linear
1. Univariate data – This type of data consists of only one variable. The analysis of univariate data is
thus the simplest form of analysis, since the information deals with only one quantity that changes.

It does not deal with causes or relationships, and the main purpose of the analysis is to describe the data
and find patterns that exist within it. An example of univariate data is height.

Suppose that the heights of seven students of a class are recorded (figure 1); there is only one variable,
height, and it does not deal with any cause or relationship.
Types of Regression: Univariate vs. Multivariate, Linear vs. Nonlinear, Simple Linear
vs. Multiple Linear
2. Bivariate data – This type of data involves two different variables.

The analysis of this type of data deals with causes and relationships, and is done to find out the
relationship between the two variables. An example of bivariate data is temperature and ice cream sales
in the summer season.

Suppose temperature and ice cream sales are the two variables of a bivariate data set (figure 2). The
relationship is visible from the table: temperature and sales are directly proportional to each other, and
thus related, because as the temperature increases, the sales also increase. Bivariate data analysis thus
involves comparisons, relationships, causes and explanations.
Types of Regression: Univariate vs. Multivariate, Linear vs. Nonlinear, Simple Linear
vs. Multiple Linear

3. Multivariate data – When the data involves three or more variables, it is categorized
as multivariate. For example, suppose an advertiser wants to compare the popularity of
four advertisements on a website; the click rates could be measured for both men and
women, and relationships between variables could then be examined. It is similar to
bivariate analysis but contains more than one dependent variable. How this data is
analyzed depends on the goals to be achieved. Some of the techniques are regression
analysis, path analysis, factor analysis, and multivariate analysis of variance (MANOVA).
Click Here: Regularization (Ridge and Lasso)

Example
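For the linked regularization topic, ridge regression has a simple closed form worth sketching: it adds a penalty λ‖β‖² to least squares, giving β̂ = (XᵀX + λI)⁻¹Xᵀy, which shrinks the coefficients toward zero as λ grows (the data below is an illustrative assumption):

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(size=(20, 3))               # 20 samples, 3 centered features
y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(scale=0.1, size=20)

def ridge(X, y, lam):
    """Closed-form ridge solution: (X^T X + lam * I)^(-1) X^T y."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

b_small = ridge(X, y, lam=0.01)   # almost ordinary least squares
b_large = ridge(X, y, lam=100.0)  # heavy shrinkage toward zero

print(np.linalg.norm(b_small), np.linalg.norm(b_large))
```

Lasso replaces the squared penalty with λ‖β‖₁, which has no closed form but can drive coefficients exactly to zero, performing feature selection.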
