
Machine Learning

Machine Learning enables machines to learn from data and experience without explicit programming.
Instead of hand-coding rules, you provide data to an algorithm, which builds its own logic from that data.

How it works: A training dataset is used to train an ML algorithm to create a model. New input data is
then processed through this model to make predictions. If the predictions meet acceptable accuracy, the
model is deployed. Otherwise, it is retrained with enhanced data until the accuracy improves.
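
As a minimal sketch of this train / evaluate / retrain workflow, assuming scikit-learn, a toy dataset, and an illustrative 90% acceptance threshold (both are assumptions for demonstration, not part of the notes):

    from sklearn.datasets import load_iris
    from sklearn.model_selection import train_test_split
    from sklearn.linear_model import LogisticRegression
    from sklearn.metrics import accuracy_score

    # Split the data into a training set and a held-out test set.
    X, y = load_iris(return_X_y=True)
    X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

    # Train a model on the training dataset, then evaluate on new data.
    model = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    accuracy = accuracy_score(y_test, model.predict(X_test))

    if accuracy >= 0.90:    # assumed acceptance threshold
        print("accuracy acceptable -> deploy the model")
    else:
        print("accuracy too low -> retrain with enhanced data")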

Types of Machine Learning:

1. Supervised Learning – Learning is guided by labeled data (a "teacher"). The model is trained on this
dataset and then makes predictions or decisions when new data is introduced.

2. Unsupervised Learning – The model learns by observing and finding patterns in data without labels. It
organizes data into clusters based on relationships, though it doesn't assign labels to these clusters. For
example, it can group apples, bananas, and mangoes into clusters without naming them.

3. Reinforcement Learning – An agent interacts with its environment and learns through rewards and
penalties. It refines its decisions over time by maximizing positive rewards and minimizing mistakes.
Once trained, it can make predictions based on new data.
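
A short sketch contrasting the first two types, assuming scikit-learn and its bundled iris dataset (an illustrative choice, not from the notes):

    from sklearn.datasets import load_iris
    from sklearn.tree import DecisionTreeClassifier
    from sklearn.cluster import KMeans

    X, y = load_iris(return_X_y=True)

    # Supervised: the labels y act as the "teacher".
    clf = DecisionTreeClassifier().fit(X, y)
    print(clf.predict(X[:1]))         # predicts a known class label

    # Unsupervised: no labels; samples are grouped into unnamed clusters.
    km = KMeans(n_clusters=3, n_init=10).fit(X)
    print(km.labels_[:10])            # cluster ids, not class names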
[Figure: classification of machine learning]

Unit 2
Regression Models:

Regression predicts continuous response values, such as house prices, stock values, or cricket scores.
Common models include:

1. Simple Linear Regression – Predicts using one independent variable.


2. Multiple Linear Regression – Predicts using multiple independent variables.

Key Concepts:

• Cost Function & Gradient Descent: Methods for optimizing the model by minimizing error.

• Performance Metrics:

o Mean Absolute Error (MAE)

o Mean Squared Error (MSE)

o R-Squared & Adjusted R-Squared (indicate model fit).
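
As a quick illustration of these metrics with scikit-learn (the values below are invented; adjusted R-squared is computed by hand since scikit-learn does not provide it directly):

    from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score

    y_true = [3.0, 5.0, 7.5, 10.0]
    y_pred = [2.8, 5.4, 7.0, 10.3]

    print("MAE:", mean_absolute_error(y_true, y_pred))
    print("MSE:", mean_squared_error(y_true, y_pred))
    r2 = r2_score(y_true, y_pred)
    print("R^2:", r2)

    # Adjusted R^2 penalizes extra predictors: n samples, p features.
    n, p = 4, 1
    print("Adjusted R^2:", 1 - (1 - r2) * (n - 1) / (n - p - 1))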

Types of Regression
1. Linear Regression
2. Logistic Regression
3. Polynomial Regression
4. Support Vector Regression
5. Decision Tree Regression

6. Random Forest Regression
7. Ridge Regression
8. Lasso Regression

Linear Regression:

Linear regression is a simple statistical method for predictive analysis that models the relationship
between continuous variables. It addresses regression problems by showing a linear relationship
between the independent variable (X) and the dependent variable (Y).

Types:

1. Simple Linear Regression – One input variable.

2. Multiple Linear Regression – Multiple input variables.

Equation:
Y = aX + b

• Y: Dependent variable (target)

• X: Independent variable

• a: Slope of the line; b: Intercept

Example: Predicting an employee's salary based on years of experience.
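
A minimal sketch of this salary example, assuming scikit-learn; the data points are invented for illustration:

    import numpy as np
    from sklearn.linear_model import LinearRegression

    years = np.array([[1], [2], [3], [5], [7]])    # X: years of experience
    salary = np.array([35, 42, 50, 65, 80])        # Y: salary in thousands

    model = LinearRegression().fit(years, salary)
    print("a (slope):", model.coef_[0])
    print("b (intercept):", model.intercept_)
    print("predicted salary for 4 years:", model.predict([[4]])[0])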

Some popular applications of linear regression are:

• Analyzing trends and sales estimates


• Salary forecasting
• Real estate prediction
• Arriving at ETAs in traffic.
LINEAR REGRESSION
Linear regression is a statistical approach for modeling the relationship between a dependent variable and a
given set of independent variables.

Simple Linear Regression


Simple linear regression is an approach for predicting a response using a single feature.

It is assumed that the two variables are linearly related. Hence, we try to find a linear function that
predicts the response value (y) as accurately as possible as a function of the feature or independent
variable (x). Consider a dataset where we have a value of the response y for every feature x, as in the sketch below.
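
Here is a small sketch that fits the least-squares line directly from the classical formulas (the data values are invented):

    import numpy as np

    x = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
    y = np.array([1, 3, 2, 5, 7, 8, 8, 9, 10, 12])

    # Classical least-squares estimates:
    #   b1 = sum((x - mean(x)) * (y - mean(y))) / sum((x - mean(x))^2)
    #   b0 = mean(y) - b1 * mean(x)
    b1 = np.sum((x - x.mean()) * (y - y.mean())) / np.sum((x - x.mean()) ** 2)
    b0 = y.mean() - b1 * x.mean()
    print("fitted line: y =", b0, "+", b1, "* x")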
LOGISTIC REGRESSION

Consider an example dataset that maps the number of hours of study to the result of an exam. The
result can take only two values, namely passed (1) or failed (0);

i.e., y is a categorical target variable that can take only two possible values, "0" or "1". To
generalize the model, we assume that the probability of passing follows the logistic (sigmoid) function of a
linear combination of the input: p(y = 1 | x) = 1 / (1 + e^-(b0 + b1*x)).
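
A minimal sketch of this hours-of-study example with scikit-learn (the data points are invented):

    import numpy as np
    from sklearn.linear_model import LogisticRegression

    hours = np.array([[0.5], [1.0], [1.5], [2.0], [3.0], [4.0], [5.0], [6.0]])
    passed = np.array([0, 0, 0, 0, 1, 1, 1, 1])

    clf = LogisticRegression().fit(hours, passed)
    print(clf.predict([[2.5]]))          # predicted class: 0 (fail) or 1 (pass)
    print(clf.predict_proba([[2.5]]))    # [P(fail), P(pass)] via the sigmoid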
Differences between Linear Regression and Logistic Regression:

LINEAR REGRESSION
1. Linear Regression is a supervised regression model.
2. In Linear Regression, we predict a continuous numeric value.
3. No activation function is used.

LOGISTIC REGRESSION
1. Logistic Regression is a supervised classification model.
2. In Logistic Regression, we predict the value as 1 or 0.
3. An activation function (the sigmoid) is used to convert the linear regression output into a class probability.
Performance Metrics

1. Accuracy: the proportion of correct predictions, i.e., the sum of the values on the "main diagonal" of
the confusion matrix divided by the total number of samples.

2. Precision: the number of correct positive results divided by the number of positive results predicted
by the classifier.

3. Recall: the number of correct positive results divided by the number of all relevant samples (samples
that are actually positive).
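
A sketch of all three metrics computed from a confusion matrix with scikit-learn (labels invented):

    from sklearn.metrics import (accuracy_score, confusion_matrix,
                                 precision_score, recall_score)

    y_true = [1, 0, 1, 1, 0, 1, 0, 0]
    y_pred = [1, 0, 1, 0, 0, 1, 1, 0]

    print(confusion_matrix(y_true, y_pred))  # correct counts on the main diagonal
    print("Accuracy:", accuracy_score(y_true, y_pred))
    print("Precision:", precision_score(y_true, y_pred))  # TP / (TP + FP)
    print("Recall:", recall_score(y_true, y_pred))        # TP / (TP + FN)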
Residuals and Residual Plots

Residuals: Residuals measure the vertical distance between observed data points and the regression line,
representing the error between predicted and actual values.

Residual Plots:

• Residuals (Y-axis) vs. the independent variable (X-axis) are visualized in residual plots.

• Key assumption: residuals should be independent and normally distributed.

Residual Plot Analysis

A key assumption of linear regression is that residuals (errors) are independent and normally distributed.
Since predictions are never 100% accurate, some randomness is inherent. The regression model aims to
capture all predictive information in the deterministic part, leaving residuals as completely random and
unpredictable (stochastic). Ideally, residuals should follow a normal distribution, validating this
assumption.

Characteristics of a Good Residual Plot:

1. High density of points near the origin and low density away from it.

2. Symmetry about the origin.

3. No patterns as residuals are distributed evenly along the X-axis.

4. Projected residuals on the Y-axis form a normal distribution.

A good residual plot shows random, patternless scatter, while a bad one shows systematic patterns or
deviations from normality. This validates the assumption that residual errors are stochastic and
independent.

A good residual plot satisfies key assumptions:

1. Residuals projected onto the Y-axis form a normal distribution, confirming normality.

2. Residuals are evenly distributed across the X-axis with no visible patterns, ensuring
independence.
[Figure: good residual plots; residuals projected onto the Y-axis form a normal distribution]

In contrast, a bad residual plot shows:

• High density far from the origin and low density near it.

• A non-normal distribution when projected onto the Y-axis, violating these assumptions.
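
A sketch that produces the kind of residual plot described above, assuming numpy and matplotlib with invented data:

    import numpy as np
    import matplotlib.pyplot as plt

    rng = np.random.default_rng(0)
    x = np.linspace(0, 10, 100)
    y = 2 * x + 1 + rng.normal(0, 1, size=x.size)  # linear signal plus noise

    b1, b0 = np.polyfit(x, y, 1)                   # fit the regression line
    residuals = y - (b0 + b1 * x)

    plt.scatter(x, residuals)                      # residuals (Y) vs. x (X)
    plt.axhline(0, color="red")                    # zero-error reference line
    plt.xlabel("x")
    plt.ylabel("residual")
    plt.show()                                     # good fit: random scatter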

Polynomial Regression

Polynomial regression models the relationship between the independent variable x and the
dependent variable y as an nth-degree polynomial. It fits a nonlinear relationship using the
least-squares method.

Types of Polynomial Regression:

• Linear: Degree = 1

• Quadratic: Degree = 2

• Cubic: Degree = 3
• Higher degrees follow similarly.

Assumptions of Polynomial Regression

For effective polynomial regression:

1. The relationship between the dependent variable and the independent variables should be linear or
curvilinear, and additive.

2. Independent variables must not correlate with each other.

3. Errors should be independent, normally distributed with a mean of zero, and have constant
variance.

Polynomial regression alters the structure from a linear equation to a quadratic or higher-degree
equation, which can be visualized through its curve.

Linear Regression vs. Polynomial Regression

Linear regression models straight-line relationships but struggles when data points follow a curve. When
linear regression underfits the data, polynomial regression captures the nonlinear patterns by fitting a
curved line.
Key Difference:

• Linear regression assumes a linear relationship between variables.

• Polynomial regression handles nonlinear relationships effectively by increasing model complexity
(e.g., quadratic curves) while keeping feature weights linear.

Polynomial regression overcomes underfitting by transforming the model structure without changing the
linear nature of the weights.
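
A sketch of that idea with scikit-learn: the features are transformed to polynomial terms, but the model stays linear in its weights (data invented, roughly y = x^2 + x):

    import numpy as np
    from sklearn.linear_model import LinearRegression
    from sklearn.pipeline import make_pipeline
    from sklearn.preprocessing import PolynomialFeatures

    x = np.array([[1], [2], [3], [4], [5], [6]])
    y = np.array([2, 6, 12, 20, 30, 42])

    # Features become [x, x^2]; the regression itself remains linear.
    model = make_pipeline(PolynomialFeatures(degree=2), LinearRegression())
    model.fit(x, y)
    print(model.predict([[7]]))    # follows the curve (about 56)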

MEASURES FOR IN-SAMPLE EVALUATION

A way to numerically determine how well the model fits the data set.

Two important measures to determine the fit of a model:

• Mean squared error (MSE)


• R squared (R^2)

Mean Squared Error (MSE)

Mean Squared Error (MSE) quantifies how close a regression line is to data points by calculating the
average of squared errors.

• Smaller MSE indicates closely dispersed data with fewer errors, resulting in a better model.

• Larger MSE suggests widely scattered data points around the mean.

Goal: Minimize MSE for improved model accuracy.
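
A sketch of MSE computed directly from its definition, MSE = (1/n) * sum((y_i - y_hat_i)^2), with invented values:

    import numpy as np

    y_actual = np.array([3.0, 5.0, 7.5, 10.0])
    y_predicted = np.array([2.8, 5.4, 7.0, 10.3])

    # Average of the squared errors between actual and predicted values.
    mse = np.mean((y_actual - y_predicted) ** 2)
    print("MSE:", mse)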
