Selecting the ML Approach
The data modeling approach for machine learning is chosen based on the structure
and volume of the data at hand, regardless of the use case. Any of the
following approaches can be chosen after considering these factors.
• Supervised Learning
• Unsupervised Learning
• Semi-supervised Learning
• Reinforcement Learning
Quiz Time
Guess what ML approach is used by spam detection?
• Supervised Learning
• Unsupervised Learning
• Semi-supervised Learning
• Reinforcement Learning
Answer: Supervised Learning
Fundamentals of Machine Learning and Deep Learning
Topic 5: Algorithms of Machine Learning
Machine Learning Algorithms
• There are four main types of machine learning algorithms.
• The choice of the algorithm depends on the type of data in the use case.
Types of Supervised Learning
The two main types of supervised learning that use labeled data are
regression and classification.
Classification
• Classification is applied when the output has finite and
discrete values.
• For example, social media sentiment analysis has three
potential outcomes: positive, negative, or neutral.
Regression
• Regression is applied when the output is a continuous
number.
• A simple regression model takes the form y = wx + b: for example,
the relationship between environmental temperature (y) and
humidity levels (x).
Classification vs. Regression
By fitting a model to the labeled training set, you can find the optimal model
parameters to predict unknown labels on other objects (the test set).
• If the label is a real number, the task is called regression. For example,
predicting the actual value of a house price based on features like location,
construction year, etc.
• If the label is from a limited number of unordered values, the task is called
classification. For example, classifying images of animals into separate groups
(labels) of dogs and cats.
Linear Regression
• Linear regression is an equation that describes the line
that best represents the relationship between the input
variables (x) and the output variable (y).
• It does so by finding specific weightings for the input
variables, called coefficients (B).
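The line-fitting idea above can be sketched in a few lines of NumPy; the humidity/temperature numbers below are made up for illustration.

```python
# Minimal linear regression sketch: fit y = w*x + b by ordinary least squares.
import numpy as np

# Hypothetical training data: humidity (x) vs. temperature (y)
x = np.array([30.0, 45.0, 60.0, 75.0, 90.0])
y = np.array([20.0, 23.0, 26.0, 29.0, 32.0])

# Stack a column of ones so the intercept b is learned alongside the weight w
A = np.column_stack([x, np.ones_like(x)])
w, b = np.linalg.lstsq(A, y, rcond=None)[0]

print(f"y = {w:.2f}x + {b:.2f}")              # learned line
print(f"prediction at x=50: {w * 50 + b:.1f}")
```

On this toy data the fit is exact, so the learned coefficients recover the line the points were drawn from.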
Quiz Time
Which of these is a use case for linear regression?
• Spam detection
• Google Translate
• Car mileage based on brand, model, year, weight, etc.
• Robot learning to walk
Answer: Car mileage based on brand, model, year, weight, etc.
Meaning of Decision Tree
• A decision tree is a graphical representation of all the
possible solutions to a decision based on a few conditions.
• It uses predictive models to achieve results.
• A decision tree is drawn upside down with its root at the
top.
Classification and Regression Trees
[Figure: decision tree for "Should I accept a new job offer?", with internal nodes "Commute more than 1 hour" and "Offers free coffee", and leaves for accepting or declining the offer]
• The tree splits into branches based on a condition or internal node; the topmost split is the root node.
• The end of the branch that doesn't split anymore is the decision/leaf.
• In this case, the outcome, whether the employee accepts or rejects the job offer, is represented as green oval-shaped boxes.
• This tree is called a classification tree, as the target is to classify whether the job offer is accepted by the employee or not.
• Regression trees are represented in the same manner, but they predict continuous values, like the price of a house.
• Decision tree algorithms are referred to as CART, or Classification and Regression Trees.
• Each node represents a single input variable (x) and a split point on that variable, assuming the variable is numeric.
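The job-offer tree can be written as ordinary nested conditions; the split order follows the slide's figure, while the "accept" leaf and the commute threshold are reconstructed for illustration.

```python
# Sketch of the job-offer decision tree as plain if/else logic.
# Conditions follow the slide's example; the "accept" leaf is assumed.
def job_offer_decision(commute_hours: float, free_coffee: bool) -> str:
    # Internal (root) node: commute more than 1 hour?
    if commute_hours > 1:
        return "Decline offer"      # leaf
    # Internal node: does the company offer free coffee?
    if free_coffee:
        return "Accept offer"       # leaf
    return "Decline offer"          # leaf

print(job_offer_decision(1.5, True))   # long commute -> decline
print(job_offer_decision(0.5, True))   # short commute, free coffee -> accept
```

Each `if` corresponds to an internal node (a variable and a split point), and each `return` is a leaf.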
Quiz Time
Can you think of a use case for
decision tree?
Naive Bayes
• Naive Bayes is a simple but surprisingly powerful algorithm for predictive modeling.
• The model comprises two types of probabilities: the probability of each class (the prior) and
the conditional probability of the data given each class (the likelihood).
• Once calculated, this probability model can be used to make predictions for new data
using Bayes' theorem.
• When your data is real-valued, the probabilities can be estimated using a bell curve (Gaussian distribution).
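As a sketch of the bell-curve case, the toy example below estimates a class prior and a per-class Gaussian likelihood from made-up 1-D data, then multiplies them to score each class, as Bayes' theorem prescribes.

```python
# Gaussian ("bell curve") Naive Bayes sketch on a toy 1-D dataset (made-up numbers).
import math

# Hypothetical real-valued feature, grouped by class label
data = {"spam": [0.9, 1.1, 1.0, 0.8], "ham": [0.1, 0.2, 0.0, 0.1]}
n_total = sum(len(v) for v in data.values())

def gaussian_pdf(x, mean, std):
    # Density of the normal distribution with the given mean and std at x
    return math.exp(-((x - mean) ** 2) / (2 * std ** 2)) / (std * math.sqrt(2 * math.pi))

def predict(x):
    scores = {}
    for label, values in data.items():
        prior = len(values) / n_total                       # P(class)
        mean = sum(values) / len(values)
        std = (sum((v - mean) ** 2 for v in values) / len(values)) ** 0.5
        scores[label] = prior * gaussian_pdf(x, mean, std)  # P(class) * P(x | class)
    return max(scores, key=scores.get)

print(predict(0.95))  # near the "spam" cluster
print(predict(0.05))  # near the "ham" cluster
```

The class with the largest prior-times-likelihood product wins, which is exactly the comparison made in the worked example that follows.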
Naive Bayes Example
How does an email client classify between valid and spam emails?
[Figure: example emails sorted into Spam/Junk and Ham/Inbox folders]
Naive Bayes Classification
• The objects can be classified as either GREEN or RED. The task is to classify new cases
as they arrive.
• For example, using Naive Bayes, you can decide the class label of a new case based on the
currently observed objects.
• Since there are twice as many green objects as red, it is reasonable to believe that a
new case (which has not been observed yet) is twice as likely to be green as red.
Naive Bayes Classification
• In Bayesian analysis, this belief is known as prior probability.
• Prior probabilities are based on previous experience.
• Prior probability of green: number of green objects/total number of objects
• Prior probability of red: number of red objects/total number of objects
Naive Bayes Classification
Since there is a total of 60 objects, 40 of which are green and 20 red, the prior probabilities
for class membership are:
• Prior probability for green: 40/60
• Prior probability for red: 20/60
Naive Bayes Classification
• The more green (or red) objects there are in the vicinity of X, the more likely that the new
cases will belong to that particular color.
• To measure the likelihood, draw a circle around X which encompasses a number of points
irrespective of their class labels.
• Then, calculate the number of points in the circle that belong to each class label.
Naive Bayes Classification
CALCULATION OF LIKELIHOOD
In this illustration, the likelihood of X given GREEN is smaller than the likelihood of
X given RED, since the circle encompasses 1 GREEN object and 3 RED ones:
likelihood of X given GREEN = 1/40, while likelihood of X given RED = 3/20.
Naive Bayes Classification
CALCULATION OF POSTERIOR PROBABILITY
• Although the prior probabilities indicate that X may belong to GREEN (given that there
are twice as many GREEN compared to RED) the likelihood indicates otherwise.
• The class membership of X is RED (given that there are more RED objects in the vicinity
of X than GREEN).
• In Bayesian analysis, the final classification is produced by combining both sources of
information, i.e., the prior and the likelihood, to form a posterior probability using
Bayes' rule (named after Rev. Thomas Bayes 1702-1761).
Naive Bayes Classification
Finally, we classify X as RED since its class membership achieves the largest posterior probability.
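The whole calculation above can be checked numerically: 40 green and 20 red objects overall, with 1 green and 3 red neighbors inside the circle around X.

```python
# Worked version of the example: priors from class counts, likelihoods from
# the neighbors of X inside the circle, posteriors via Bayes' rule.
n_green, n_red = 40, 20
total = n_green + n_red

prior_green = n_green / total          # 40/60
prior_red = n_red / total              # 20/60

# Likelihood: neighbors of X inside the circle, divided by each class total
likelihood_green = 1 / n_green         # 1/40
likelihood_red = 3 / n_red             # 3/20

# Posterior is proportional to prior * likelihood (the shared denominator cancels)
posterior_green = prior_green * likelihood_green   # 1/60
posterior_red = prior_red * likelihood_red         # 3/60

print("X is RED" if posterior_red > posterior_green else "X is GREEN")
```

Even though the prior favors GREEN, the larger RED likelihood dominates, so the posterior classifies X as RED.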
Machine Learning Algorithms
The next algorithm is K-Means clustering.
K-Means Clustering
• K-Means clustering is an algorithm that can be used for any type of grouping.
• Examples of K-Means clustering:
o Group images
o Detect activity types in motion sensors
o Detect bots or anomalies
o Segment by purchasing history
• Meaningful changes in data can be detected by monitoring to see if a tracked data point
switches groups over time.
K-Means Clustering: Use Cases
Behavioral segmentation:
• Segment by purchase history
• Segment by activities on application, website, or platform
• Define personas based on interests
• Create profiles based on activity monitoring
Inventory categorization:
• Group inventory by sales activity
• Group inventory by manufacturing metrics
Sorting sensor measurements:
• Detect activity types in motion sensors
• Group images
• Separate audio
• Identify groups in health monitoring
Detecting bots or anomalies:
• Separate valid activity groups from bots
• Group valid activity to clean up outlier detection
K-Means Clustering for Unsupervised Learning
• To run a K-Means algorithm, first randomly initialize K points called the cluster centroids.
• There are three cluster centroids in the image given below, since the data is grouped into three
clusters.
K-Means is an iterative algorithm that involves two steps:
• Step 1: Cluster assignment
• Step 2: Move centroid
K-Means Clustering for Unsupervised Learning
Step 1:
The algorithm travels through the data points and, depending on which centroid is closest,
assigns each point to the red, blue, or green cluster.
Step 2:
The algorithm calculates the average of all points in each cluster and moves the centroid to that average location.
K-Means Clustering for Unsupervised Learning
• Steps 1 and 2 are repeated until the clusters no longer change or a specified stopping
condition is met.
• K is chosen by the user; an elbow plot or silhouette score can help decide its value.
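The two steps above can be sketched directly in NumPy; the three blobs of points below are synthetic.

```python
# Minimal K-Means sketch showing the two iterative steps on toy 2-D data.
import numpy as np

rng = np.random.default_rng(0)
# Hypothetical data: three loose blobs of 20 points each
points = np.vstack([rng.normal(c, 0.3, size=(20, 2)) for c in ((0, 0), (3, 3), (0, 3))])

k = 3
# Random initialization: pick k data points as the starting centroids
centroids = points[rng.choice(len(points), k, replace=False)]

for _ in range(20):
    # Step 1: cluster assignment -- each point joins its nearest centroid
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    labels = dists.argmin(axis=1)
    # Step 2: move centroid -- each centroid moves to the mean of its points
    # (a centroid with no assigned points stays where it is)
    new_centroids = np.array([
        points[labels == i].mean(axis=0) if np.any(labels == i) else centroids[i]
        for i in range(k)
    ])
    if np.allclose(new_centroids, centroids):  # stop: clusters no longer change
        break
    centroids = new_centroids

print(np.bincount(labels, minlength=k))  # cluster sizes
```

The loop mirrors the slide's stopping rule: iterate the two steps until the assignments stabilize or an iteration cap is reached.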