Ensemble Learning
Bagging and Boosting
Manu Madhavan
Outline
● Techniques to improve classification accuracy
● We focus on ensemble methods
● Bagging
● Boosting
○ Adaptive Boosting (AdaBoost)
● Random Forest
Ensemble Learning
● No solution is perfect!!
● Different algorithms may produce different solutions because they impose
different structures on the data
● No single algorithm is optimal
● Combine multiple models built on the given data into a single model of
better quality
What is an ensemble method?
● Ensemble is a Machine Learning concept in which the idea is to train
multiple models using the same learning algorithm
● An ensemble for classification is a composite model, made up of a
combination of classifiers
● The individual classifiers vote, and a class label prediction is returned by
the ensemble based on the collection of votes
● Ensembles tend to be more accurate than their component classifiers
Why?
Traditional learning models assume that the data
classes are well distributed.
In many real-world data domains, however, the
data are class-imbalanced, where the main class of
interest is represented by only a few tuples.
Ensemble learning is one way to improve
classification on class-imbalanced data.
Ensemble models also decrease the variance of a single
estimate, as they combine several estimates from
different models, so the result may be a model with
higher stability.
Ensemble Methods
● An ensemble combines a series of k learned models (or base classifiers),
M1, M2, . . . , Mk, with the aim of creating an improved composite
classification model, M∗
● A given data set, D, is used to create k training sets, D1, D2, . . . , Dk,
where Di (1 ≤ i ≤ k) is used to generate classifier Mi
● Given a new data tuple to classify, the base classifiers each vote by
returning a class prediction. The ensemble returns a class prediction
based on the votes of the base classifiers.
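As a rough illustration of this voting scheme (a minimal sketch, not code from these slides; the synthetic dataset and the choice of base classifiers are arbitrary assumptions), the combination step might look like this in Python:

# Majority-vote ensemble sketch: k base classifiers vote, M* returns the most-voted class.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import GaussianNB
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, n_features=10, random_state=0)

base_classifiers = [DecisionTreeClassifier(random_state=0),
                    LogisticRegression(max_iter=1000),
                    GaussianNB()]
for M in base_classifiers:
    M.fit(X, y)                                   # train each base classifier Mi

votes = np.array([M.predict(X) for M in base_classifiers])   # one row of class predictions per Mi
ensemble_pred = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)   # M*: most votes wins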
Bagging and Boosting
Bagging: a homogeneous weak learners’ model in which the learners are
trained independently of each other, in parallel, and then combined to
determine the model average.
Boosting: also a homogeneous weak learners’ model, but it works differently
from bagging: the learners are trained sequentially and adaptively, each one
trying to improve on the predictions of the previous ones.
Bagging - intuition
Suppose that you are a patient and would like to
have a diagnosis made based on your symptoms.
Instead of asking one doctor, you may choose to
ask several. If a certain diagnosis occurs more than
any other, you may choose this as the final or best
diagnosis.
That is, the final diagnosis is made based on a
majority vote, where each doctor gets an equal
vote.
Bagging
● Bootstrap aggregating
● Bootstrap is a sampling technique: random sampling with
replacement
● Given a set, D, of d tuples, bagging works as follows. For iteration i (i = 1, 2,
. . . , k), a training set, Di , of d tuples is sampled with replacement from
the original set of tuples, D.
● In bagging, each training set is a bootstrap sample
Bagging
● Because sampling with replacement is used, some of the original tuples of
D may not be included in Di , whereas others may occur more than once.
● A classifier model, Mi , is learned for each training set, Di .
● To classify an unknown tuple, X, each classifier, Mi , returns its class
prediction, which counts as one vote.
● The bagged classifier, M∗, counts the votes and assigns the class with the
most votes to X.
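A minimal sketch of this procedure, assuming decision trees as the base classifiers and a synthetic dataset (illustrative choices, not from the slides):

# Bagging sketch: k bootstrap samples of size d, one classifier per sample, majority vote.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=8, random_state=0)
d, k = len(X), 10
rng = np.random.default_rng(0)

models = []
for i in range(k):
    idx = rng.integers(0, d, size=d)              # Di: d tuples sampled with replacement from D
    models.append(DecisionTreeClassifier(random_state=i).fit(X[idx], y[idx]))   # Mi learned on Di

votes = np.array([Mi.predict(X) for Mi in models])            # each Mi casts one vote per tuple
bagged_pred = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)  # M*: most votes wins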
Random Forest
● Another ensemble method
● The “forest” it builds is an ensemble of decision trees, usually trained
with the “bagging” method.
● The general idea of the bagging method is that a combination of learning
models improves the overall result.
● Random forest builds multiple decision trees and merges them
together to get a more accurate and stable prediction.
Decision trees are the building blocks of the Random Forest algorithm
Decision Tree: Recap
● How to classify 1s and 0s
○ Color
○ Underline
● What feature will allow me to split
the observations at hand in a way
that the resulting groups are as
different from each other as
possible (and the members of each
resulting subgroup are as similar to
each other as possible)?
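As a reminder of how that question is usually answered numerically, here is a small sketch using the Gini impurity as the split criterion (one common choice; the boolean feature stands in for “colour” or “underlined” and is purely illustrative):

# Split-quality sketch: a good split produces child groups that are as pure as possible.
import numpy as np

def gini(labels):
    if len(labels) == 0:
        return 0.0
    _, counts = np.unique(labels, return_counts=True)
    p = counts / counts.sum()
    return 1.0 - np.sum(p ** 2)                   # 0 means a perfectly pure group

def split_impurity(feature, labels):
    # weighted Gini impurity of the two groups produced by a binary feature
    left, right = labels[feature], labels[~feature]
    n = len(labels)
    return (len(left) / n) * gini(left) + (len(right) / n) * gini(right)

labels = np.array([1, 1, 1, 0, 0, 0])             # the 1s and 0s to be classified
is_blue = np.array([True, True, True, False, True, False])   # hypothetical "colour" feature
print(split_impurity(is_blue, labels))            # lower impurity = better split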
Random Forest
● Consists of a large number of
individual decision trees that operate
as an ensemble.
● Each individual tree in the random
forest spits out a class prediction and
the class with the most votes
becomes our model’s prediction
https://towardsdatascience.com/understanding-random-forest-58381e0602d2
Random Forest
https://www.section.io/engineering-education/introduction-to-random-forest-in-machine-learning/
Random Forest
● Imagine that each of the classifiers in the ensemble is a decision tree
classifier so that the collection of classifiers is a “forest”.
● The individual decision trees are generated using a random selection of
attributes at each node to determine the split
● Each tree depends on the values of a random vector sampled
independently and with the same distribution for all trees in the forest.
● During classification, each tree votes and the most popular class is
returned.
Random Forest - working
● There are two stages in Random Forest algorithm
○ Random forest creation
○ Make a prediction from the random forest classifier
● Decision trees are very sensitive to the data they are trained on:
small changes to the training set can result in significantly different tree
structures.
● Random forest takes advantage of this by allowing each individual tree to
randomly sample from the dataset with replacement
○ Bagging
Random Forest - working
Random Forest creation
1. Randomly select “k” features from the total “m” features, where k << m
2. Among the “k” features, compute the best split point and create a node “d”
3. Split the node into daughter nodes using the best split
4. Repeat steps 1-3 until “l” number of nodes has been reached
5. Build the forest by repeating steps 1-4 “n” times to create “n” trees
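A rough Python sketch of these steps, leaning on scikit-learn's DecisionTreeClassifier for the per-node splitting; max_features approximates the random selection of k features at each split, and the numbers are illustrative assumptions:

# Forest-creation sketch: n trees, each grown on a bootstrap sample,
# each split considering only a random subset of k of the m features.
import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=300, n_features=20, random_state=0)
m = X.shape[1]
k = int(np.sqrt(m))                               # k << m; sqrt(m) is a common default
n_trees, max_nodes = 25, 31
rng = np.random.default_rng(0)

forest = []
for t in range(n_trees):
    idx = rng.integers(0, len(X), size=len(X))    # bootstrap sample of the training data (bagging)
    tree = DecisionTreeClassifier(max_features=k,           # steps 1-2: k random features per split
                                  max_leaf_nodes=max_nodes, # step 4: stop growing at l nodes
                                  random_state=t)
    forest.append(tree.fit(X[idx], y[idx]))       # steps 3 and 5: grow the tree, add it to the forest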
Random Forest - working
Random Forest prediction
1. Take the test features, use the rules of each randomly created
decision tree to predict the outcome, and store each predicted outcome
(target)
2. Calculate the votes for each predicted target
3. Take the most-voted predicted target as the final prediction from the
random forest algorithm
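Continuing the creation sketch above, prediction then reduces to counting votes over the trees (again an illustrative sketch; forest and X come from the previous snippet):

# Forest-prediction sketch: each tree votes, the most-voted class is the forest's answer.
import numpy as np

votes = np.array([tree.predict(X) for tree in forest])        # step 1: one prediction row per tree
final_pred = np.apply_along_axis(lambda v: np.bincount(v).argmax(), 0, votes)   # steps 2-3: tally and pick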
Random Forest - Advantages
1. For classification problems, the Random Forest algorithm reduces the risk of
overfitting compared with a single decision tree
2. The same random forest algorithm can be used for both classification and
regression tasks
3. The Random Forest algorithm can be used to identify the most
important features in the training dataset, i.e., for feature selection
Random Forest
Python example
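A minimal example along these lines, using scikit-learn's RandomForestClassifier on the Iris dataset (the dataset and parameter values are illustrative assumptions, not taken from the original slide):

# Random Forest with scikit-learn: train, predict, and inspect feature importances.
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.metrics import accuracy_score

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=42)

rf = RandomForestClassifier(n_estimators=100,     # number of trees in the forest
                            max_features="sqrt",  # random subset of features per split
                            random_state=42)
rf.fit(X_train, y_train)

print("Accuracy:", accuracy_score(y_test, rf.predict(X_test)))
print("Feature importances:", rf.feature_importances_)   # ties in with advantage 3 above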
Boosting
● Ensemble learning technique
Assume this situation:
● If a data point is incorrectly predicted by the first model, and then by the
next (and probably by all of them), will combining the predictions provide better
results?
● Such situations are taken care of by boosting.
● Boosting combines weak learners into a strong learner.
Boosting - intuition
Suppose that as a patient, you have
certain symptoms. Instead of
consulting one doctor, you choose to
consult several. Suppose you assign
weights to the value or worth of each
doctor’s diagnosis, based on the
accuracies of previous diagnoses they
have made. The final diagnosis is then
a combination of the weighted
diagnoses.
Boosting
● Boosting is a sequential process, where each subsequent model
attempts to correct the errors of the previous model.
https://medium.com/swlh/boosting-and-bagging-explained-with-examples-5353a36eb78d
Boosting
● Train model A on the whole training set
● Train model B with the data exaggerated in the regions where A
performs poorly (i.e., “pay more attention” to the training tuples that A
misclassified); each such model is a weak classifier
● The final boosted classifier, M∗, combines the votes of each individual
classifier into a strong classifier
Adaboost
Adaptive Boosting
1. We are given D, a data set of d class-labeled tuples, (X1 , y1 ), (X2 , y2 ), . . .
, (Xd , yd )
2. Initialise the weights: assign an equal weight of 1/d to each data
point
3. Provide this as input to the model and identify the wrongly classified data
points
4. Increase the weights of the wrongly classified data points
5. Repeat steps 3-4 until the required results are obtained
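scikit-learn packages this loop as AdaBoostClassifier; a minimal usage sketch (the dataset and parameters are illustrative assumptions):

# AdaBoost with scikit-learn: the default weak learner is a depth-1 decision tree (a "stump").
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

ada = AdaBoostClassifier(n_estimators=50,         # number of boosting rounds / weak learners
                         random_state=0)
ada.fit(X_train, y_train)                         # tuple weights are re-adjusted internally each round
print("Test accuracy:", ada.score(X_test, y_test))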
Adaboost
B1 - We have 10 points: 5 plus (+) and 5 minus (−).
Each one has been assigned an equal weight
initially.
The first model tries to classify the data
points and generates a vertical separator
line, but it wrongly classifies 3 pluses (+) as
minuses (−).
Adaboost
B2 consists of the 10 data points from the
previous model, in which the 3 wrongly
classified pluses (+) are weighted more, so that
the current model tries harder to classify
these pluses (+) correctly.
This model generates a vertical separator
line which correctly classifies the previously
misclassified pluses (+), but in this
attempt it wrongly classifies three
minuses (−).
Adaboost
B3 consists of the 10 data points from the
previous model, in which the 3 wrongly
classified minuses (−) are weighted more, so
that the current model tries harder to
classify these minuses (−) correctly.
This model generates a horizontal
separator line which correctly classifies the
previously misclassified minuses (−).
Adaboost
B4 combines together B1, B2 and B3 in
order to build a strong prediction model
which is much better than any individual
model used.
(Example from Geeksforgeeks)
Adaboost - how to find weak learners?
To compute the error rate of model Mi, we sum the weights of the tuples in Di that it misclassifies.
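In the standard formulation this is
error(Mi) = Σj wj · err(Xj), summed over the d tuples of Di,
where err(Xj) = 1 if Mi misclassifies tuple Xj and 0 otherwise, and wj is the weight of Xj in round i. If error(Mi) exceeds 0.5, Mi is abandoned and a new training set Di is sampled to derive a new Mi.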
Adaboost
If a tuple in round i was correctly classified, its weight is multiplied by
error(Mi)/(1 − error(Mi)).
Once the weights of all the correctly classified tuples are updated, the
weights for all tuples (including the misclassified ones) are normalized so
that their sum remains the same as it was before.
To normalize a weight, we multiply it by the sum of the old weights, divided
by the sum of the new weights. As a result, the weights of misclassified
tuples are increased and the weights of correctly classified tuples are
decreased
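A short sketch of this reweighting step (a minimal illustration, not code from the slides):

# AdaBoost reweighting sketch: shrink the weights of correctly classified tuples, then renormalize.
import numpy as np

def update_weights(w, correct, error):
    # w: tuple weights for this round; correct: boolean mask of correctly classified tuples
    # error: weighted error rate error(Mi) of this round's classifier, assumed to be < 0.5
    old_sum = w.sum()
    w_new = w.copy()
    w_new[correct] *= error / (1.0 - error)       # multiply correct tuples' weights by error/(1 - error)
    return w_new * (old_sum / w_new.sum())        # renormalize so the total weight stays the same

d = 10
w = np.full(d, 1.0 / d)                           # every tuple starts with weight 1/d
correct = np.array([True] * 7 + [False] * 3)      # suppose 3 tuples were misclassified this round
error = w[~correct].sum()                         # error(Mi): sum of the misclassified tuples' weights
print(update_weights(w, correct, error))          # misclassified tuples now carry relatively more weight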
Adaboost - ensemble prediction
Adaboost assigns a weight to each classifier’s vote, based on how well the
classifier performed.
The lower a classifier’s error rate, the more accurate it is, and therefore, the
higher its weight for voting should be.
For each class, c, we sum the weights of each classifier that assigned class c to
X. The class with the highest sum is the “winner” and is returned as the class
prediction for tuple X.
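Concretely, in the standard formulation the weight of classifier Mi's vote is
weight(Mi) = log((1 − error(Mi)) / error(Mi))
so the lower error(Mi) is, the larger Mi's contribution to the weighted vote for its predicted class.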