B. P. Poddar Institute of Management and Technology
Machine Learning (PEC-CS701E) CA1-PPT
TOPIC: Ensemble Methods: Combining Multiple Models for Better Predictions [Supervised Learning]
SUBTOPICS: A Deep Dive into Bagging, Boosting, and Stacking
•   Soumyadeep Das (11500122094)
•   Akashdeep Naha (11500122070)
•   Angshook Banerjee (11500122068)
Introduction to Ensemble Learning
Good morning! Today, we delve into the fascinating world of Ensemble Learning, a powerful technique in machine learning that combines
multiple models to achieve superior predictive performance compared to any single model. This approach is crucial for building robust and
accurate predictive systems across various domains.
1. The Power of Many: Explore why combining models enhances overall prediction accuracy and generalisation.
2. Key Ensemble Techniques: Focus on three primary methods: Bagging, Boosting, and Stacking.
3. Presentation Roadmap: Outline the structure of our discussion, ensuring clarity and coherence.
Bagging: Bootstrap Aggregating
Let's begin with Bagging, short for Bootstrap Aggregating. This method involves training multiple instances of the same learning algorithm
on different random subsets of the training data, then combining their predictions, typically through averaging for regression or voting for
classification.
•   Random Subsets: Models are trained on bootstrapped subsets of the original dataset, sampled with replacement. This introduces diversity among the models.
•   Notable Example: Random Forest is a prominent example of Bagging, where multiple decision trees are built and their results are aggregated.
•   Key Advantage: Bagging primarily reduces variance in predictions, making models less prone to overfitting, especially with complex base learners like decision trees.
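A minimal Python sketch of this idea with scikit-learn is given below; the synthetic dataset and hyperparameter values are illustrative assumptions rather than part of the slide content.

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Synthetic stand-in data; any tabular classification dataset could be used.
X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# 100 decision trees (scikit-learn's default base estimator for BaggingClassifier),
# each fit on a bootstrap sample; class predictions are combined by voting.
bagging = BaggingClassifier(n_estimators=100, bootstrap=True, random_state=42)
bagging.fit(X_train, y_train)
print("Bagging accuracy:", accuracy_score(y_test, bagging.predict(X_test)))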
Bagging Process Visualised
This diagram provides a clear illustration of how Bagging operates. Observe how
the original dataset is resampled to create multiple bootstrapped subsets, each
used to train an independent model.
The final prediction is derived by either averaging the outputs of individual
models (for regression tasks) or through a majority vote (for classification tasks),
leading to a more stable and accurate result.
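For those who prefer code to diagrams, the same workflow can be sketched from scratch; the dataset, the 25-model ensemble size, and the choice of decision trees are assumptions made purely for illustration.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=500, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

rng = np.random.default_rng(0)
models = []
for _ in range(25):
    # Bootstrapped subset: sample row indices with replacement.
    idx = rng.integers(0, len(X_train), size=len(X_train))
    models.append(DecisionTreeClassifier(random_state=0).fit(X_train[idx], y_train[idx]))

# Majority vote across the 25 independently trained trees (classification case);
# for regression, the individual predictions would be averaged instead.
votes = np.stack([m.predict(X_test) for m in models])      # shape: (25, n_test)
final = (votes.mean(axis=0) >= 0.5).astype(int)            # majority of binary labels
print("Manual bagging accuracy:", (final == y_test).mean())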
Boosting: Sequential Improvement
Next, we move to Boosting, a sequential ensemble technique that constructs models in a stepwise fashion. Each new model is specifically
designed to correct the errors made by the previously trained models, iteratively improving the overall performance.
1. Sequential Training: Models are built one after another, with each subsequent model focusing on the data points that were misclassified or poorly predicted by the preceding models.
2. Prominent Algorithms: Key algorithms include AdaBoost, Gradient Boosting, and the highly optimised XGBoost, all known for their strong predictive power.
3. Primary Advantage: Boosting primarily works to reduce bias, making models more accurate by focusing on difficult examples and adapting to complex patterns in the data.
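A hedged scikit-learn sketch of two of the algorithms named above follows (XGBoost's xgboost.XGBClassifier could be swapped in the same way); the synthetic dataset and settings are illustrative assumptions.

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, GradientBoostingClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Both build weak learners one after another; each new learner concentrates on
# the examples the current ensemble still predicts poorly.
for model in (AdaBoostClassifier(n_estimators=200, random_state=42),
              GradientBoostingClassifier(n_estimators=200, random_state=42)):
    model.fit(X_train, y_train)
    acc = accuracy_score(y_test, model.predict(X_test))
    print(type(model).__name__, "accuracy:", round(acc, 3))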
Boosting Process Visualised
This diagram visually explains the Boosting process. Notice how each model in
the sequence is built to give more weight to the data points that previous models
struggled with, leading to a refined and more accurate overall predictor.
This iterative correction mechanism allows Boosting algorithms to achieve high
accuracy, particularly in situations where initial models might have high bias.
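The reweighting idea in the diagram can be sketched from scratch in the style of AdaBoost; the number of rounds, the depth-1 stumps, and the dataset here are assumptions for illustration only.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=400, random_state=1)
y = np.where(y == 1, 1, -1)              # AdaBoost convention: labels in {-1, +1}

n = len(X)
w = np.full(n, 1.0 / n)                  # start with uniform sample weights
stumps, alphas = [], []

for _ in range(20):
    stump = DecisionTreeClassifier(max_depth=1).fit(X, y, sample_weight=w)
    pred = stump.predict(X)
    err = w[pred != y].sum()             # weighted error of this round's stump
    alpha = 0.5 * np.log((1 - err) / (err + 1e-12))
    w *= np.exp(-alpha * y * pred)       # misclassified points receive larger weights
    w /= w.sum()                         # renormalise to a probability distribution
    stumps.append(stump)
    alphas.append(alpha)

# Final prediction: sign of the alpha-weighted vote over all stumps.
agg = sum(a * s.predict(X) for a, s in zip(alphas, stumps))
print("Training accuracy:", (np.sign(agg) == y).mean())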
Bagging vs. Boosting: A Comparison
Understanding the nuances between Bagging and Boosting is crucial for effective model selection. While both are powerful ensemble
techniques, they address different aspects of model error (variance and bias, respectively).
Bagging: Parallel Training
•   Reduces variance.
•   Models are trained in parallel on independent bootstrap subsets.
•   Ideal when base models are prone to overfitting (e.g., complex decision trees).
Boosting: Sequential Refinement
•   Reduces bias.
•   Models are trained sequentially, each correcting the prior one's errors.
•   Ideal when base models underfit, improving performance on challenging datasets.
    Both techniques are invaluable, but their optimal application depends on the specific characteristics of your data and the type of errors
    you aim to minimise.
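A small sketch of this contrast, assuming scikit-learn ≥ 1.2 (for the estimator keyword) and a synthetic dataset: Bagging is given a high-variance learner (deep trees) to stabilise, while Boosting is given a high-bias learner (stumps) to strengthen.

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, BaggingClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=1000, n_features=20, random_state=7)

# High-variance base learner, stabilised by Bagging.
bagged_deep_trees = BaggingClassifier(
    estimator=DecisionTreeClassifier(max_depth=None), n_estimators=100, random_state=7)
# High-bias base learner, strengthened by Boosting.
boosted_stumps = AdaBoostClassifier(
    estimator=DecisionTreeClassifier(max_depth=1), n_estimators=100, random_state=7)

for name, model in [("Bagging (deep trees)", bagged_deep_trees),
                    ("Boosting (stumps)", boosted_stumps)]:
    print(name, round(cross_val_score(model, X, y, cv=5).mean(), 3))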
Stacking: The Ensemble of Ensembles
Finally, we explore Stacking, also known as Stacked Generalisation. This advanced ensemble method takes the predictions of multiple
diverse base models and uses a separate meta-model (or "meta-learner") to learn how to optimally combine these predictions for the
final output.
•   Base Models: Multiple different learning algorithms (e.g., decision trees, support vector machines, neural networks) are trained on the same original dataset.
•   Meta-Model: A higher-level model is trained on the outputs (predictions) of the base models. This meta-model learns the strengths and weaknesses of each base model.
•   Synergistic Advantage: Stacking leverages the unique strengths of various models, often resulting in performance that surpasses any single base model or even simpler ensemble techniques like Bagging or Boosting.
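A minimal sketch using scikit-learn's StackingClassifier is shown below; the particular base models, meta-model, and dataset are illustrative assumptions.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier, StackingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

X, y = make_classification(n_samples=1000, n_features=20, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Diverse base models; their cross-validated predictions become the features
# on which the logistic-regression meta-model is trained.
stack = StackingClassifier(
    estimators=[("rf", RandomForestClassifier(random_state=42)),
                ("svm", SVC(probability=True, random_state=42))],
    final_estimator=LogisticRegression(),
    cv=5)
stack.fit(X_train, y_train)
print("Stacking accuracy:", accuracy_score(y_test, stack.predict(X_test)))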
Stacking Process Visualised
This diagram illustrates the sophisticated workflow of Stacking. You can see how
raw data is fed into multiple diverse base models, whose predictions then form a
new dataset for the meta-model to learn from, ultimately generating the final
prediction.
This hierarchical approach allows Stacking to capture complex non-linear
relationships between the base models' predictions and the true labels, leading
to highly accurate and robust models.
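The two-level workflow in the diagram can also be written out by hand, which makes the "new dataset of predictions" explicit; the base models, meta-model, and data here are assumptions for illustration.

import numpy as np
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import cross_val_predict, train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=800, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

base_models = [DecisionTreeClassifier(random_state=0), KNeighborsClassifier()]

# Level 0: out-of-fold predicted probabilities form the meta-model's training
# set, so no base model ever predicts on data it was fitted on.
meta_train = np.column_stack([
    cross_val_predict(m, X_train, y_train, cv=5, method="predict_proba")[:, 1]
    for m in base_models])

# Level 1: the meta-model learns how much to trust each base model.
meta_model = LogisticRegression().fit(meta_train, y_train)

# At prediction time, base models refit on the full training set feed the meta-model.
meta_test = np.column_stack([
    m.fit(X_train, y_train).predict_proba(X_test)[:, 1] for m in base_models])
print("Manual stacking accuracy:", accuracy_score(y_test, meta_model.predict(meta_test)))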
A Case Study: Predicting Customer Churn
Ensemble methods are incredibly powerful in real-world applications. Take
customer churn prediction in telecommunications. Identifying customers at risk
allows proactive intervention, significantly impacting revenue.
•   The Challenge: Traditional models often struggle with complex, non-linear patterns and imbalanced churn data, leading to suboptimal accuracy.
•   Ensemble Solution: Random Forest (Bagging) or Gradient Boosting (Boosting) can dramatically improve accuracy by combining weaker learners.
•   Key Benefits: Ensemble models provide robust, accurate predictions, enabling precise targeting of at-risk customers and better understanding of churn drivers.
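A hedged sketch of how such a churn model might be set up follows; the synthetic imbalanced dataset (standing in for real telecom data), the roughly 10% churn rate, and the Random Forest settings are all assumptions.

from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

# Hypothetical imbalanced data: roughly 10% of customers churn (class 1).
X, y = make_classification(n_samples=5000, n_features=15, weights=[0.9, 0.1],
                           random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, stratify=y, random_state=42)

# class_weight="balanced" counteracts the class imbalance mentioned above.
model = RandomForestClassifier(n_estimators=300, class_weight="balanced",
                               random_state=42)
model.fit(X_train, y_train)
print(classification_report(y_test, model.predict(X_test),
                            target_names=["stay", "churn"]))

# Feature importances offer a first look at likely churn drivers.
print("Top feature importances:", model.feature_importances_.round(3)[:5])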
Conclusion & Key Takeaways
In summary, Bagging, Boosting, and Stacking represent powerful strategies for enhancing machine learning model performance by
cleverly combining multiple individual learners.
•   Bagging: Variance Reduction. Effective in mitigating overfitting by averaging predictions from models trained on diverse data subsets.
•   Boosting: Bias Correction. Minimises bias through sequential training, where each model focuses on correcting the errors of its predecessors.
•   Stacking: Synergistic Combination. Leverages the unique strengths of various models via a meta-model, often yielding superior predictive accuracy.
References
•   Breiman, L. (1996). "Bagging Predictors." Machine Learning, 24(2), 123-140.
•   Freund, Y., & Schapire, R. (1997). "A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting." Journal of
    Computer and System Sciences, 55(1), 119-139.
•   Zhou, Z.-H. (2012). Ensemble Methods: Foundations and Algorithms. CRC Press.