
Machine Learning TECSE

Experiment No-9

Title: Ensemble Learning - AdaBoost Algorithm

Aim: To understand ensemble learning algorithms and their different types.


Theory:
Ensemble learning is a powerful technique in machine learning where multiple models are
combined to improve the overall performance of the system. The basic idea behind ensemble
learning is that by combining multiple models, each capturing different aspects of the data or
making different kinds of errors, the ensemble can make more accurate predictions than any
individual model.
There are several approaches to ensemble learning, including:
Voting: Different models make predictions, and the final prediction is determined by a majority
vote (for classification tasks) or averaging (for regression tasks).
Bagging (Bootstrap Aggregating): Multiple copies of the same model are trained on different
subsets of the training data (with replacement), and their predictions are averaged. Random
Forests are a popular example of this approach.
Boosting: Models are trained sequentially, with each new model focusing on the examples that
the previous models found difficult. Examples include AdaBoost and Gradient Boosting
Machines (GBM).
Stacking (Stacked Generalization): In stacking, the predictions of multiple models are used as
input features for a meta-model, which then makes the final prediction. This meta-model is often
a simple linear model, but it can also be more complex.
Ensemble methods are widely used in practice because they often result in better predictive
performance compared to individual models, especially when the individual models are diverse
and make different kinds of errors. They are particularly useful when dealing with complex,
high-dimensional datasets or when the underlying relationships in the data are not well
understood.
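As a minimal illustrative sketch (not part of the original experiment), the voting approach can be demonstrated with scikit-learn's VotingClassifier. The synthetic dataset and the choice of base models below are assumptions made purely for demonstration.

from sklearn.datasets import make_classification
from sklearn.ensemble import VotingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier
from sklearn.naive_bayes import GaussianNB
from sklearn.model_selection import train_test_split

# Illustrative synthetic classification dataset
X, y = make_classification(n_samples=500, n_features=10, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=42)

# Hard voting: each base model casts one vote and the majority class wins
ensemble = VotingClassifier(
    estimators=[
        ("lr", LogisticRegression(max_iter=1000)),
        ("dt", DecisionTreeClassifier(max_depth=3)),
        ("nb", GaussianNB()),
    ],
    voting="hard",
)
ensemble.fit(X_train, y_train)
print("Voting ensemble accuracy:", ensemble.score(X_test, y_test))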

Bagging:
Bagging, short for Bootstrap Aggregating, is a popular ensemble learning technique in machine
learning. It aims to improve the stability and accuracy of machine learning algorithms,
particularly decision trees and their variations.
Here's how bagging works:


Bootstrap Sampling: Bagging starts by creating multiple bootstrap samples from the original
dataset. Bootstrap sampling involves randomly selecting data points from the dataset with
replacement, meaning that the same data point can be selected multiple times or not at all.
Model Training: For each bootstrap sample, a base learner (usually a decision tree) is trained on
that sample. Because each bootstrap sample is different, each base learner is trained on a slightly
different subset of the original data.
Voting or Averaging: Once all base learners are trained, bagging combines their predictions
using a voting (for classification problems) or averaging (for regression problems) mechanism.
This aggregation helps to reduce variance and improve the overall performance of the model.
The key idea behind bagging is that training multiple models on different subsets of the data and combining their predictions reduces the variance of the final model and its tendency to overfit. This often leads to improved generalization performance, especially when dealing with complex or noisy datasets. Random Forest is one of the most well-known algorithms that uses the bagging technique.
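The three bagging steps above can be sketched with scikit-learn's BaggingClassifier. The dataset and hyperparameter values here are illustrative assumptions, not tuned settings.

from sklearn.datasets import make_classification
from sklearn.ensemble import BaggingClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# 50 decision trees, each trained on a bootstrap sample drawn with replacement;
# predictions are combined by majority vote
bagging = BaggingClassifier(
    DecisionTreeClassifier(),
    n_estimators=50,
    bootstrap=True,   # sample with replacement (bootstrap sampling)
    random_state=0,
)
bagging.fit(X_train, y_train)
print("Bagging accuracy:", bagging.score(X_test, y_test))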

Boosting:
Boosting is a machine learning ensemble technique that combines multiple weak learners to
create a strong learner. Weak learners are models that perform slightly better than random
guessing, such as decision trees with only a few nodes. Boosting works by training a series of
weak learners sequentially, with each one focusing on the instances that the previous learners
struggled with. In essence, it pays more attention to the mistakes of earlier models, hence
"boosting" their performance.
The most popular boosting algorithm is AdaBoost (Adaptive Boosting), which assigns weights to
each training instance and adjusts them at each iteration to focus on the harder-to-classify
instances. Gradient Boosting Machines (GBMs) like XGBoost, LightGBM, and CatBoost are
also widely used and have become the go-to methods for many machine learning competitions
and real-world applications due to their exceptional performance and flexibility.
Boosting algorithms are particularly effective for tasks such as classification and regression.
They often outperform individual models and other ensemble methods like bagging (e.g.,
Random Forests) when applied correctly. However, they can be sensitive to noisy data and
outliers, and they may require careful tuning of hyperparameters to achieve optimal performance.
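A short sketch of gradient boosting using scikit-learn's GradientBoostingClassifier is given below; the hyperparameter values are common illustrative defaults, not a tuned configuration, and the dataset is synthetic.

from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=1)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=1)

# Sequential ensemble of shallow trees; each new tree corrects the errors
# of the ensemble built so far
gbm = GradientBoostingClassifier(
    n_estimators=100,
    learning_rate=0.1,
    max_depth=3,
    random_state=1,
)
gbm.fit(X_train, y_train)
print("Gradient boosting accuracy:", gbm.score(X_test, y_test))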

AdaBoost:
AdaBoost, short for Adaptive Boosting, is a popular ensemble learning algorithm used in
machine learning, specifically for classification tasks. It was proposed by Yoav Freund and
Robert Schapire in 1996. The main idea behind AdaBoost is to combine multiple weak learners
(classifiers that perform slightly better than random guessing) to create a strong classifier.


Here's a brief overview of how AdaBoost works:


Initialization: Each training instance is assigned an equal weight initially.
Iterative Training: AdaBoost iteratively trains a series of weak classifiers on the training data. At
each iteration:
A weak learner (e.g., a shallow decision tree or perceptron) is trained on the weighted dataset, so it focuses more on the instances that were misclassified in previous iterations.
After training, the weak learner's weighted error on the training set is evaluated.
The contribution (weight) of the weak learner in the final ensemble is calculated from this error: a lower weighted error leads to a higher contribution.
Weight Update: After each iteration, the weights of misclassified instances are adjusted.
Misclassified instances are given higher weights to ensure they receive more attention in the next
iteration. This allows AdaBoost to focus on the difficult instances that were not correctly
classified in previous rounds.
Ensemble Construction: The final strong classifier is constructed by combining the weak
classifiers, giving more weight to those with higher accuracy.
Prediction: To make predictions on new data, AdaBoost combines the predictions of all weak
classifiers using a weighted majority vote or a weighted sum.
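For reference, the standard AdaBoost update rules (for labels y_i in {-1, +1}, weak hypothesis h_t, and weighted training error ε_t) are:

α_t = (1/2) · ln((1 − ε_t) / ε_t)        (contribution of the t-th weak learner)
w_i ← w_i · exp(−α_t · y_i · h_t(x_i)),  then normalize so the weights sum to 1

The final classifier predicts H(x) = sign(Σ_t α_t · h_t(x)).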
AdaBoost is particularly effective when used with weak learners that have a slight edge over
random guessing, such as shallow decision trees or perceptrons. Despite its age, AdaBoost
remains a powerful and widely used algorithm in machine learning, especially in scenarios where
interpretability and performance are both important.
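A minimal sketch of AdaBoost with decision stumps (depth-1 trees) as weak learners, using scikit-learn's AdaBoostClassifier; the synthetic dataset and hyperparameter values are illustrative assumptions.

from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier
from sklearn.tree import DecisionTreeClassifier
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=500, n_features=10, random_state=7)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=7)

# Decision stumps as weak learners, combined by a weighted majority vote
ada = AdaBoostClassifier(
    DecisionTreeClassifier(max_depth=1),  # weak learner: a decision stump
    n_estimators=100,
    learning_rate=1.0,
    random_state=7,
)
ada.fit(X_train, y_train)
print("AdaBoost accuracy:", ada.score(X_test, y_test))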

Conclusion: In this experiment, we have studied different ensemble learning techniques, including bagging, boosting, and the AdaBoost algorithm, along with their applications.
