EX NO: 09                 Simulate a boosting ensemble method for any dataset
DATE:
AIM:
        To simulate a boosting ensemble method for any dataset.
SOFTWARE REQUIRED:
       MATLAB
PROCEDURE
       Initialize the weak learners, dataset, prediction array, and learning parameters (η, T, error).
       Define the loss function for the different sample-label combinations.
       For each iteration, randomly sample a weighted training subset.
       For each learner, train on the weighted data, focusing on the harder cases.
       Accumulate the weighted predictions from all learners.
       Repeat for all iterations and record the training error.
       Display the final ensemble model and plot accuracy with a moving average (a minimal sketch of these steps follows).
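The weight-update loop described above can be illustrated with the following minimal MATLAB sketch. The synthetic dataset, decision-stump weak learner, and all variable names here are assumptions for demonstration only, not the recorded program.

% Minimal AdaBoost-style sketch on synthetic two-class data (illustrative;
% the data and names below are hypothetical).
rng(1);                                 % reproducibility
N = 200;                                % number of samples
X = randn(N, 2);                        % two features
Y = sign(X(:,1) + 0.5*X(:,2) + 0.2*randn(N,1));
Y(Y == 0) = 1;                          % labels in {-1, +1}
T = 50;                                 % boosting rounds
w = ones(N, 1) / N;                     % uniform initial sample weights
F = zeros(N, 1);                        % accumulated ensemble score
acc = zeros(T, 1);                      % training accuracy per round
for t = 1:T
  % Weak learner: decision stump chosen to minimize the weighted error
  bestErr = inf;
  for f = 1:size(X, 2)
    for thr = unique(X(:, f))'
      for pol = [-1 1]
        h = pol * sign(X(:, f) - thr);
        h(h == 0) = 1;
        err = sum(w .* (h ~= Y));
        if err < bestErr
          bestErr = err;
          bestH = h;
        end
      end
    end
  end
  alpha = 0.5 * log((1 - bestErr) / max(bestErr, eps)); % learner weight
  w = w .* exp(-alpha * Y .* bestH);    % up-weight misclassified samples
  w = w / sum(w);                       % renormalize to a distribution
  F = F + alpha * bestH;                % accumulate weighted predictions
  acc(t) = mean(sign(F) == Y);          % record ensemble training accuracy
end
% Plot per-round accuracy with its moving average
figure;
plot(1:T, acc, 'b.-'); hold on;
plot(1:T, movmean(acc, 5), 'r-', 'LineWidth', 2);
xlabel('Iteration'); ylabel('Training accuracy');
legend('Per-round', 'Moving average'); grid on;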
THEORY
Boosting, a supervised ensemble learning technique, is implemented here to simulate medical
diagnosis prediction based on patient symptom data. The model operates on a dataset where
each instance includes features and a diagnosis label: Positive or Negative. The ensemble
combines multiple weak classifiers (simple models such as decision stumps) trained
sequentially to focus on misclassified cases. An error function evaluates how well each
learner classifies the data, with higher weights assigned to incorrect predictions. Initially, all
samples are assigned equal weights to reflect uniform importance across the dataset. The
boosting process runs for 1000 iterations; in each iteration, a weak learner is trained
using the current weighted distribution of the data. After training, each learner's error is
calculated, and its weight is computed from its accuracy. Misclassified samples have their
weights increased, encouraging the next learner to focus on them in subsequent rounds.
This process reinforces correct predictions and mitigates individual model
weaknesses. Throughout the iterations, the cumulative training error is tracked to evaluate
ensemble improvement. After all rounds, the final ensemble is presented, representing a
strong classifier formed from many weak learners.
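For reference, the error and weight updates described above match the standard AdaBoost rules (a sketch of the mathematics, assuming labels \(y_i \in \{-1, +1\}\), weak learners \(h_t\), and sample weights \(w_i\)):

\[
\varepsilon_t = \sum_{i} w_i \,\mathbf{1}\{h_t(x_i) \neq y_i\}, \qquad
\alpha_t = \tfrac{1}{2}\ln\frac{1-\varepsilon_t}{\varepsilon_t},
\]
\[
w_i \leftarrow \frac{w_i\, e^{-\alpha_t y_i h_t(x_i)}}{Z_t}, \qquad
H(x) = \operatorname{sign}\Big(\sum_{t=1}^{T} \alpha_t h_t(x)\Big),
\]

where \(\varepsilon_t\) is the weighted error of learner \(h_t\), \(\alpha_t\) is its vote weight, and \(Z_t\) renormalizes the sample weights to a distribution.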
A graph is also plotted showing the prediction accuracy per iteration and its moving average,
highlighting steady performance improvement over time. This experiment demonstrates how
ensemble learning enables robust predictive modeling by leveraging the strengths of multiple
weak learners and iteratively refining performance.
PROGRAM
% Read the dataset
data = readtable('/MATLAB Drive/mlt/loan/loan_approval_dataset.csv');
% Remove 'loan_id' column if it exists
if ismember('loan_id', data.Properties.VariableNames)
  data.loan_id = [];
end
% Convert 'education' to categorical if it is a cell array
if iscell(data.education)
  data.education = categorical(data.education);
end
data.education = categorical(data.education); % Ensure it's categorical
% Create dummy variables for 'education'
eduDummies = dummyvar(data.education);
eduNames = categories(data.education);
% Add dummy variables to the dataset
for i = 1:length(eduNames)
  varName = matlab.lang.makeValidName(['Education_' eduNames{i}]);
  data.(varName) = eduDummies(:, i);
end
% Remove the original 'education' column
data.education = [];
% Convert 'self_employed' to categorical and then to binary (Yes = 1, No = 0)
if iscell(data.self_employed)
  data.self_employed = categorical(data.self_employed);
end
data.self_employed = double(data.self_employed == 'Yes');
% Create binary label for 'approved' based on 'cibil_score'
data.approved = double(data.cibil_score >= 700);
% Define core features
features = {'no_of_dependents', 'self_employed', 'income_annum', ...
        'loan_amount', 'loan_term', 'cibil_score'};
% Append the dummy variable names for education
eduCols = startsWith(data.Properties.VariableNames, 'Education_');
features = [features, data.Properties.VariableNames(eduCols)];
% Extract features (X) and labels (Y)
X = table2array(data(:, features));
Y = table2array(data(:, 'approved'));
% Normalize the features
X = normalize(X);
% Split the data into 70% training and 30% testing
cv = cvpartition(Y, 'HoldOut', 0.3);
XTrain = X(training(cv), :);
YTrain = Y(training(cv));
XTest = X(test(cv), :);
YTest = Y(test(cv));
% Train the SVM model with RBF kernel
SVMModel = fitcsvm(XTrain, YTrain, ...
  'KernelFunction', 'rbf', ...
  'Standardize', true, ...
  'ClassNames', [0 1]);
% Predict on the test set
YPred = predict(SVMModel, XTest);
% Generate confusion matrix
confMat = confusionmat(YTest, YPred);
TP = confMat(2,2);
FP = confMat(1,2);
FN = confMat(2,1);
TN = confMat(1,1);
% Compute evaluation metrics
precision = TP / (TP + FP);
recall = TP / (TP + FN);
f1 = 2 * (precision * recall) / (precision + recall);
accuracy = (TP + TN) / sum(confMat(:));
% Display results
fprintf('Accuracy: %.2f%%\n', accuracy * 100);
fprintf('Precision: %.2f\n', precision);
fprintf('Recall: %.2f\n', recall);
fprintf('F1-Score: %.2f\n', f1);
% Compute scores for ROC curve
[~, scores] = predict(SVMModel, XTest);
[Xroc, Yroc, ~, AUC] = perfcurve(YTest, scores(:,2), 1);
% Plot the ROC curve
figure;
plot(Xroc, Yroc, 'b-', 'LineWidth', 2);
xlabel('False Positive Rate');
ylabel('True Positive Rate');
title(['ROC Curve (AUC = ' num2str(AUC, '%.2f') ')']);
grid on;
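% The listing above trains an SVM; as a hedged sketch of the boosting method
% named in the AIM, the same train/test split could instead feed an
% AdaBoostM1 ensemble of decision stumps (XTrain, YTrain, XTest, YTest are
% reused from the split above).
stump = templateTree('MaxNumSplits', 1);  % decision-stump weak learner
BoostModel = fitcensemble(XTrain, YTrain, ...
  'Method', 'AdaBoostM1', ...             % adaptive boosting
  'NumLearningCycles', 100, ...           % boosting rounds
  'Learners', stump, ...
  'LearnRate', 0.1);
YPredBoost = predict(BoostModel, XTest);
fprintf('Boosted ensemble accuracy: %.2f%%\n', 100 * mean(YPredBoost == YTest));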
FLOWCHART:
                                     Figure 1: Flowchart
Output:
                           Figure 2: Output plot
RESULT:
    Thus, the simulation of a boosting ensemble for a given dataset was completed
successfully using MATLAB software.
CORE COMPETENCY:
    Thus, successfully learned how to simulate a boosting ensemble for a given dataset using
MATLAB software.
MARKS ALLOCATION:
Details                                         Marks Allotted   Vinantika E A   Vishmitha M
Preparation                                           20
Conducting                                            20
Calculation / Graphs                                  15
Results                                               10
Basic understanding (Core competency learned)         15
Viva                                                  10
Record                                                10
Total                                                100
                                                         Signature of faculty