BOOSTING (ADABOOST ALGORITHM)
Eric Emer
Consider Horse-Racing Gambler
Rules of thumb for determining win/loss:
- Most favored odds
- Fastest recorded lap time
- Most wins recently, say, in the past month
It is hard to determine how the gambler combines analysis of the feature set into a single bet.
Consider MIT Admissions
- 2-class system (Admit/Deny)
- Both quantitative and qualitative data
- We consider (Y/N) answers to be quantitative (-1, +1); region, for instance, is qualitative.
Rules of Thumb, Weak Classifiers
- Easy to come up with rules of thumb that correctly classify the training data at better than chance.
  E.g., IF GoodAtMath == Y THEN predict Admit.
- Difficult to find a single, highly accurate prediction rule.
- This is where boosting, via the AdaBoost algorithm, helps us: it combines many weak rules of thumb into a single accurate classifier.
What is a Weak Learner?
- For any distribution, with high probability, given polynomially many examples and polynomial time, we can find a classifier with generalization error better than random guessing:
  generalization error $\epsilon < \frac{1}{2}$, also written as $\frac{1}{2} - \gamma$ for some edge $\gamma > 0$.
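As a concrete illustration, a decision stump (a threshold rule on a single feature) is a common choice of weak classifier. The sketch below, in Python with NumPy, finds the stump with the lowest weighted error; the function name and interface are assumptions for illustration, not part of the original slides.

import numpy as np

def train_stump(X, y, weights):
    """Find the single-feature threshold rule with the lowest weighted error.

    X: (n_samples, n_features) array, y: labels in {-1, +1},
    weights: distribution over training examples (sums to 1).
    Returns ((feature index, threshold, sign), weighted error).
    """
    n_samples, n_features = X.shape
    best, best_err = None, np.inf
    for j in range(n_features):
        for thresh in np.unique(X[:, j]):
            for sign in (+1, -1):
                # predict +sign above the threshold, -sign below it
                pred = sign * np.where(X[:, j] > thresh, 1, -1)
                err = np.sum(weights[pred != y])
                if err < best_err:
                    best_err, best = err, (j, thresh, sign)
    return best, best_err

Under the weak learning assumption, the returned weighted error should be below 1/2, i.e. the stump has some edge $\gamma$ over random guessing.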
Weak Learning Assumption
- We assume that our weak learning algorithm (weak learner) can consistently find weak classifiers: rules of thumb which classify the data correctly at better than 50%.
- Given this assumption, we can use boosting to generate a single weighted classifier which correctly classifies our training data at 99%-100%.
AdaBoost Specifics
- How does AdaBoost weight training examples optimally? It focuses on the difficult data points: those that have been misclassified most by the previous weak classifiers.
- How does AdaBoost combine these weak classifiers into a comprehensive prediction? It uses an optimally weighted majority vote of the weak classifiers.
AdaBoost Technical Description
Missing details: How do we generate the distribution $D_t$? How do we get a single classifier?
Constructing $D_t$

$D_1(i) = \frac{1}{m}$

Given $D_t$ and $h_t$:

$D_{t+1}(i) = \frac{D_t(i)}{Z_t} \times \begin{cases} e^{-\alpha_t} & \text{if } y_i = h_t(x_i) \\ e^{\alpha_t} & \text{if } y_i \neq h_t(x_i) \end{cases} = \frac{D_t(i)}{Z_t}\, e^{-\alpha_t y_i h_t(x_i)}$

where $Z_t$ is a normalization constant and

$\alpha_t = \frac{1}{2} \ln\!\left(\frac{1 - \epsilon_t}{\epsilon_t}\right) > 0$
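A minimal sketch of this update in Python/NumPy, assuming `y` holds the ±1 labels and `h_pred` holds the weak classifier's ±1 predictions on the training set (the names are assumptions for illustration):

import numpy as np

def update_distribution(D, y, h_pred):
    """One AdaBoost round: compute epsilon_t, alpha_t, and the new distribution D_{t+1}."""
    eps = np.sum(D[h_pred != y])                # weighted error of h_t under D_t
    alpha = 0.5 * np.log((1 - eps) / eps)       # alpha_t = 1/2 ln((1 - eps_t) / eps_t)
    D_new = D * np.exp(-alpha * y * h_pred)     # down-weight correct, up-weight incorrect examples
    D_new /= D_new.sum()                        # divide by Z_t so D_{t+1} is a distribution
    return D_new, alpha, eps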
Getting a Single Classifier
$H_{final}(x) = \text{sign}\!\left(\sum_t \alpha_t h_t(x)\right)$
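Putting the pieces together, a sketch of the full training loop and the weighted-majority vote. It assumes a `train_weak_learner(X, y, D)` callable returning an object with a `.predict(X)` method that outputs ±1 labels; that interface is an assumption for illustration, not part of the original slides.

import numpy as np

def adaboost(X, y, train_weak_learner, T=50):
    """Train T weak classifiers and return a weighted-vote predictor H_final."""
    m = len(y)
    D = np.full(m, 1.0 / m)                      # D_1(i) = 1/m
    hypotheses, alphas = [], []
    for t in range(T):
        h = train_weak_learner(X, y, D)          # weak learner trained on distribution D_t
        pred = h.predict(X)                      # predictions in {-1, +1}
        eps = np.sum(D[pred != y])               # weighted training error eps_t
        alpha = 0.5 * np.log((1 - eps) / eps)    # vote weight alpha_t
        D = D * np.exp(-alpha * y * pred)        # re-weight examples
        D /= D.sum()                             # normalize by Z_t
        hypotheses.append(h)
        alphas.append(alpha)

    def H_final(X_new):
        # sign of the alpha-weighted sum of weak classifier outputs
        votes = sum(a * h.predict(X_new) for a, h in zip(alphas, hypotheses))
        return np.sign(votes)

    return H_final

Note that a weak classifier with lower weighted error $\epsilon_t$ receives a larger vote weight $\alpha_t$ in the final vote.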
Mini-Problem
Training Error Analysis
Claim: if we write the weighted error of each round as $\epsilon_t = \frac{1}{2} - \gamma_t$, then

$\text{training error}(H_{final}) \;\le\; \prod_t Z_t \;=\; \prod_t 2\sqrt{\epsilon_t(1-\epsilon_t)} \;=\; \prod_t \sqrt{1 - 4\gamma_t^2} \;\le\; \exp\!\left(-2\sum_t \gamma_t^2\right)$

Proof outline:
Step 1: Unwrap the recurrence for $D_{t+1}$.
Step 2: Show $\text{training error}(H_{final}) \le \prod_t Z_t$.
Step 3: Show $Z_t = 2\sqrt{\epsilon_t(1-\epsilon_t)}$.
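Filling in the three steps with the standard argument (a sketch using the definitions of $D_t$, $Z_t$, and $\alpha_t$ from the earlier slides):

Step 1 (unwrap the recurrence): with $f(x) = \sum_t \alpha_t h_t(x)$,
$D_{T+1}(i) = \frac{1}{m} \cdot \frac{\prod_t e^{-\alpha_t y_i h_t(x_i)}}{\prod_t Z_t} = \frac{1}{m} \cdot \frac{e^{-y_i f(x_i)}}{\prod_t Z_t}.$

Step 2 (bound by $\prod_t Z_t$): if $H_{final}(x_i) \neq y_i$ then $y_i f(x_i) \le 0$, so $\mathbf{1}[H_{final}(x_i) \neq y_i] \le e^{-y_i f(x_i)}$; averaging over $i$,
$\text{training error}(H_{final}) \le \frac{1}{m}\sum_i e^{-y_i f(x_i)} = \left(\prod_t Z_t\right) \sum_i D_{T+1}(i) = \prod_t Z_t.$

Step 3 (evaluate $Z_t$): splitting the sum over correctly and incorrectly classified examples and plugging in $\alpha_t$,
$Z_t = \sum_i D_t(i)\, e^{-\alpha_t y_i h_t(x_i)} = (1-\epsilon_t)\, e^{-\alpha_t} + \epsilon_t\, e^{\alpha_t} = 2\sqrt{\epsilon_t(1-\epsilon_t)} = \sqrt{1 - 4\gamma_t^2} \le e^{-2\gamma_t^2}.$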
How might test error react to AdaBoost?
We expect to encounter overfitting: by Occam's razor, test error should eventually increase as $H_{final}$ grows more complex with additional rounds.
Empirical results of test error
- Test error does not increase, even after 1000 rounds.
- Test error continues to drop even after training error reaches zero.
Difference from Expectation: The Margins Explanation
- Our training error only measures the correctness of classifications; it neglects the confidence of classifications.
- How can we measure the confidence of classifications?

$H_{final}(x) = \text{sign}(f(x))$, where $f(x) = \frac{\sum_t \alpha_t h_t(x)}{\sum_t \alpha_t} \in [-1, 1]$, and $\text{margin}(x, y) = y\, f(x)$.

- Margin(x, y) close to +1: high confidence, correct.
- Margin(x, y) close to -1: high confidence, incorrect.
- Margin(x, y) close to 0: low confidence.
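A small sketch of computing these normalized margins in Python/NumPy, assuming `alphas` (the vote weights) and `preds` (a T x n_samples array of ±1 weak classifier outputs on the training set) are available from training; those names are assumptions:

import numpy as np

def margins(alphas, preds, y):
    """Normalized margin y * f(x) in [-1, 1] for each training example.

    alphas: (T,) vote weights, preds: (T, n_samples) array of +/-1 weak
    classifier outputs, y: (n_samples,) true labels in {-1, +1}.
    """
    alphas = np.asarray(alphas)
    f = alphas @ preds / alphas.sum()   # f(x) = sum_t alpha_t h_t(x) / sum_t alpha_t
    return y * f                        # close to +1: confident and correct; close to -1: confident and wrong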
Empirical Evidence Supporting Margins Explanation
[Figure: cumulative distribution of margins on training examples]
Pros/Cons of AdaBoost
Pros
- Fast
- Simple and easy to program
- No parameters to tune (except T, the number of rounds)
- No prior knowledge needed about the weak learner
- Provably effective given the Weak Learning Assumption
- Versatile
Cons
- Weak classifiers that are too complex lead to overfitting.
- Weak classifiers that are too weak can lead to low margins, and can also lead to overfitting.
- From empirical evidence, AdaBoost is particularly vulnerable to uniform noise.
Predicting College Football Results
- Training data: 2009 NCAAF season
- Test data: 2010 NCAAF season