This lecture introduces decision trees, which can be used for classification or regression problems in machine learning. Decision trees visually represent features and their values to partition a dataset and predict discrete or continuous outcomes. The goal in machine learning is to automatically build an optimal tree from a dataset to best fit the data and achieve high predictive performance. Various algorithms evaluate features to construct trees that minimize prediction error by recursively splitting the dataset based on discriminative power of features.


Machine Learning (ML)

Prof. Carl Gustaf Jansson


Prof. Henrik Boström
Prof. Fredrik Kilander
Department of Computer Science and Engineering
KTH Royal Institute of Technology, Sweden

Lecture 12
Decision Trees

Welcome to the second lecture of the third week of the course in machine learning. This lecture will be about decision trees. Let us start with some general characteristics of this representation. Decision trees were originally developed in decision analysis, where a tree can be used to visually and explicitly represent decision alternatives and the choice among options in various situations. Typically a decision tree is drawn upside down, with its root at the top. Nodes in the tree represent features, edges represent feature values or feature intervals, and these values or intervals can embody decision options. Leaves represent either discrete values, typically situations or classes, or continuous outcomes. In the discrete case we essentially talk about classification of entities and situations, while in the continuous case we talk about regression. In a decision-making scenario, decision trees are typically used in a purely normative or prescriptive mode: we set up a tree that serves as a guide for how to behave, and the tree is simply stipulated. The simple example to the right is such a manual for how to act: you wake up in the morning and check whether it is raining; if it is not raining you do nothing, while if it is raining you check whether it is also windy; if it is not windy you can bring an umbrella, and so on. The setting in machine learning is slightly different, because when we work with decision trees in machine learning we essentially want to build a relevant tree from data sets collected in the domain. The tree is not predefined and not prescriptive; it should be true to the data considered. The challenge is therefore to design an optimal tree, one that makes the best
fit of the considered data items and has the best predictive performance for new data items.
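As a concrete illustration of such a stipulated, prescriptive tree, the rule of thumb from the slide can be written directly as nested conditions. This is only a minimal sketch; the feature names, the exact actions, and the case where it is both raining and windy are assumptions made for illustration, since the slide leaves them open.

```python
def morning_decision(raining: bool, windy: bool) -> str:
    """A stipulated (prescriptive) decision tree: each test corresponds to a node,
    each branch to a feature value, and each return value to a leaf decision."""
    if not raining:              # root node tests the feature 'raining'
        return "do nothing"
    if windy:                    # second-level node tests the feature 'windy'
        return "stay inside"     # assumed action; the lecture only says 'and so on'
    return "bring an umbrella"

print(morning_decision(raining=True, windy=False))  # -> bring an umbrella
```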

The interdisciplinary sources of inspiration for this representation, as has already been mentioned, are decision-making theory in economics and business, which have for a long time used simple models of this kind. As also noted, these models are normally defined by hand and they are prescriptive. There is nevertheless an interesting parallel, because when we build this kind of tree from data sets, with the aim of obtaining a picture that is close to the data, we can expect that even when a person or a group of persons defines such a tree manually, the way they do it is indirectly and informally based on earlier experience; this is the normal case. So learning takes place at some point in any case: either it is done informally among the people involved, or it is done formally in the form of machine learning. The core components and the core problem solving for this representation are as follows. When you construct the tree, you construct it upside down with the root at the top. If you do this manually, you intuitively choose some order of the features, represent the features as nodes, and from each node you branch the tree by looking at the values, so that the edges at every level represent feature values or feature intervals. Finally, when you come to a leaf, that leaf represents the discrete or continuous outcome. That is the build-up of the tree. The use of the tree is also straightforward: you start from the top and evaluate the values of the features in the given order, and eventually you end up in a leaf with a unique outcome. We now turn to learning for this representation, which means that we want to build up a decision tree for a realistic
domain with many features and many data items.
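To make the build-up and use of a tree concrete, here is a minimal sketch of one possible representation: internal nodes store a feature name and one child per feature value, and prediction simply walks from the root to a leaf. The dictionary layout and the example tree are assumptions made for illustration, not a format prescribed by the lecture.

```python
# A node is either a leaf outcome or a dict:
#   {"feature": <name>, "children": {<feature value>: subtree, ...}}
# This layout is an illustrative assumption, not a fixed standard.

weather_tree = {
    "feature": "outlook",
    "children": {
        "sunny":    {"feature": "windy",
                     "children": {True: "stay in", False: "play outside"}},
        "rainy":    "stay in",
        "overcast": "play outside",
    },
}

def predict(tree, example):
    """Walk from the root to a leaf by following the edge that matches
    the example's value for the feature tested at each node."""
    while isinstance(tree, dict):             # internal node
        value = example[tree["feature"]]      # evaluate the feature in the given order
        tree = tree["children"][value]        # follow the matching edge
    return tree                               # leaf: a discrete or continuous outcome

print(predict(weather_tree, {"outlook": "sunny", "windy": False}))  # -> play outside
```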

We consider two kinds of decision tree analysis: classification tree analysis, where the leaves are the classes or categories that we want to predict, and regression tree analysis, where the leaves are intervals or real numbers. The challenge is to design a tree that optimizes both the fit to the considered data items and the predictive performance on still unseen data items; a minimal prediction error is what we aim at. This is not trivial, because we may have many features to look at, and for most domains it is not crystal clear which is the most important or most discriminatory feature to use first. The idea is to start building the tree using the features that best discriminate among the data, and there are several approaches that can be combined in the learning techniques. One kind of technique is proactive: at every point we evaluate the whole data set with respect to a potential selection of a feature and analyse the discriminatory power of that feature. Typically this evaluation uses some kind of information-theoretic measure on the whole situation; one such approach is referred to as information gain and is based on the concept of entropy, and another approach is called the Gini impurity measure, but the purpose in all these cases is to judge the discriminatory power of the feature. Another kind of measure is to first construct the tree and then, at a later stage in the process, prune it, and here too there are different techniques and criteria for what constitutes good pruning. A third approach is to grow several trees in parallel instead of just one, so that through the design process we generate not just a tree but a forest, and then, further on in the process, evaluate which is the most optimal tree within the forest. One criterion that is always important to keep in the back of one's head in these matters is the very well-known principle called Occam's razor: if there is a choice of structure, you
should always prefer the simplest choice, or the simplest acceptable tree in this case.
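As a rough sketch of the split-selection measures just mentioned, the following computes entropy, information gain for a candidate feature, and Gini impurity over class labels. The helper names and the tiny example data are assumptions for illustration; the actual algorithms discussed next week wrap these measures in considerably more machinery.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy H = -sum p_i * log2(p_i) over the class distribution."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def gini(labels):
    """Gini impurity G = 1 - sum p_i^2 over the class distribution."""
    total = len(labels)
    return 1 - sum((c / total) ** 2 for c in Counter(labels).values())

def information_gain(rows, labels, feature):
    """Entropy of the whole set minus the weighted entropy of the subsets
    obtained by splitting on `feature` (rows are dicts, labels the classes)."""
    total = len(labels)
    subsets = {}
    for row, label in zip(rows, labels):
        subsets.setdefault(row[feature], []).append(label)
    remainder = sum(len(s) / total * entropy(s) for s in subsets.values())
    return entropy(labels) - remainder

# Tiny illustrative data set (an assumption, not taken from the lecture):
rows = [{"windy": w} for w in [True, True, False, False, False]]
labels = ["stay in", "stay in", "go out", "go out", "stay in"]
print(information_gain(rows, labels, "windy"), gini(labels))
```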

If we look at what happens to the data set when we build a preliminary tree, we can observe the following. We start with a population at the root; in this example there are 14 data items. We then make the first split using one feature, splitting into three possibilities, which are the values of that feature. By doing that we also split the data set, the population at the root, into three boxes. In the second step, when we make the second split using some other feature, we split again, and it should always hold that at every step the sizes of the boxes created sum up to the size of the data set, which here was 14. As you see, the first split divided the data into three boxes; in the second step we do not go further with the third option but split the two first options further, so that altogether the five resulting leaves partition the data set as a whole. Another perspective on what is going on in this process is to compare a preliminary tree with another graphical depiction of the feature space. In a very simple case with two features, each split with respect to one breaking point, the example illustrates how the constructed tree partitions a two-dimensional depiction of this little feature space: one quarter of the area represents one option or outcome, and three quarters of the area
represents the other option or outcome.
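The bookkeeping point about the 14 items can be checked mechanically: each split partitions the rows it receives, so the sizes of the boxes at any level must add up to the size of the root set. The split helper and the concrete feature values below are a sketch under that assumption, not data from the lecture's slide.

```python
from collections import defaultdict

def split_by(rows, feature):
    """Partition a list of row-dicts into boxes keyed by the feature's value."""
    boxes = defaultdict(list)
    for row in rows:
        boxes[row[feature]].append(row)
    return boxes

# 14 items with a three-valued feature, echoing the example in the lecture
# (the concrete values are an illustrative assumption):
rows = [{"outlook": o} for o in ["sunny"] * 5 + ["overcast"] * 4 + ["rainy"] * 5]

boxes = split_by(rows, "outlook")
sizes = {value: len(items) for value, items in boxes.items()}
print(sizes)                              # -> {'sunny': 5, 'overcast': 4, 'rainy': 5}
assert sum(sizes.values()) == len(rows)   # the boxes partition the root set of 14 items
```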

I will end this lecture by showing you a few examples. The intention with the first example is to show you a complete picture: a whole data set with a number of data items and a corresponding tree. Whether that tree is optimal we are not going to discuss at this point; we will come back to those issues next week, when we look into a variety of algorithms that have been designed for this kind of problem. This example is related to the analysis of election outcomes in the U.S. The features are essentially the outcomes in various states, so the ordering of the features is based on the importance of those outcomes for the final result. As you see, this has been done here in such a way that the tree is reasonably well behaved, in the sense that one of the alternatives, depicted in blue, is concentrated towards the left, while the other alternative, the red outcomes, is sorted to the right. I give you two more examples here and will not say much about them, just a few comments. The left one is a classification example and the right one is a regression example. In the left one the classification has essentially two outcomes: a child gets a Christmas gift or a child does not get a Christmas gift; those are the only two outcomes, and as you see the outcome depends on certain behaviours of the child. I have no further comments on the regression tree example, except that you can see in it what was illustrated earlier: the population of data items in the data set is, at every level, distributed among the leaf nodes at that point. This was the end of this lecture; thanks for your attention. The next lecture will be on the topic of Bayesian belief networks. Thank you and goodbye.
