Random Forest (RF)
Random Forest (RF) is one of the many machine learning algorithms used for supervised learning,
meaning that it learns from labelled data and makes predictions based on the learned patterns. RF
can be used for both classification and regression tasks.
Decision trees
RF is based on decision trees. In machine learning decision trees are a technique for creating
predictive models. They are called decision trees because the prediction follows several
branches of “if… then…” decision splits - similar to the branches of a tree.
If we imagine that we start with a sample for which we want to predict a class, we would
start at the bottom of a tree and travel up the trunk until we come to the first split-off
branch. This split can be thought of as a feature in machine learning; let's say it is
"age": we would now make a decision about which branch to follow: "if our sample has an
age bigger than 30, continue along the left branch, else continue along the right branch".
We would repeat this decision process at every branch we reach, until there are no more
branches before us. This endpoint is called a leaf and in decision trees represents the
final result: a predicted class or value.
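A single tree's prediction can be sketched as a chain of such if/else rules; the features, thresholds, and classes below are invented purely for illustration:

```python
# A minimal sketch of how one decision tree predicts: follow the
# "if... then..." splits until a leaf is reached. All feature names,
# thresholds, and class labels here are made up.

def predict(sample: dict) -> str:
    # First split: the "age" feature with a threshold of 30.
    if sample["age"] > 30:
        # Second split on this branch: the "income" feature.
        if sample["income"] > 50_000:
            return "class A"   # leaf: final predicted class
        return "class B"       # leaf
    # The other branch ends directly in a leaf.
    return "class B"           # leaf

print(predict({"age": 42, "income": 60_000}))  # class A
print(predict({"age": 25, "income": 80_000}))  # class B
```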
At each branch, the feature and threshold that best split the (remaining) samples locally
are found.
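One common way to score candidate splits is Gini impurity. A toy sketch of this local search, assuming a single numeric feature and binary labels (the data is made up), could look like this:

```python
# Sketch of a local split search: for one numeric feature, try each
# candidate threshold and keep the one with the lowest weighted Gini
# impurity of the two resulting groups. Real implementations repeat
# this for every feature and pick the best overall split.

def gini(labels):
    # Gini impurity of a group of binary labels (0/1).
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n           # fraction of positive labels
    return 1.0 - p**2 - (1 - p)**2

def best_threshold(values, labels):
    best_t, best_score = None, float("inf")
    for t in sorted(set(values)):
        left  = [y for x, y in zip(values, labels) if x <= t]
        right = [y for x, y in zip(values, labels) if x > t]
        # Weighted average of the two groups' impurities.
        score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if score < best_score:
            best_t, best_score = t, score
    return best_t

ages   = [22, 25, 28, 35, 40, 45]
labels = [0, 0, 0, 1, 1, 1]        # perfectly separable at age 28
print(best_threshold(ages, labels))  # 28
```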
Single decision trees are very easy to visualize and understand because they follow a
method of decision-making that is very similar to how we humans make decisions: with a
chain of simple rules. However, they are not very robust, i.e. they don’t generalize well to
unseen samples. Here is where Random Forests come into play.
Ensemble learning
RF makes predictions by combining the results from many individual decision trees - so we call the
model a forest of decision trees. Because RF combines multiple models, it falls under the category of
ensemble learning. Other ensemble learning methods are gradient boosting and stacked ensembles.
Combining decision trees
There are two main ways for combining the outputs of multiple decision trees into a random forest:
1. Bagging, also called bootstrap aggregation (the default method used in Random Forests)
Decision trees are trained on randomly sampled subsets of the data, where sampling is
done with replacement.
A big advantage of bagging over individual trees is that it decreases the variance of the
model. Individual trees are very prone to overfitting and are very sensitive to noise in
the data. As long as our individual trees are not correlated, combining them with
bagging will make them more robust without increasing the bias.
We remove (most of) the correlation by randomly sampling subsets of data and training
the different decision trees on these subsets instead of on the entire dataset.
In addition to randomly sampling instances from our data, RF also uses feature bagging.
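The two kinds of random sampling can be sketched with Python's standard library (the toy dataset and feature names are placeholders):

```python
import random

# Sketch of the per-tree sampling a random forest uses:
# - a bootstrap sample of the rows, drawn WITH replacement
# - a feature subset, drawn WITHOUT replacement (feature bagging)
# The dataset and feature names are made up for illustration.

random.seed(42)  # for reproducibility

data = list(range(10))                       # stand-in for 10 training samples
features = ["age", "income", "height", "weight"]

def bootstrap_sample(rows):
    # With replacement: the same row can be drawn several times.
    return random.choices(rows, k=len(rows))

def feature_subset(names, k=2):
    # Without replacement: each tree sees only k distinct features.
    return random.sample(names, k)

for tree_id in range(3):
    print(tree_id, bootstrap_sample(data), feature_subset(features))
```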
2. Boosting (used in Gradient Boosting Machines)
The samples are weighted for sampling so that samples which were predicted incorrectly
get a higher weight and are therefore sampled more often.
The idea behind this is that difficult cases should be emphasized during learning compared
to easy cases.
Because of this difference, bagging can be easily parallelized, while boosting is performed
sequentially.
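The reweighting idea can be sketched as follows; the doubling factor is purely illustrative, as real boosting algorithms such as AdaBoost derive the weight update from the model's error rate:

```python
import random

# Sketch of the boosting idea: after a round of training, samples the
# current model got wrong receive a higher weight, so they are drawn
# more often when training the next model. The factor of 2.0 is an
# illustrative assumption, not taken from any real algorithm.

def reweight(weights, correct, factor=2.0):
    new = [w if ok else w * factor for w, ok in zip(weights, correct)]
    total = sum(new)
    return [w / total for w in new]          # renormalize to sum to 1

weights = [0.25, 0.25, 0.25, 0.25]           # start with uniform weights
correct = [True, True, False, True]          # sample 2 was misclassified

weights = reweight(weights, correct)
print(weights)   # sample 2 now carries twice the weight of the others

# The next round's training set is drawn in proportion to these weights:
random.seed(0)
next_sample = random.choices(range(4), weights=weights, k=8)
```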
Final Result
The final result of our model is calculated by combining the predictions of all these sampled trees:
by majority vote for classification, or by averaging the predicted values for regression.
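Given one prediction per tree, the two combination rules might look like this (the tree outputs below are made up):

```python
from collections import Counter

# Sketch of how a forest combines its trees' outputs:
# majority vote for classification, the mean for regression.

def majority_vote(predictions):
    # Classification: the class predicted by most trees wins.
    return Counter(predictions).most_common(1)[0][0]

def average(predictions):
    # Regression: the mean of the trees' predicted values.
    return sum(predictions) / len(predictions)

class_preds = ["A", "B", "A", "A", "B"]   # one prediction per tree
value_preds = [3.1, 2.9, 3.4, 3.0]

print(majority_vote(class_preds))  # A
print(average(value_preds))
```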
Hyperparameters
Hyperparameters are the arguments that can be set before training and which define how
the training is done.
The main hyperparameters in Random Forests are:
o The number of decision trees to be combined
o The maximum depth of the trees
o The maximum number of features considered at each split
o Whether bagging/bootstrapping is performed with or without replacement
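Assuming scikit-learn is used, these hyperparameters map onto the RandomForestClassifier constructor roughly as follows (parameter names per scikit-learn's API; the values are arbitrary examples):

```python
from sklearn.ensemble import RandomForestClassifier

# Example values only; suitable settings depend on the dataset.
rf = RandomForestClassifier(
    n_estimators=100,     # number of decision trees to be combined
    max_depth=5,          # maximum depth of each tree
    max_features="sqrt",  # max number of features considered at each split
    bootstrap=True,       # sample with replacement (bagging)
    random_state=42,      # for reproducibility
)
```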
Pros and Cons of Random Forests:
Pros
They are a relatively fast and powerful algorithm for classification and regression learning.
Calculations can be parallelized, they perform well on many problems (even with small
datasets), and the output includes prediction probabilities.
Cons
They are black boxes, meaning that we can't easily interpret the decisions made by the model
because they are too complex.
RF is also somewhat prone to overfitting, and it tends to be bad at predicting
underrepresented classes in unbalanced datasets.
Boosting
The idea of boosting grew out of the question of whether a weak learner can be modified to
become better.
A weak hypothesis or weak learner is defined as one whose performance is at least slightly
better than random chance.