Dimensionality Reduction
In machine learning, “dimensionality” simply refers to the number of features
(i.e. input variables) in the dataset.
When the number of features is very large relative to the number of observations
in your dataset, certain algorithms struggle to train effective models. This is called the
“Curse of Dimensionality”, and it’s especially relevant for clustering algorithms that
rely on distance calculations.
We have two primary methods for reducing dimensionality:
1. Feature Selection
2. Feature Extraction
Feature Selection
Feature selection is for filtering irrelevant or redundant features from your dataset. The key difference between feature selection and extraction is that feature selection keeps a subset of the original features, while feature extraction creates brand-new ones.
To be clear, some supervised algorithms already have built-in feature selection, such
as Regularized Regression and Random Forests. As a stand-alone task, feature selection
can be unsupervised (e.g. Variance Thresholds) or supervised (e.g. Genetic Algorithms).
You can also combine multiple methods if needed.
Variance Thresholds
Variance thresholds remove features whose values don't change much from
observation to observation (i.e. their variance falls below a threshold). These
features provide little value. Because variance is dependent on scale, you should
always normalize your features first.
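To make this concrete, here's a minimal sketch using scikit-learn's VarianceThreshold. The toy data and the 0.05 cutoff are illustrative assumptions, not recommended values.

```python
import numpy as np
from sklearn.feature_selection import VarianceThreshold
from sklearn.preprocessing import MinMaxScaler

# Toy dataset: the first two columns barely change, the third varies a lot.
X = np.array([
    [0.0, 2.1, 1.0],
    [0.1, 2.0, 3.0],
    [0.0, 2.2, 5.0],
    [0.1, 1.9, 7.0],
])

# Normalize first, because variance depends on scale.
X_scaled = MinMaxScaler().fit_transform(X)

# Drop features whose variance falls below the (illustrative) threshold.
selector = VarianceThreshold(threshold=0.05)
X_reduced = selector.fit_transform(X_scaled)

print(selector.variances_)     # per-feature variance after scaling
print(selector.get_support())  # boolean mask of the features that were kept
```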
Correlation Thresholds
Correlation thresholds remove features that are highly correlated with others
(i.e. their values change very similarly to another feature's). These features
provide redundant information.
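Here's a minimal sketch of correlation-threshold filtering with pandas. The toy columns and the 0.95 cutoff are illustrative assumptions; in practice you'd tune the cutoff to your problem.

```python
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "height_cm": [170, 180, 165, 175],
    "height_in": [66.9, 70.9, 65.0, 68.9],  # nearly identical to height_cm
    "weight_kg": [65, 80, 55, 72],
})

# Absolute pairwise correlations, upper triangle only (avoids double-counting).
corr = df.corr().abs()
upper = corr.where(np.triu(np.ones(corr.shape, dtype=bool), k=1))

# Drop one feature from each highly correlated pair.
to_drop = [col for col in upper.columns if (upper[col] > 0.95).any()]
df_reduced = df.drop(columns=to_drop)

print(to_drop)                          # e.g. ['height_in']
print(df_reduced.columns.tolist())      # remaining features
```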
Genetic Algorithm
Genetic algorithms (GAs) are a broad class of algorithms that can be adapted to
different purposes. They are search algorithms inspired by evolutionary
biology and natural selection, combining mutation and cross-over to efficiently
traverse large solution spaces.
In machine learning, GAs have two main uses. The first is for optimization, such
as finding the best weights for a neural network.
The second is for supervised feature selection. In this use case, "genes" represent
individual features and the "organism" represents a candidate set of features.
Each organism in the "population" is graded on a fitness score such as model
performance on a hold-out set. The fittest organisms survive and reproduce,
repeating until the population converges on a solution some generations later.
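The sketch below shows the idea from scratch with NumPy and scikit-learn. It is a simplified toy, not a production GA: the population size, mutation rate, number of generations, and the choice of a logistic regression pipeline as the fitness model are all illustrative assumptions.

```python
import numpy as np
from sklearn.datasets import load_breast_cancer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)
rng = np.random.default_rng(0)
n_features = X.shape[1]

def fitness(mask):
    # Fitness = cross-validated accuracy of a model trained on the selected features.
    if mask.sum() == 0:
        return 0.0
    model = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    return cross_val_score(model, X[:, mask], y, cv=3).mean()

# Each "organism" is a boolean mask over the features ("genes").
population = rng.random((20, n_features)) < 0.5

for generation in range(10):
    scores = np.array([fitness(mask) for mask in population])
    # Selection: the fittest half survives.
    survivors = population[np.argsort(scores)[-10:]]
    children = []
    for _ in range(10):
        # Cross-over: each child mixes genes from two random parents.
        p1, p2 = survivors[rng.integers(10, size=2)]
        crossover = rng.random(n_features) < 0.5
        child = np.where(crossover, p1, p2)
        # Mutation: flip a small fraction of genes.
        mutate = rng.random(n_features) < 0.05
        children.append(np.logical_xor(child, mutate))
    population = np.vstack([survivors, np.array(children)])

best = population[np.argmax([fitness(m) for m in population])]
print("Selected feature indices:", np.flatnonzero(best))
```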
Honorable Mention: Stepwise Search
Stepwise search is a supervised feature selection method based on sequential
search, and it has two flavours: forward and backward.
For forward stepwise search, you start without any features. Then, you'd train a
1-feature model using each of your candidate features and keep the version with
the best performance. You'd continue adding features, one at a time, until
your performance improvements stall.
Backward stepwise search is the same process, just reversed: start with all
features in your model and then remove one at a time until performance starts
to drop substantially.
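If you want to try this without writing the loop yourself, scikit-learn's SequentialFeatureSelector implements both flavours. In this sketch the estimator, the fixed number of features to keep, and the dataset are illustrative assumptions (the sketch stops at a preset count rather than detecting when improvements stall).

```python
from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

estimator = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))

selector = SequentialFeatureSelector(
    estimator,
    n_features_to_select=5,   # illustrative; tune for your problem
    direction="forward",      # use "backward" for backward stepwise search
    cv=3,
)
selector.fit(X, y)

print(selector.get_support(indices=True))  # indices of the selected features
X_reduced = selector.transform(X)
```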
Feature Extraction
Feature extraction is for creating a new, smaller set of features that still captures most of the useful information. Again, feature selection keeps a subset of the original features, while feature extraction creates new ones.
As with feature selection, some algorithms already have built-in feature extraction.
The best example is Deep Learning, which extracts increasingly useful representations
of the raw input data through each hidden neural layer.
As a stand-alone task, feature extraction can be unsupervised (e.g. PCA) or supervised
(e.g. LDA).
Principal Component Analysis (PCA)
Principal component analysis (PCA) is an unsupervised algorithm that creates
linear combinations of the original features. The new features are orthogonal,
which means that they are uncorrelated. Furthermore, they are ranked in order
of their "explained variance." The first principal component (PC1) explains the
most variance in your dataset, PC2 explains the second-most variance, and so on.
Therefore, you can reduce dimensionality by limiting the number of principal
components to keep based on cumulative explained variance.
You should always normalize your dataset before performing PCA because the
transformation is dependent on scale. If you don't, the features that are on the
largest scale would dominate your new principal components.
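Here's a minimal sketch with scikit-learn: standardize first, then keep enough components to reach a cumulative explained variance target. The dataset and the 95% target are illustrative assumptions.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X, _ = load_breast_cancer(return_X_y=True)

# Normalize first, because the PCA transformation is scale-dependent.
X_scaled = StandardScaler().fit_transform(X)

# Passing a float between 0 and 1 tells scikit-learn to keep the smallest
# number of components whose cumulative explained variance reaches that fraction.
pca = PCA(n_components=0.95)
X_reduced = pca.fit_transform(X_scaled)

print(X_reduced.shape)                        # (n_samples, components kept)
print(pca.explained_variance_ratio_.cumsum()) # cumulative explained variance
```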
Linear Discriminant Analysis (LDA)
Linear discriminant analysis (LDA) - not to be confused with latent Dirichlet
allocation - also creates linear combinations of your original features. However,
unlike PCA, LDA doesn't maximize explained variance. Instead, it maximizes
the separability between classes.
Therefore, LDA is a supervised method that can only be used with labeled data.
So which is better: LDA or PCA? Well, results will vary from problem to
problem, and the same "No Free Lunch" theorem applies.
The LDA transformation is also dependent on scale, so you should normalize
your dataset first.
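A minimal sketch with scikit-learn is below. Note that the fit requires the labels y, and that LDA can keep at most (number of classes - 1) components, so n_components=1 for this two-class dataset; the dataset itself is an illustrative assumption.

```python
from sklearn.datasets import load_breast_cancer
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.preprocessing import StandardScaler

X, y = load_breast_cancer(return_X_y=True)

# Normalize first, because the LDA transformation is also scale-dependent.
X_scaled = StandardScaler().fit_transform(X)

lda = LinearDiscriminantAnalysis(n_components=1)
X_reduced = lda.fit_transform(X_scaled, y)   # supervised: fitting uses y

print(X_reduced.shape)   # (n_samples, 1)
```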
Autoencoders
Autoencoders are neural networks that are trained to reconstruct their original
inputs. For example, image autoencoders are trained to reproduce the original
images instead of classifying the image as a dog or a cat.
So how is this helpful? Well, the key is to structure the hidden layer to have fewer
neurons than the input/output layers. Thus, that hidden layer will learn to
produce a smaller representation of the original image.
Because you use the input image as the target output, autoencoders are
considered unsupervised. They can be used directly (e.g. image compression) or
stacked in sequence (e.g. deep learning).
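Here's a minimal sketch of an undercomplete autoencoder in Keras. The random toy "images," the layer sizes, and the 32-dimensional bottleneck are illustrative assumptions.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Toy input: 1000 flattened 28x28 "images" with values in [0, 1].
X = np.random.rand(1000, 784).astype("float32")

inputs = keras.Input(shape=(784,))
encoded = layers.Dense(32, activation="relu")(inputs)      # bottleneck layer
decoded = layers.Dense(784, activation="sigmoid")(encoded) # reconstruction

autoencoder = keras.Model(inputs, decoded)
autoencoder.compile(optimizer="adam", loss="mse")

# The input is also the target, which is why this counts as unsupervised.
autoencoder.fit(X, X, epochs=5, batch_size=64, verbose=0)

# The encoder alone maps each input to its compressed 32-d representation.
encoder = keras.Model(inputs, encoded)
X_compressed = encoder.predict(X)
print(X_compressed.shape)   # (1000, 32)
```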
No Free Lunch Theorem
In machine learning, there’s something called the “No Free Lunch” theorem. In a
nutshell, it states that no one algorithm works best for every problem, and it’s especially
relevant for supervised learning (i.e. predictive modelling).
For example, you can’t say that neural networks are always better than decision trees or
vice-versa. There are many factors at play, such as the size and structure of your dataset.
As a result, you should try many different algorithms for your problem, while using
a hold-out “test set” of data to evaluate performance and select the winner.
Of course, the algorithms you try must be appropriate for your problem, which is where
picking the right machine learning task comes in. As an analogy, if you need to clean
your house, you might use a vacuum, a broom, or a mop, but you wouldn't bust out
a shovel and start digging.