Problem.:, I, I, I, I, I, I

The document outlines the process for performing Principal Component Analysis (PCA) using both eigendecomposition of the correlation matrix and singular value decomposition (SVD) on a numeric matrix without missing values. It details the input requirements, output expectations, and algorithm steps for both methods, including data normalization, matrix computation, and checks for validity. Additionally, it emphasizes the importance of testing the implementation against built-in functions and ensuring correctness through various checks.

Uploaded by

Shaikh Firdous

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

76 views2 pages

Problem.:, I, I, I, I, I, I

Uploaded by

Shaikh Firdous

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Problem. Write from-scratch code to perform principal component analysis on given data.

Use eigendecomposition of the correlation matrix for this purpose.

Input. X: n × p numeric matrix (rows: cases/samples, columns: variables/factors); without

any missing values.

Output. Suppose k = min(n, p).

1. Loadings/rotations: p × k matrix.

2. Principal components/scores: n × k matrix.

3. Standard deviations: k-vector.

Checks on input arguments. Valid values in the input arguments, no missing values, etc.
Treat end-cases such as n < 2 and p < 2 separately.

Algorithm.

1. Shift each column of X by its own mean; i.e., Y·,i = X·,i − Mean(X·,i ) for each column
i = 1, . . . , p. Scale each column of Y by its own standard error; i.e., Y·,i = Y·,i /SE(Y·,i ) for
each column i = 1, . . . , p.

2. Compute the p × p correlation matrix C = Y T Y /(n − 1). C should be symmetric, all 1s

on the diagonal, and all other elements between −1 and +1.

3. Compute the eigendecomposition of C using the in-built function eigen(). This gives a
p-vector of eigenvalues d and a p × p matrix V with eigenvectors as its columns. Formally,
the eigendecomposition is C = V T · diag(d) · V .

4. (a) Check if the eigenvalues d > 0. If not, take an appropriate course of action.
(b) Check if the eigenvalues d are in descending order. If not, then reorder d in descending
order. Reorder the columns of V to match the changed order.

5. Compute the output quantities:

(a) Rotation/loading matrix R is the matrix of first k columns of V .

(b) Scores/principal component matrix (with PCs as columns) is Y R.
(c) Standard deviations of the PCs are the square roots of first k eigenvalues in d.

Testing.

1. At each step of the algorithm, put appropriate checks that reflect the assumptions made
about the computed quantities.
2. Compare the results of your implementation with the output of the in-built function
princomp() applied to a standard data set such as USArrests or iris without the species
column. The simplest artificial test data set would be a 2-variable (X1 , X2 ) data set where
X2 = mX1 + c + ϵ where the noise ϵ is normal with mean 0 and standard deviation σ > 0.
3. Demonstrate that your code produces correct results.

1
Problem. Write from-scratch code to perform principal component analysis on given data.
Use singular value decomposition of the data matrix for this purpose.

Input. X: n × p numeric matrix (rows: cases/samples, columns: variables/factors); without

any missing values.

Output. Suppose k = min(n, p).

1. Loadings/rotations: p × k matrix.

2. Principal components/scores: n × k matrix.

3. Standard deviations: k-vector.

Checks on input arguments. Valid values in the input arguments, no missing values, etc.
Treat end-cases such as n < 2 and p < 2 separately.

Algorithm.

2. Compute the singular value decomposition of Y using the in-built function svd(). SVD
gives a k-vector of singular values d, a n × k matrix U (left singular vectors as columns),
and a p × k matrix V (right singular vectors as columns). Formally, the singular value
decomposition is Y = U · diag(d) · V T .

3. (a) Check if the singular values d > 0. If not, take an appropriate course of action.
(b) Check if the singular values d are in descending order. If not, then reorder d in
descending order. Reorder the columns of U and V to match the changed order.

4. Compute the output quantities:

(a) Rotation/loading matrix R is the matrix V .

(b) Scores/principal component matrix (with PCs as columns) is Y R.
√
(c) Standard deviations of the PCs are d/ n.

Testing.

MLSP Exp02
No ratings yet
MLSP Exp02
10 pages
Principal Component Analysis Steps
No ratings yet
Principal Component Analysis Steps
14 pages
Kinya Sharon - Ass2 - Machine Learning
No ratings yet
Kinya Sharon - Ass2 - Machine Learning
12 pages
Exercise 1 Instruction Pca
No ratings yet
Exercise 1 Instruction Pca
9 pages
L08 PrincipalComponentAnalysis
No ratings yet
L08 PrincipalComponentAnalysis
36 pages
Mathematical Approach To PCA
No ratings yet
Mathematical Approach To PCA
8 pages
5 Pca
No ratings yet
5 Pca
14 pages
PCA: Step-by-Step Guide to Dimensionality Reduction
No ratings yet
PCA: Step-by-Step Guide to Dimensionality Reduction
13 pages
Principal Component Analysis (PCA) Final
No ratings yet
Principal Component Analysis (PCA) Final
37 pages
MV - Principal Components Using SAS
No ratings yet
MV - Principal Components Using SAS
69 pages
MLSP Exp2
No ratings yet
MLSP Exp2
7 pages
Unit 3
No ratings yet
Unit 3
28 pages
Exp 15
No ratings yet
Exp 15
12 pages
Aim: Theory: Experiment 3
No ratings yet
Aim: Theory: Experiment 3
3 pages
Principal Components Analysis (PCA) : 2.1 Outline of Technique
No ratings yet
Principal Components Analysis (PCA) : 2.1 Outline of Technique
21 pages
PCA Steps - Numerical Problem
No ratings yet
PCA Steps - Numerical Problem
8 pages
Lecture FPCA
No ratings yet
Lecture FPCA
67 pages
Principal Component Analysis - A Numerical Approach
No ratings yet
Principal Component Analysis - A Numerical Approach
8 pages
Steps For PCA
No ratings yet
Steps For PCA
5 pages
1-Python Algebra Maths
No ratings yet
1-Python Algebra Maths
26 pages
ACPusing R
No ratings yet
ACPusing R
25 pages
Pca
No ratings yet
Pca
16 pages
Dimensionality Reduction Using PCA (Principal Component Analysis)
No ratings yet
Dimensionality Reduction Using PCA (Principal Component Analysis)
13 pages
Remote Sensing Assignment
No ratings yet
Remote Sensing Assignment
10 pages
5 Pca
No ratings yet
5 Pca
33 pages
Chapter2 PCA
No ratings yet
Chapter2 PCA
65 pages
AMC3 (1) - Merged
No ratings yet
AMC3 (1) - Merged
11 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
34 pages
PCA Practice Problems & Solutions
No ratings yet
PCA Practice Problems & Solutions
6 pages
PCA Guide and R Implementation
No ratings yet
PCA Guide and R Implementation
11 pages
Singular Value Decomposition (SVD) / Principal Components Analysis (Pca)
No ratings yet
Singular Value Decomposition (SVD) / Principal Components Analysis (Pca)
31 pages
Week 9 Lecture - Revision Test-Dual-Translated
No ratings yet
Week 9 Lecture - Revision Test-Dual-Translated
92 pages
Factor Analysis
No ratings yet
Factor Analysis
57 pages
Deep Learning Unit 2
No ratings yet
Deep Learning Unit 2
79 pages
AML Non Evaluative Assignment 2
No ratings yet
AML Non Evaluative Assignment 2
2 pages
09 Pca
No ratings yet
09 Pca
22 pages
The Mathematics Behind Principal Component Analysis
No ratings yet
The Mathematics Behind Principal Component Analysis
9 pages
PCA Principle Component Analysis
No ratings yet
PCA Principle Component Analysis
10 pages
AMC3
No ratings yet
AMC3
8 pages
DR Pca
No ratings yet
DR Pca
22 pages
09 Pca
No ratings yet
09 Pca
19 pages
09 Pca
No ratings yet
09 Pca
19 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
9 pages
Education - Post 12th Standard - CSV
88% (16)
Education - Post 12th Standard - CSV
11 pages
PCA With An Example
No ratings yet
PCA With An Example
7 pages
Principle Component Analysis
No ratings yet
Principle Component Analysis
7 pages
Importing Libraries Used in This Chapter
No ratings yet
Importing Libraries Used in This Chapter
8 pages
PCA for Biologists and Researchers
No ratings yet
PCA for Biologists and Researchers
29 pages
PCA for Data Science Students
No ratings yet
PCA for Data Science Students
30 pages
PCA Concepts and Techniques
No ratings yet
PCA Concepts and Techniques
16 pages
Steps Involved in The PCA: Dataset Matrix
No ratings yet
Steps Involved in The PCA: Dataset Matrix
4 pages
6 Principal Component Analysis
No ratings yet
6 Principal Component Analysis
7 pages
Lecture 9 - Data Reduction
No ratings yet
Lecture 9 - Data Reduction
36 pages
Presentation A I STD 2
No ratings yet
Presentation A I STD 2
63 pages
Principal Component Analysis: Random Vector
No ratings yet
Principal Component Analysis: Random Vector
20 pages
Principal Component Analysis
No ratings yet
Principal Component Analysis
10 pages
Find Your Audience and Understand Your Customers
No ratings yet
Find Your Audience and Understand Your Customers
2 pages
Define Your Marketing Goals
No ratings yet
Define Your Marketing Goals
1 page
How To Set SMART Goals
No ratings yet
How To Set SMART Goals
1 page
The Traditional Marketing Funnel To DM Funnel
No ratings yet
The Traditional Marketing Funnel To DM Funnel
5 pages
Elements of A Digital Mareting Strategy
No ratings yet
Elements of A Digital Mareting Strategy
1 page
Glossary
No ratings yet
Glossary
3 pages
The Top of Funnel
No ratings yet
The Top of Funnel
1 page
Measuring Success at The Top of The Funnel
No ratings yet
Measuring Success at The Top of The Funnel
1 page
E Commerce
No ratings yet
E Commerce
2 pages
What Do Digital Marketing and E-Commerce Specialist Do
No ratings yet
What Do Digital Marketing and E-Commerce Specialist Do
2 pages
Agency Roles vs. In-House Roles
No ratings yet
Agency Roles vs. In-House Roles
1 page
Another Dummy Bank Statement
No ratings yet
Another Dummy Bank Statement
1 page
Compiler Construction
100% (1)
Compiler Construction
305 pages
Astrology Insights for Beginners
No ratings yet
Astrology Insights for Beginners
13 pages
Financial Stability Report - NOV2024
No ratings yet
Financial Stability Report - NOV2024
73 pages
Coaching and Mentoring Handbook and List
No ratings yet
Coaching and Mentoring Handbook and List
7 pages
United States v. Lopez-Carreon, 10th Cir. (2000)
No ratings yet
United States v. Lopez-Carreon, 10th Cir. (2000)
4 pages
Conscious Oracle Card Booklet
No ratings yet
Conscious Oracle Card Booklet
44 pages
İlk Gün Formu Son Hali
No ratings yet
İlk Gün Formu Son Hali
2 pages
Eimeria 1
No ratings yet
Eimeria 1
11 pages
National Senior Certificate: Grade 12
No ratings yet
National Senior Certificate: Grade 12
19 pages
Under - The - Moon AK
No ratings yet
Under - The - Moon AK
10 pages
2022年港澳杯初赛 P1
No ratings yet
2022年港澳杯初赛 P1
4 pages
Laboratory #4: Control Charts For Variable Data (X-Bar and R) Purpose: Materials
No ratings yet
Laboratory #4: Control Charts For Variable Data (X-Bar and R) Purpose: Materials
7 pages
Mini M-70 Series Industrial Spray Nozzle: Specifications
No ratings yet
Mini M-70 Series Industrial Spray Nozzle: Specifications
2 pages
CV - Ilaha Asadova
No ratings yet
CV - Ilaha Asadova
1 page
Holcim Compensation Report 2023
No ratings yet
Holcim Compensation Report 2023
32 pages
Ethics Presentation 1
No ratings yet
Ethics Presentation 1
16 pages
CAE - Multiple Choice Vocabulary
No ratings yet
CAE - Multiple Choice Vocabulary
5 pages
MID 039 - CID 0171 - FMI 04: Troubleshooting
No ratings yet
MID 039 - CID 0171 - FMI 04: Troubleshooting
4 pages
Cookery Week 6-Day2
100% (2)
Cookery Week 6-Day2
3 pages
The Art of Immutable Architecture Theory and Practice of Data Management in Distributed Systems 2nd Edition Michael L Perry Download
No ratings yet
The Art of Immutable Architecture Theory and Practice of Data Management in Distributed Systems 2nd Edition Michael L Perry Download
32 pages
RD Sharma Class 11 Maths Solutions
No ratings yet
RD Sharma Class 11 Maths Solutions
20 pages
Time Study Method Implementation in Manufacturing Industry Nor Diana Hashim TS183.N67 2008 - 24 Pages
No ratings yet
Time Study Method Implementation in Manufacturing Industry Nor Diana Hashim TS183.N67 2008 - 24 Pages
24 pages
Instalación, Operación y Mantenimiento Polipasto LX1 LX3
No ratings yet
Instalación, Operación y Mantenimiento Polipasto LX1 LX3
68 pages
FM - MCQ - Part 2
No ratings yet
FM - MCQ - Part 2
7 pages
Aquaculture's Role in the Philippines
No ratings yet
Aquaculture's Role in the Philippines
50 pages
HSN Code
No ratings yet
HSN Code
6 pages
Highwoods Primary School Overview
No ratings yet
Highwoods Primary School Overview
18 pages
Distributor Agreement Terms
No ratings yet
Distributor Agreement Terms
4 pages
Volvo 47705930 - Us
71% (7)
Volvo 47705930 - Us
234 pages
Coding & Career Tips
No ratings yet
Coding & Career Tips
15 pages
Grease Programme
No ratings yet
Grease Programme
2 pages