Aiml Exp 5 Viva

SPPU Mechanical Engineering

Created by @vaibhavpandit_tele

4.5 Feature Selection and PCA
Basic Fundamentals

1. What is the goal of dimensionality reduction?


To reduce the number of variables in a dataset while preserving as much relevant information as possible,
simplifying analysis and improving model performance.

2. List three advantages of PCA.

• Removes correlated features.

• Enhances algorithm performance by reducing dimensionality.

• Enables better visualization of high-dimensional data.

3. How does a low-variance filter work?


It removes features with variance below a set threshold, assuming low-variance features carry little
information.
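A minimal sketch of this filter using scikit-learn's VarianceThreshold (the toy matrix and the 0.01 threshold are illustrative assumptions):

import numpy as np
from sklearn.feature_selection import VarianceThreshold

# toy data: column 0 is constant, columns 1 and 2 vary
X = np.array([[0.0, 1.0, 0.1],
              [0.0, 2.0, 0.5],
              [0.0, 3.0, 0.9],
              [0.0, 4.0, 0.3]])

selector = VarianceThreshold(threshold=0.01)   # drop features with variance below 0.01
X_reduced = selector.fit_transform(X)

print(selector.variances_)   # per-feature variances estimated from the data
print(X_reduced.shape)       # (4, 2): the constant column has been removed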

4. What is the interpretation of principal components?


Principal components are new uncorrelated variables formed as linear combinations of original features
that capture maximum variance in the data.

5. How does correlation-based feature selection work?


It selects features highly correlated with the target variable but uncorrelated with each other to reduce
redundancy and improve predictive power.

6. Why is PCA sensitive to feature scaling?


Because PCA relies on variance, features with larger scales dominate the principal components unless
the data is standardized.
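A short sketch of this effect (the synthetic two-feature data is an illustrative assumption):

import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)
X = np.column_stack([rng.normal(0, 1, 200),       # small-scale feature
                     rng.normal(0, 1000, 200)])   # large-scale feature

print(PCA(n_components=2).fit(X).explained_variance_ratio_)
# without scaling, PC1 is dominated almost entirely by the large-scale feature

X_std = StandardScaler().fit_transform(X)
print(PCA(n_components=2).fit(X_std).explained_variance_ratio_)
# after standardization the two (uncorrelated) features contribute roughly equally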

7. What is the scree plot used for in PCA?


To visualize the eigenvalues of principal components and help decide how many components to retain.
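A minimal scree-plot sketch (the Iris dataset is only an illustrative choice):

import matplotlib.pyplot as plt
from sklearn.datasets import load_iris
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = StandardScaler().fit_transform(load_iris().data)
pca = PCA().fit(X)

plt.plot(range(1, len(pca.explained_variance_) + 1),
         pca.explained_variance_, marker='o')
plt.xlabel('Principal component')
plt.ylabel('Eigenvalue (explained variance)')
plt.title('Scree plot')
plt.show()   # look for the "elbow" where the eigenvalues level off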

8. Define "eigenvalue" in the context of PCA.


An eigenvalue represents the amount of variance captured by its corresponding principal component.
9. How does LDA differ from PCA?
LDA is supervised and maximizes class separability, while PCA is unsupervised and maximizes variance
without considering class labels.

10. What is the Kaiser criterion for component selection?


Retain principal components with eigenvalues greater than 1 (when PCA is performed on standardized
data), as each such component explains more variance than a single original variable.

Medium Level

11. How would you determine the optimal number of principal components?
By analyzing the scree plot, cumulative explained variance (e.g., 90%), or using cross-validation to balance
dimensionality and accuracy.
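A sketch of the cumulative-variance approach (the 90% target and the digits dataset are illustrative assumptions):

import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

X = StandardScaler().fit_transform(load_digits().data)
pca = PCA().fit(X)

cumulative = np.cumsum(pca.explained_variance_ratio_)
n_components = int(np.argmax(cumulative >= 0.90)) + 1   # first component count reaching 90%
print(n_components, cumulative[n_components - 1])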

12. Compare forward selection and backward elimination for feature selection.

• Forward selection starts with no features and adds them iteratively based on improvement.

• Backward elimination starts with all features and removes the least significant iteratively (a sketch of both appears below).
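A sketch of both directions with scikit-learn's SequentialFeatureSelector (the estimator, dataset, and target of 5 features are illustrative assumptions; the backward run is slower because it starts from all features):

from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SequentialFeatureSelector
from sklearn.linear_model import LogisticRegression

X, y = load_breast_cancer(return_X_y=True)
model = LogisticRegression(max_iter=5000)

forward = SequentialFeatureSelector(model, n_features_to_select=5,
                                    direction='forward').fit(X, y)
backward = SequentialFeatureSelector(model, n_features_to_select=5,
                                     direction='backward').fit(X, y)

print(forward.get_support(indices=True))    # indices added one at a time
print(backward.get_support(indices=True))   # indices surviving the eliminations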

13. Explain how to apply PCA for image compression.


Flatten images, apply PCA to reduce dimensionality, store only top components, and reconstruct images
from these components to save space.
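A sketch on the built-in 8x8 digit images (keeping 16 of 64 components is an illustrative choice):

import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data                 # each row is a flattened 8x8 image (64 pixels)
pca = PCA(n_components=16).fit(X)      # keep 16 of the 64 possible components

compressed = pca.transform(X)          # only 16 numbers stored per image
reconstructed = pca.inverse_transform(compressed)

print(compressed.shape)                    # (1797, 16)
print(np.mean((X - reconstructed) ** 2))   # mean squared reconstruction error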

14. What are the limitations of PCA for non-linear data?


PCA cannot capture non-linear relationships as it is a linear method, leading to poor representation of
complex data structures.

15. How does multicollinearity impact feature selection?


Highly correlated features can cause redundancy and instability in models, making feature selection
necessary to remove them.

16. Propose a method to validate selected features using cross-validation.


Split data into folds, train models on selected features in training folds, and evaluate performance on
validation folds to ensure generalization.
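A sketch using a Pipeline so that feature selection is re-run inside each fold and never sees the validation data (the selector, estimator, and k=10 are illustrative assumptions):

from sklearn.datasets import load_breast_cancer
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_score
from sklearn.pipeline import Pipeline

X, y = load_breast_cancer(return_X_y=True)

pipe = Pipeline([('select', SelectKBest(f_classif, k=10)),       # selection step
                 ('model', LogisticRegression(max_iter=5000))])  # estimator step

scores = cross_val_score(pipe, X, y, cv=5)   # selection happens per training fold
print(scores.mean(), scores.std())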

17. How can PCA be used for noise reduction?


By discarding components with low eigenvalues, which mostly capture noise, and reconstructing the data
from the leading components that represent the signal.
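A sketch of PCA denoising (the noise level and the 20 retained components are illustrative assumptions):

import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = load_digits().data
X_noisy = X + rng.normal(0, 4.0, X.shape)    # add Gaussian pixel noise

pca = PCA(n_components=20).fit(X_noisy)      # keep only the leading components
X_denoised = pca.inverse_transform(pca.transform(X_noisy))

print(np.mean((X - X_noisy) ** 2))       # error of the noisy images
print(np.mean((X - X_denoised) ** 2))    # error after reconstruction; typically lower,
                                         # since the dropped components carry mostly noise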
18. What are the assumptions of ICA compared to PCA?
ICA assumes statistical independence of source signals and non-Gaussianity, unlike PCA which assumes
orthogonality and maximizes variance.

19. How does t-SNE address PCA's limitations for visualization?


t-SNE captures non-linear relationships and preserves local structure, providing better visualization of
complex data clusters.

20. Explain the role of singular value decomposition (SVD) in PCA.


SVD decomposes the data matrix into singular vectors and values, facilitating efficient computation of
principal components.
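A sketch showing the correspondence (the random matrix is an illustrative assumption): the right singular vectors of the centred data are the principal components, and the squared singular values divided by (n - 1) are the eigenvalues.

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))
Xc = X - X.mean(axis=0)                 # centre the data

U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
eigenvalues = S ** 2 / (len(X) - 1)     # variance captured by each component

pca = PCA().fit(X)
print(np.allclose(eigenvalues, pca.explained_variance_))   # True
print(np.allclose(np.abs(Vt), np.abs(pca.components_)))    # True (up to sign flips)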

Hard Level

21. Derive the relationship between covariance matrix and PCA components.
PCA components are eigenvectors of the covariance matrix; eigenvectors define directions of maximum
variance, eigenvalues quantify variance along those directions.
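A numerical check of this relationship (the 2-D Gaussian sample is an illustrative assumption):

import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(1)
X = rng.multivariate_normal([0, 0], [[3.0, 1.0], [1.0, 2.0]], size=500)

cov = np.cov(X, rowvar=False)                     # sample covariance matrix
eigenvalues, eigenvectors = np.linalg.eigh(cov)   # eigh returns ascending order
order = np.argsort(eigenvalues)[::-1]             # re-sort descending

pca = PCA().fit(X)
print(np.allclose(eigenvalues[order], pca.explained_variance_))   # True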

22. Design a hybrid dimensionality reduction method combining PCA and t-SNE.
First apply PCA to reduce dimensionality and noise, then apply t-SNE on the PCA output for detailed
non-linear visualization.
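A sketch of the hybrid pipeline (the 50-component intermediate size and the digits dataset are illustrative assumptions):

from sklearn.datasets import load_digits
from sklearn.decomposition import PCA
from sklearn.manifold import TSNE

X = load_digits().data

X_pca = PCA(n_components=50, random_state=0).fit_transform(X)   # linear pre-reduction
X_2d = TSNE(n_components=2, perplexity=30,
            random_state=0).fit_transform(X_pca)                # non-linear embedding

print(X_2d.shape)   # (1797, 2), ready for a scatter plot coloured by digit label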

23. How would you apply PCA to streaming data with concept drift?
Use incremental PCA algorithms that update components as new data arrives, adapting to changes in data
distribution over time.
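A sketch of the incremental update loop with scikit-learn's IncrementalPCA (batch size, feature count, and component count are illustrative assumptions):

import numpy as np
from sklearn.decomposition import IncrementalPCA

rng = np.random.default_rng(0)
ipca = IncrementalPCA(n_components=10)

for _ in range(20):                        # simulate a stream of 20 batches
    batch = rng.normal(size=(100, 50))     # 100 new samples, 50 features
    ipca.partial_fit(batch)                # update the components with this batch

print(ipca.explained_variance_ratio_[:3])

Note that IncrementalPCA weights all past batches equally, so under strong concept drift the components may also need periodic re-fitting on recent data.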

24. Critique the interpretability challenges of PCA-transformed features.


Principal components are linear combinations of original features, making it difficult to attribute specific
meanings to them.

25. Propose a method to integrate domain knowledge into automated feature selection.
Incorporate expert-defined feature importance as priors or constraints within feature selection
algorithms.

26. Analyze the failure modes of PCA for categorical data.


PCA assumes numeric continuous data and linear relationships; it fails to handle categorical variables
properly without encoding.
27. How can reinforcement learning optimize feature selection pipelines?
Model feature selection as sequential decisions by an RL agent, rewarding selections that improve model
performance.

28. Evaluate the computational complexity of incremental PCA.


Incremental PCA updates components with new data batches, reducing complexity compared to batch
PCA but still dependent on data size and component count.

29. Design a metric to quantify information loss in dimensionality reduction.


Use reconstruction error or the proportion of variance not explained by retained components.
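A sketch of both variants of the metric (the digits dataset and 10 components are illustrative assumptions):

import numpy as np
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

X = load_digits().data
pca = PCA(n_components=10).fit(X)

X_hat = pca.inverse_transform(pca.transform(X))   # project down, then back up
reconstruction_error = np.mean((X - X_hat) ** 2)
unexplained_variance = 1.0 - pca.explained_variance_ratio_.sum()

print(reconstruction_error, unexplained_variance)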

30. How does kernel PCA extend traditional PCA for non-linear data?
Kernel PCA applies PCA in a high-dimensional feature space via kernel functions, capturing non-linear
structures in the original data.
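A sketch on the classic concentric-circles example (the RBF kernel and gamma=10 are illustrative assumptions):

from sklearn.datasets import make_circles
from sklearn.decomposition import KernelPCA, PCA

X, y = make_circles(n_samples=400, factor=0.3, noise=0.05, random_state=0)

X_linear = PCA(n_components=2).fit_transform(X)     # the two rings stay nested
X_kernel = KernelPCA(n_components=2, kernel='rbf',
                     gamma=10).fit_transform(X)     # the rings separate in the new space

print(X_linear.shape, X_kernel.shape)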

This completes your clean Q&A set on Feature Selection and PCA for effective viva preparation.
