EECE 5642
Data Visualization
Dimensionality Reduction
Y. Raymond Fu
Professor
Electrical and Computer Engineering (ECE), COE
Khoury College of Computer Sciences (KCCS)
Northeastern University
Attribute Dimensions and Orders
• Dimensions
– 1D: scalar
– 2D: two-dimensional vector
– 3D: three-dimensional vector
– >3D: multi-dimensional vector
• Orders
– scalars
– vectors (1st order)
– matrices (2nd order)
– tensors (higher order)
Data Table
www.many-eyes.com
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Univariate Data Representations
MATLAB Box Plot
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Original figures were from the slides of Stasko
Bivariate Data Representations
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Original figures were from the slides of Stasko
Trivariate Data Representations
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Original figures were from the slides of Stasko
Multi-Dimensional Data
Courtesy of Prof. Hanspeter Pfister, Harvard University.
Original figures were from the slides of Stasko
Multi-Dimensional Data Visualization
https://www.youtube.com/watch?v=wvsE8jm1GzE
What if the dimension of the data is 4, 5, 6, or even higher?
A world of high-dimensional measurements!
Dimensionality Reduction (DR)
High Dimensional Data
• Multimedia
– High-resolution images
– High-resolution videos
– Data from multiple sensors
• Bioinformatics
– Expressions of genes
– Neurons
• Social networks
– Tweets/likes/friendships
– Other interactions
• Weather and climate
– Multiple measurements (e.g., temperature)
– Time series data
• Finance
– Stock markets
– Time series data
Motivation and Goal of DR
• Reduce the degree of freedom in measurements
Replace a large set of measured variables with a small set of more
“condensed” variables
Simpler models are more robust on small datasets
• Reduce the computational load
By reducing the dimensionality of the data, the computational burden
(time and space) can be greatly decreased.
• Visualization
“Looking at the data”—more interpretable; simpler explanations
Make sense of the data before processing
Goal
• Extract information hidden in the data
Detect which variables are relevant for a specific task and how they interact
with each other. Reformulate the data with fewer variables.
Samuel Kaski, Jaakko Peltonen: Dimensionality Reduction for Data Visualization [Applications Corner]. IEEE Signal Process. Mag. 28(2): 100-104 (2011)
Motivation and Goal of DR
This is easier to interpret … than this
Courtesy of Prof. Jaakko Peltonen, Aalto University.
Feature Selection vs. Feature Extraction
Given a data set X consisting of n samples, where each sample has
dimension d.
• Feature Selection
Choose k important features (k < d), ignoring the remaining d – k.
Example: microarray data analysis
• Feature Extraction
Transform the original data set X from the d-dimensional space to a
k-dimensional space (k < d).
A general formulation: Y = P^T X, where X ∈ R^d and Y ∈ R^k.
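To make the projection concrete, here is a minimal NumPy sketch of Y = P^T X; the orthonormal matrix P is just a random placeholder for a basis that a DR method (e.g., PCA or LDA) would actually learn, and all names and sizes are illustrative.

```python
import numpy as np

# Minimal sketch of feature extraction as a linear projection Y = P^T X.
# P is a d x k matrix with orthonormal columns; here it is random (via QR),
# standing in for a basis learned by a DR method such as PCA or LDA.
rng = np.random.default_rng(0)
d, k, n = 64, 2, 500

X = rng.normal(size=(d, n))                   # data matrix: one sample per column, in R^d
P, _ = np.linalg.qr(rng.normal(size=(d, k)))  # random orthonormal basis (placeholder)

Y = P.T @ X                                   # projected data: one sample per column, in R^k
print(X.shape, Y.shape)                       # (64, 500) (2, 500)
```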
Statistics & Linear Algebra Background
• Given a set of n data points {X_k} in R^d
– The mean is E{x}
– The variance is Var{x}
– The covariance between two variables x and y is Cov{x, y} = E{(x − E{x})(y − E{y})}
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
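A short NumPy sketch of these empirical quantities (toy data and illustrative variable names):

```python
import numpy as np

# Sketch: empirical mean, variance, and covariance for d-dimensional samples.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))            # 200 samples in R^3, one sample per row

mean = X.mean(axis=0)                    # E{x}, per dimension
var = X.var(axis=0, ddof=1)              # Var{x}, per dimension (unbiased estimate)
cov = np.cov(X, rowvar=False)            # d x d covariance matrix
print(mean.shape, var.shape, cov.shape)  # (3,) (3,) (3, 3)
```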
Eigenvectors
• For the transform Y = AX, if there exists a vector e and a scalar λ such that Ae = λe, then
• e = [e1, e2, e3]^T is an eigenvector and λ is the eigenvalue associated
with this eigenvector.
• Under the transform A, e is only rescaled; its direction is unchanged.
• Example (a numerical sketch follows below)
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
http://en.wikipedia.org/wiki/Eigenvalue,_eigenvector_and_eigenspace
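As a quick numerical check of the eigenvector property, here is a sketch with an arbitrary 3 × 3 matrix (not tied to any particular example on the slide):

```python
import numpy as np

# Sketch: verify A e = lambda e for one eigenpair of a small matrix.
A = np.array([[2.0, 0.0, 0.0],
              [0.0, 3.0, 4.0],
              [0.0, 4.0, 9.0]])

eigvals, eigvecs = np.linalg.eig(A)   # columns of eigvecs are eigenvectors
e, lam = eigvecs[:, 0], eigvals[0]

# Applying A to e only rescales it by lambda (up to floating-point error).
print(np.allclose(A @ e, lam * e))    # True
```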
Dimensionality Reduction Methods
• Linear Methods
– Principal Component Analysis (PCA), M.A. Turk & A.P. Pentland
– Multidimensional Scaling (MDS), T.F. Cox and M.A.A. Cox
– Locality Preserving Projections (LPP), X.F. He, S.C. Yan, Y.X. Hu
– Locality Pursuit Embedding (LPE), W.L. Min, K. Lu, and X.F. He
– Locally Embedded Analysis (LEA), Y. Fu and T.S. Huang
• Nonlinear Methods
– Locally Linear Embedding (LLE), S.T. Roweis & L.K. Saul
– Laplacian Eigenmaps, M. Belkin & P. Niyogi
– Isomap, J.B. Tenenbaum, V. de Silva, and J.C. Langford
– Hessian LLE, D.L. Donoho & C.E. Grimes
– Semidefinite Embedding (SDE), K.Q. Weinberger & L.K. Saul
• Fisher Graph Methods
– Linear Discriminant Analysis (LDA), R.A. Fisher
– Marginal Fisher Analysis (MFA), S.C. Yan, et al.
– Local Discriminant Embedding (LDE), H.-T. Chen, et al.
– Discriminant Simplex Analysis (DSA), Y. Fu and T.S. Huang
– Correlation Embedding Analysis (CEA), Y. Fu and T.S. Huang
Parametric vs. Nonparametric Learning
• Parametric Model
– Use a parameterized family of probability distributions to describe the nature
of a set of data (Moghaddam & Pentland, 1997).
– The data distribution is empirically assumed or estimated.
– Learning is conducted by measuring a set of fixed parameters, such as mean
and variance.
– Effective for large samples, but degrades for complicated data distributions.
• Nonparametric Model
– Distribution free.
– Learning is conducted by measuring the pair-wise data relationship in both
global and local manners.
– Effective and robust due to the reliance on fewer assumptions and parameters.
– Works for cases with small samples, high dimensionality, and complicated data
distributions.
Parametric Model
• Principal Component Analysis (PCA) and Linear Discriminant
Analysis (LDA)
• PCA captures the “principal” variations in the data
• It is computed by finding the eigenvectors of the covariance
matrix of the data
• Geometrically, PCA finds the directions of largest variation in
the underlying data
• Can be applied in data compression, pattern recognition, etc.
• Find a line going through the
data mean and along the direction
of maximum variation of the data.
• Assuming zero mean, the line is
represented as y = w^T x, where
w is the basis vector and w^T w = 1.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
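A minimal PCA sketch, assuming centered (zero-mean) data and using the eigendecomposition of the covariance matrix; this is illustrative code, not the exact implementation referenced on the slides.

```python
import numpy as np

# Minimal PCA sketch: center the data, eigendecompose the covariance matrix,
# and project onto the top-k eigenvectors (directions of largest variation).
def pca(X, k):
    """X: n x d data matrix (one sample per row). Returns the n x k projection and the d x k basis W."""
    Xc = X - X.mean(axis=0)               # center so the zero-mean assumption holds
    C = np.cov(Xc, rowvar=False)          # d x d covariance matrix
    eigvals, eigvecs = np.linalg.eigh(C)  # ascending eigenvalues (symmetric matrix)
    W = eigvecs[:, ::-1][:, :k]           # top-k principal directions, each with w^T w = 1
    return Xc @ W, W

rng = np.random.default_rng(0)
X = rng.normal(size=(300, 10)) @ rng.normal(size=(10, 10))  # correlated toy data
Y, W = pca(X, k=2)
print(Y.shape, W.shape)                   # (300, 2) (10, 2)
```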
Principal Component Analysis
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Principal Component Analysis
Courtesy of Prof. Jaakko Peltonen, Aalto University.
Principal Component Analysis
• OptDigits Dataset
The data set contains 5,620 instances of digitized handwritten digits in
the range 0–9.
Each digit is a vector in R^64: 8 × 8 = 64 pixels.
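A hedged sketch of this kind of visualization using scikit-learn, which ships a 1,797-sample subset of the UCI OptDigits data rather than all 5,620 instances; each sample is still an 8 × 8 image flattened to R^64.

```python
import matplotlib.pyplot as plt
from sklearn.datasets import load_digits
from sklearn.decomposition import PCA

# Sketch: project 8x8 handwritten digits (R^64) to 2D with PCA for visualization.
digits = load_digits()                               # 1,797-sample subset of OptDigits
Y = PCA(n_components=2).fit_transform(digits.data)

plt.scatter(Y[:, 0], Y[:, 1], c=digits.target, cmap="tab10", s=8)
plt.colorbar(label="digit class")
plt.xlabel("PC 1")
plt.ylabel("PC 2")
plt.show()
```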
Principal Component Analysis
Courtesy of Prof. Jaakko Peltonen, Aalto University.
Principal Component Analysis
Eigenvectors visualized as eigenfaces
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Linear Discriminant Analysis
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Linear Discriminant Analysis
• Unlike PCA, LDA finds a discriminant subspace by including class label information
in the subspace modeling (supervised learning).
– Compute the within-class scatter
– Compute the between-class scatter
– Maximize the between-class scatter while minimizing the within-class scatter
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
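A minimal two-class LDA sketch, assuming the classic closed form w ∝ Sw^{-1}(m1 − m2); the toy data and variable names are illustrative only.

```python
import numpy as np

# Sketch: two-class LDA / Fisher discriminant. Maximize between-class scatter
# relative to within-class scatter; the solution is w ~ Sw^{-1} (m1 - m2).
def lda_two_class(X1, X2):
    m1, m2 = X1.mean(axis=0), X2.mean(axis=0)
    # Within-class scatter: sum of the (unnormalized) class scatter matrices.
    Sw = np.cov(X1, rowvar=False) * (len(X1) - 1) + np.cov(X2, rowvar=False) * (len(X2) - 1)
    w = np.linalg.solve(Sw, m1 - m2)      # discriminant (projection) direction
    return w / np.linalg.norm(w)

rng = np.random.default_rng(0)
X1 = rng.normal(loc=[0.0, 0.0], size=(100, 2))
X2 = rng.normal(loc=[3.0, 1.0], size=(100, 2))
print(lda_two_class(X1, X2))              # unit-norm projection direction
```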
LDA Definition
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
LDA Two-Class Case
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
LDA Multiple-Class Case
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Different Subspace Base Vectors
• Different subspace basis vectors give
different projection directions
• Each subspace basis vector w forms a Fisherface
PCA vs. LDA
Digits data after PCA Digits data after LDA
Courtesy of Prof. Jaakko Peltonen, Aalto University.
PCA vs. LDA
PCA LDA
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
PCA vs. LDA
PCA LDA
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
PCA vs. LDA
• PCA performs worse under this condition
• LDA (FLD, Fisher Linear Discriminant) provides a better low-dimensional representation.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
When LDA Fails
• LDA fails in the right figure (v1 is the projection
direction). Think about why.
Courtesy of Prof. Zhu Li, Hong Kong Polytechnic University.
Criteria of Nonparametric Model
• Manifold Learning: effective to model sample distributions
• Fisher Graph: effective to classify different classes
• Similarity Metric: effective to measure sample distances
• High-Order Data Structure: effective to describe intrinsic data structures
Manifold
• “A manifold is an abstract mathematical space in which every
point has a neighborhood which resembles Euclidean space,
but in which the global structure may be more complicated.”
--- from Wikipedia
• “A manifold is a topological space that is locally Euclidean.” ---
from MathWorld
• e.g., a 2D map of the 3D Earth is a manifold.
• A manifold representation can be obtained by projecting the original
data to a low-dimensional representation via subspace learning.
• The manifold criterion can provide more effective ways to model
the data distribution than conventional learning methods
based on the Gaussian assumption.
Manifold
http://en.wikipedia.org/wiki/Manifold
Manifold Learning
Swiss Roll → Dimensionality Reduction
Courtesy of Sam T. Roweis and Lawrence K. Saul, Science 2000
Locally Linear Embedding
http://www.cs.toronto.edu/~roweis/lle/
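A short sketch of LLE on a synthetic Swiss roll using scikit-learn; the neighborhood size k = 12 is an arbitrary illustrative choice.

```python
import matplotlib.pyplot as plt
from sklearn.datasets import make_swiss_roll
from sklearn.manifold import LocallyLinearEmbedding

# Sketch: "unroll" a 3D Swiss-roll point cloud into 2D with LLE.
X, t = make_swiss_roll(n_samples=1500, random_state=0)   # t = position along the roll
Y = LocallyLinearEmbedding(n_neighbors=12, n_components=2).fit_transform(X)

plt.scatter(Y[:, 0], Y[:, 1], c=t, cmap="viridis", s=8)  # color by position along the roll
plt.title("Swiss roll unrolled by LLE")
plt.show()
```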
LEA for Pose Manifold
Linear embedding and subspace projection of 400 rotating teapot images. The number of nearest neighbors is k = 6.
Yun Fu, et al., “Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval”, CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Expression Manifold
Manifold visualization of 1,965 Frey’s face images by LEA using k = 6 nearest neighbors.
Yun Fu, et al., “Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval”, CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Emotion State Manifold
Manifold visualization for 11,627 AAI sequence images of a male subject using the LLE algorithm. (a) A video frame
snapshot and the 3D face tracking result. The yellow mesh visualizes the geometric motion of the face. (b) Manifold
visualization with k=5 nearest neighbors. (c) k=8 nearest neighbors. (d) k=15 nearest neighbors and labeling results.
Yun Fu, et al., “Locally Adaptive Subspace and Similarity Metric Learning for Visual Clustering and Retrieval”, CVIU, Vol. 110, No. 3, pp. 390-402, 2008.
LEA for Head Pose Manifold
Fisher Graph
• Graph Embedding (S. Yan, IEEE TPAMI, 2007)
– G={X, W} is an undirected weighted graph.
– W measures the similarity between a pair of vertices.
– Laplacian matrix: L = D − W, where D is the diagonal degree matrix with D_ii = Σ_j W_ij
– Most manifold learning methods can be reformulated as
y* = arg min_{y^T B y = d} Σ_{i≠j} ||y_i − y_j||^2 W_ij = arg min_{y^T B y = d} y^T L y,
where d is a constant and B is the constraint matrix.
Figure: between-locality graph and within-locality graph. Courtesy of Shuicheng Yan.
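A small sketch of the graph quantities above: build a k-NN similarity graph W and form the Laplacian L = D − W that appears in the embedding objective. The toy data and the 0/1 connectivity weights are simplifications of the weighting schemes used in the cited methods.

```python
import numpy as np
from sklearn.neighbors import kneighbors_graph

# Sketch: k-NN graph W (0/1 weights) and its Laplacian L = D - W.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 5))                             # 100 samples in R^5

W = kneighbors_graph(X, n_neighbors=6, mode="connectivity").toarray()
W = np.maximum(W, W.T)                                    # symmetrize: undirected graph
D = np.diag(W.sum(axis=1))                                # degree matrix
L = D - W                                                 # graph Laplacian
print(np.allclose(L.sum(axis=1), 0))                      # each row of L sums to zero: True
```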
Discriminant Simplex Analysis
Y. Fu, et al., IEEE Transactions on Information Forensics and Security, 2008.
Similarity Metric
• Single-Sample Metric
– Euclidean Distance and Pearson Correlation Coefficient.
• Multi-Sample Metric
– k-Nearest-Neighbor Simplex
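A tiny sketch of the two single-sample metrics on illustrative vectors:

```python
import numpy as np

# Sketch: Euclidean distance and Pearson correlation between two samples x and y.
x = np.array([1.0, 2.0, 3.0, 4.0])
y = np.array([2.0, 2.5, 3.5, 5.0])

euclidean = np.linalg.norm(x - y)        # Euclidean distance
pearson = np.corrcoef(x, y)[0, 1]        # Pearson correlation coefficient
print(euclidean, pearson)
```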
Correlation Embedding Analysis
Objective function: correlation distance combined with a Fisher graph.
Y. Fu, et al., IEEE Transactions on Pattern Analysis and Machine Intelligence, 2008.
High-Order Data Structure
– m-th order tensors: A ∈ R^{n_1 × n_2 × ⋯ × n_m}
– Here, tensor means a multilinear representation.
– 1st order: vector; 2nd order: matrix; higher orders: tensor
Y. Fu, et al., IEEE Transactions on Circuits and Systems for Video Technology, 2009.
Correlation Tensor Analysis
Given two m-th order tensors A and B, the Pearson Correlation Coefficient (PCC) measures their similarity.
CTA objective function: correlation distance combined with a Fisher graph.
Multilinear representation: m different subspaces, one for each tensor mode.
Y. Fu, et al., IEEE Transactions on Image Processing, 2008.
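One simple way to compute a PCC between two same-sized tensors is over their vectorized entries, as sketched below; the exact CTA formulation in the cited paper may differ in how centering and the multilinear subspaces are handled.

```python
import numpy as np

# Sketch: Pearson correlation between two 3rd-order tensors of the same shape,
# computed on their flattened (vectorized) entries.
rng = np.random.default_rng(0)
A = rng.normal(size=(8, 8, 3))
B = A + 0.1 * rng.normal(size=(8, 8, 3))   # a noisy copy, so the PCC is near 1

pcc = np.corrcoef(A.ravel(), B.ravel())[0, 1]
print(pcc)
```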
Manifold with Noise Effect
Robust Manifold by Low-Rank Recovery
• Real-world ATR data are large scale, unbalanced in dynamic sampling, and
easily affected by noises and outliers, which are difficult to represent.
• Goal: automated, real-time, and robust description of the ATR data space
under uncertainty.
• Low-rank matrix recovery can deal with noises and outliers for data
reconstruction.
Stabilized Manifold Learning
Figure: raw data vs. an existing method (LLE) vs. the new method, under noise and outliers.
Voting for Outlier Detection
Stabilized Manifold Learning
Large Scale Manifold Learning
Graph-based methods require spectral decomposition of n × n matrices,
where n denotes the number of samples.
The storage and computational costs are O(n^2) and O(n^3), respectively,
so it is almost intractable to apply these methods to large-scale
scenarios.
Neighborhood search is also a large-scale concern.
Large Scale Manifold Learning
Graph-oriented clustering vs. K-means clustering
Robust Matching of Sub-Manifolds
A robust visual representation must be insensitive to durations in the case of
dynamics or time series, such as action/activity videos.
A generalized manifold can be considered as a union of sub-manifolds with
different durations which characterize different instances with similar structures,
such as different individuals performing the same action, instead of a single
continuous manifold as conventionally regarded.
Robust matching of these sub-manifolds can be achieved through both low-rank
matrix recovery and simplex synchronization.
Applications
Chemical data visualization
DR algorithm: multidimensional scaling (MDS)
Seung-Hee Bae, Jong Youl Choi, Judy Qiu, Geoffrey Fox: Dimension reduction and visualization of large high-dimensional data via interpolation. HPDC 2010: 203-214
Applications
Biology data visualization
DR algorithm: principal component analysis (PCA)
Andreas Lehrmann, Michael Huber, Aydin Can Polatkan, Albert Pritzkau, Kay Nieselt: Visualizing dimensionality reduction of systems biology data. Data Min. Knowl. Discov. 27(1): 146-165 (2013)
Applications
Biology data visualization
DR algorithm: locally linear embedding (LLE)
Andreas Lehrmann, Michael Huber, Aydin Can Polatkan, Albert Pritzkau, Kay Nieselt: Visualizing dimensionality reduction of systems biology data. Data Min. Knowl. Discov. 27(1): 146-165 (2013)
Applications
Bioinformatics
DR algorithm: multidimensional scaling (MDS)
Adam Hughes, Yang Ruan, Saliya Ekanayake, Seung-Hee Bae, Qunfeng Dong, Mina Rho, Judy Qiu, Geoffrey Fox: Interpolative multidimensional scaling techniques for the identification of clusters in very large sequence sets. BMC Bioinformatics 13(S-2): S9 (2012)
Applications
Metagenomic data visualization
DR algorithm: stochastic neighbor embedding (SNE)
CC Laczny, N Pinel, N Vlassis, P Wilmes: Alignment-free Visualization of Metagenomic Data by Nonlinear Dimension Reduction. Scientific Reports 4 (2014).
Applications
Neuroscience
DR algorithm: multiple algorithms
J. P. Cunningham and B. M. Yu: Dimensionality reduction for large-scale neural recordings. Nature Neuroscience (2014), doi:10.1038/nn.3776.
Applications
Semantic visualization in data mining
DR algorithm: spherical semantic embedding (SSE).
Tuan M. V. Le, Hady Wirawan Lauw: Semantic visualization for spherical representation. KDD 2014: 1007-1016.
Applications
Visualization of machine learning datasets
DR algorithm: stochastic neighbor embedding (SNE)
Zhirong Yang, Jaakko Peltonen, Samuel Kaski: Scalable Optimization of Neighbor Embedding for Visualization. ICML (2) 2013: 127-135
Transfer Learning in Dimension Reduction
• We are facing huge amounts of unlabeled data nowadays
• Only a few databases are labeled
• Training and test data are often inconsistent with each other
• Transfer learning can help: train in one domain and test in another
• Knowledge is better utilized
Recent Advances: Transfer Learning in DR
Motivation:
• We are facing huge amounts of unlabeled data nowadays
• Only a few databases are labeled
• Knowledge is better utilized
Basic Idea:
Given two data sets A and B, use the knowledge learned from A to
help the learning task for B.
Recent Advances: Transfer Learning in DR
Object Recognition and Face Recognition
Learning Framework
Ming Shao, Carlos Castillo, Zhenghong Gu, Yun Fu: Low-Rank Transfer Subspace Learning. ICDM 2012: 1104-1109.
Recent Advances: Robust Subspace Discovery
• Low-rank matrix recovery: the observation (e.g., noisy images from Twitter) is
decomposed into a low-rank part (clean images) and a sparse error part.
• Subspace Learning: find a low-dimensional projection with specific properties.
Unsupervised (e.g., PCA, LPP) or supervised (e.g., LDA).
• Subspace Clustering: discover the underlying subspaces in a data set and correct
errors. Sparse subspace clustering (SSC), low-rank representation (LRR).
Sheng Li, Yun Fu: Robust Subspace Discovery through Supervised Low-Rank Constraints. SDM 2014: 163-171
Recent Advances: Robust Subspace Discovery
Learning Framework
Sheng Li, Yun Fu: Robust Subspace Discovery through Supervised Low-Rank Constraints. SDM 2014: 163-171
Self-Taught Low-Rank Coding for Visual Learning
• Self-taught Learning (Raina et al., 2007)
– Transferring knowledge from an auxiliary domain with minimum restrictions.
– A special type of transfer learning.
• Objective Function
• Our Motivations and Contributions
– Learn effective feature representations for the target domain.
– A high-quality dictionary bridges the auxiliary domain and the target domain.
– The low-rank constraint characterizes the structure information.
– The first general self-taught learning framework is developed, including
supervised and unsupervised learning tasks.
Self-Taught Low-Rank Coding for Visual Learning
Application I: Subspace Clustering
Application II: Image Classification
Summary
• The motivation of using dimensionality reduction (DR) for
visualization
• DR mainly includes feature selection and feature extraction.
• Two basic linear DR methods: PCA and LDA.
• Nonlinear DR methods: LLE, SNE, etc.
• Applications of DR methods.
• Recent advances in DR.