
Ch 6. Cluster Analysis
 Cluster analysis classifies observations into a small number of
homogeneous groups (called clusters) based on observed variables.
- Cluster analysis partitions observations into mutually exclusive groups.

 Purpose of cluster analysis: organize multivariate data into groups and
describe similarities and differences between the groups.
- Ex 1) The marketing department of a company is interested in grouping
potential customers of their product.
- Ex 2) In psychiatry, classifying the mental status of patients helps
identify causes and leads to improved methods of therapy.

 Cluster analysis should be judged largely on its usefulness.
- Alternative clusterings may exist for the same set of observations.
- Some classifications will be more useful than others.
Ch 6. Cluster Analysis

[Figure: a set of observations partitioned into two clusters; Cluster A contains observations 1, 2, 4, … and Cluster B contains observations 3, 5, 6, ….]
Ch 6. Cluster Analysis

 The simplest way to identify groups is to examine the data using graphs.
- Graphs can be made from the raw data or from the results of a principal
components analysis.
- Graphical techniques are often useful for searching for clusters or for
providing evidence to justify a clustering result.

 Cluster analysis classifies previously unclassified material.
- Assume that the number and composition of the clusters are unknown.
6.2. Agglomerative Hierarchical
Clustering Techniques
 Hierarchical clustering
- Hierarchical clustering consists of a series of partitions that proceed
from a single ‘cluster’ containing all observations to n clusters, each
containing a single observation.

 Agglomerative hierarchical clustering techniques produce partitions by
a series of successive fusions of the n observations into groups.
- Fusions are irreversible.
- Once an agglomerative hierarchical clustering fuses two observations
into the same group, they cannot subsequently be divided into different
groups.
- The user needs to determine an appropriate number of clusters.

 Hierarchical classifications can be represented by a dendrogram.
- A dendrogram illustrates the fusions made at each stage of the analysis.
Example of Dendrogram

 Agglomerative hierarchical clustering

[Dendrogram figure, stages 1-5: A and D fuse, C and E fuse, B joins (C, E), and finally (A, D) and (B, C, E) fuse into the single cluster (A, B, C, D, E). Reading the tree in this direction is the agglomerative direction.]
6.2. Agglomerative Hierarchical
Clustering Techniques
 An agglomerative hierarchical clustering procedure produces a series of
partitions of the data, Pn, Pn-1, …, P1.
- The first, Pn, consists of n single member clusters.
- The last, P1, consists of a single group containing all n observations.

 Basic operation of agglomerative hierarchical clustering
- START with clusters C1, C2, …, Cn, each containing a single observation.
(1) Find the nearest pair of distinct clusters, say Ci and Cj, merge Ci and Cj, and
decrease the number of clusters by one.
(2) If the number of clusters equals one, then stop; else return to (1).
- At each stage, the method fuses the observations or clusters that are closest (or most similar).
- One has to decide how to define the distance (or similarity) between an observation
and a group of observations, or between two groups of observations.
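A minimal sketch of this basic loop (in Python, assuming a precomputed symmetric distance matrix; single linkage is used as the inter-cluster distance purely for illustration, and the function name agglomerate is not from any particular package):

```python
import numpy as np

def agglomerate(D, n_clusters=1):
    """Basic agglomerative loop: start with one cluster per observation and
    repeatedly fuse the nearest pair of clusters until n_clusters remain.
    Single linkage is used here to measure inter-cluster distance."""
    D = np.asarray(D, dtype=float)
    clusters = [[i] for i in range(D.shape[0])]       # START: singleton clusters
    merges = []                                       # record of the fusions
    while len(clusters) > n_clusters:
        best = (None, None, np.inf)
        for a in range(len(clusters)):                # (1) find the nearest pair
            for b in range(a + 1, len(clusters)):
                d_ab = min(D[i, j] for i in clusters[a] for j in clusters[b])
                if d_ab < best[2]:
                    best = (a, b, d_ab)
        a, b, d_ab = best
        merges.append((clusters[a], clusters[b], d_ab))
        clusters[a] = clusters[a] + clusters[b]       # fuse the pair (irreversible)
        del clusters[b]                               # one cluster fewer
    return clusters, merges                           # (2) stop at n_clusters
```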
6.2.1. Measuring inter-cluster dissimilarity

 Agglomerative hierarchical clustering techniques differ in how they
measure the distances between or similarity of two clusters.

 Single linkage clustering
- Use an inter-group measure
$$d_{AB} = \min_{i \in A,\; j \in B} d_{ij},$$
where $d_{AB}$ is the distance between two clusters A and B,
and $d_{ij}$ is the distance between observations i and j.

 Complete linkage clustering
- Use an inter-group measure
$$d_{AB} = \max_{i \in A,\; j \in B} d_{ij}.$$

 Both single and complete linkage clustering are invariant to monotone
transformations of the original inter-observation distances.
6.2.1. Measuring inter-cluster dissimilarity

 Group average clustering
- Measure inter-cluster distance or similarity by
$$d_{AB} = \frac{1}{n_A n_B} \sum_{i \in A} \sum_{j \in B} d_{ij},$$
where $n_A$ and $n_B$ are the numbers of observations in clusters A and B.
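As a small illustration of these three measures, the sketch below (plain Python with NumPy; the function name is illustrative, not from a library) computes d_AB from a matrix of pairwise observation distances, for clusters given as lists of 0-based observation indices.

```python
import numpy as np

def inter_cluster_distance(D, A, B, method="single"):
    """Distance between clusters A and B (lists of observation indices),
    given the matrix D of pairwise observation distances d_ij."""
    pair_dists = np.array([[D[i, j] for j in B] for i in A])
    if method == "single":      # d_AB = min over i in A, j in B of d_ij
        return pair_dists.min()
    if method == "complete":    # d_AB = max over i in A, j in B of d_ij
        return pair_dists.max()
    if method == "average":     # d_AB = (1 / (n_A n_B)) * sum of d_ij
        return pair_dists.mean()
    raise ValueError("unknown method")
```

For the five-individual dissimilarity matrix introduced below (stored as a NumPy array), inter_cluster_distance(D, [0, 1], [3, 4], "single") returns 8, the value d(12)(45) used in the single linkage example of Section 6.2.2.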

 Illustrative example
- A dissimilarity matrix for five individuals (rows and columns labelled 1-5):

$$D = \begin{pmatrix} 0 & & & & \\ 2 & 0 & & & \\ 6 & 5 & 0 & & \\ 10 & 9 & 4 & 0 & \\ 9 & 8 & 5 & 3 & 0 \end{pmatrix}$$
6.2.1. Measuring inter-cluster dissimilarity

 Figure (Everitt, 2010): illustration of the single linkage, complete linkage, and group average distances between two clusters.
6.2.2. Illustrative example of the application
of single linkage
 Stage 1: Since d12 is the smallest entry in the matrix D, individuals 1 and 2
are merged to form a cluster.
 Stage 2.
- The distances between this group and the three remaining individuals, 3, 4,
and 5 are obtained as follows:
d(12)3 = min(d13, d23) = d23 = 5,
d(12)4 = min(d14, d24) = d24 = 9,
d(12)5 = min(d15, d25) = d25 = 8.
- Form a new distance matrix D1 (rows and columns labelled (12), 3, 4, 5):

$$D_1 = \begin{pmatrix} 0 & & & \\ 5 & 0 & & \\ 9 & 4 & 0 & \\ 8 & 5 & 3 & 0 \end{pmatrix}$$
- Since d45 is the smallest entry in D1, individuals 4 and 5 are merged to form
a second cluster.
6.2.2. Illustrative example of the application
of single linkage
 Stage 3.
- The distances between (45) and the remaining (12) and 3 are obtained as
follows:
d(12)(45) = min(d14, d15, d24, d25) = d25 = 8,
d3(45) = min(d34, d35) = d34 = 4.
- Form a new distance matrix D2 (rows and columns labelled (12), 3, (45)):

$$D_2 = \begin{pmatrix} 0 & & \\ 5 & 0 & \\ 8 & 4 & 0 \end{pmatrix}$$

- Since d3(45) is the smallest entry in D2, individual 3 is merged with the (45)
cluster.
 Stage 4.
- Fusion of the two remaining groups takes place to form a single group
containing all five individuals.
6.2.2. Illustrative example of the application
of single linkage
 The partitions produced at each stage are:
Stage Groups
P5 [1],[2],[3],[4],[5]
P4 [12],[3],[4],[5]
P3 [12],[3],[45]
P2 [12],[345]
P1 [12345]

- The single linkage dendrogram (fusion levels 2, 3, 4, 5): [figure]
6.2.2. Illustrative example of the application
of complete linkage
 Stage 1: Since d12 is the smallest entry in the matrix D, individuals 1 and 2
are merged to form a cluster.
 Stage 2.
- The distances between this group and the three remaining individuals, 3, 4,
and 5 are obtained as follows:
d(12)3 = max(d13, d23) = d13 = 6,
d(12)4 = max(d14, d24) = d14 = 10,
d(12)5 = max(d15, d25) = d15 = 9.
- Form a new distance matrix D1 (rows and columns labelled (12), 3, 4, 5):

$$D_1 = \begin{pmatrix} 0 & & & \\ 6 & 0 & & \\ 10 & 4 & 0 & \\ 9 & 5 & 3 & 0 \end{pmatrix}$$

- Since d45 is the smallest entry in D1, individuals 4 and 5 are merged to form
a second cluster.
6.2.2. Illustrative example of the application
of complete linkage
 Stage 3.
- The distances between (45) and the remaining (12) and 3 are obtained as
follows:
d(12)(45) = max(d14, d15, d24, d25) = d14 = 10,
d3(45) = max(d34, d35) = d35 = 5.
- Form a new distance matrix D2 (rows and columns labelled (12), 3, (45)):

$$D_2 = \begin{pmatrix} 0 & & \\ 6 & 0 & \\ 10 & 5 & 0 \end{pmatrix}$$
- Since d3(45) is the smallest entry in D2, individual 3 is merged with the (45)
cluster.
 Stage 4.
- Fusion of the two remaining groups takes place to form a single group
containing all five individuals.
6.2.2. Illustrative example of the application
of complete linkage
 The partitions produced at each stage are:
Stage Groups
P5 [1],[2],[3],[4],[5]
P4 [12],[3],[4],[5]
P3 [12],[3],[45]
P2 [12],[345]
P1 [12345]

- The complete linkage dendrogram (fusion levels 2, 3, 5, 10): [figure]
- Note that the partitions are the same as with single linkage, but the
distances used to merge groups are different.
6.2.2. Illustrative example of the application
of group average
 Stage 1: Since d12 is the smallest entry in the matrix D, individuals 1 and 2
are merged to form a cluster.
 Stage 2.
- The distances between this group and the three remaining individuals, 3, 4,
and 5 are obtained as follows:
d(12)3 = (d13 + d23)/2 = 5.5,
d(12)4 = (d14 + d24)/2 = 9.5,
d(12)5 = (d15 + d25)/2 = 8.5.
- Form a new distance matrix D1 (rows and columns labelled (12), 3, 4, 5):

$$D_1 = \begin{pmatrix} 0 & & & \\ 5.5 & 0 & & \\ 9.5 & 4 & 0 & \\ 8.5 & 5 & 3 & 0 \end{pmatrix}$$

- Since d45 is the smallest entry in D1, individuals 4 and 5 are merged to form
a second cluster.
6.2.2. Illustrative example of the application
of group average
 Stage 3.
- The distances between (45) and the remaining (12) and 3 are obtained as
follows:
d(12)(45) = (d14 + d15 + d24 + d25)/4 = 9,
d3(45) = (d34 + d35)/2 = 4.5.
- Form a new distance matrix D2 (rows and columns labelled (12), 3, (45)):

$$D_2 = \begin{pmatrix} 0 & & \\ 5.5 & 0 & \\ 9 & 4.5 & 0 \end{pmatrix}$$

- Since d3(45) is the smallest entry in D2, individual 3 is merged with the (45)
cluster.
 Stage 4.
- Fusion of the two remaining groups takes place to form a single group
containing all five individuals.
6.2.2. Illustrative example of the application
of group average
 The partitions produced at each stage are:
Stage Groups
P5 [1],[2],[3],[4],[5]
P4 [12],[3],[4],[5]
P3 [12],[3],[45]
P2 [12],[345]
P1 [12345]

- The group average dendrogram (fusion levels 2, 3, 4.5, 47/6 ≈ 7.83): [figure]
- Note that the partitions are the same as with single linkage and
complete linkage, but the distances used to merge groups are different.
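The three worked examples can be checked with SciPy's standard hierarchical clustering routines (a sketch, assuming SciPy is installed; squareform converts the symmetric matrix to the condensed form that linkage expects):

```python
import numpy as np
from scipy.spatial.distance import squareform
from scipy.cluster.hierarchy import linkage

# Dissimilarity matrix D for the five individuals, in symmetric form.
D = np.array([[ 0,  2,  6, 10,  9],
              [ 2,  0,  5,  9,  8],
              [ 6,  5,  0,  4,  5],
              [10,  9,  4,  0,  3],
              [ 9,  8,  5,  3,  0]], dtype=float)

for method in ("single", "complete", "average"):
    Z = linkage(squareform(D), method=method)
    print(method, Z[:, 2])   # fusion levels at each stage
    # scipy.cluster.hierarchy.dendrogram(Z) would draw the corresponding tree.

# Expected fusion levels:
#   single   -> 2, 3, 4,   5
#   complete -> 2, 3, 5,   10
#   average  -> 2, 3, 4.5, 47/6 (approx. 7.83)
```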
6.2.3. Some properties of agglomerative
hierarchical clustering techniques

 Single linkage has a tendency to incorporate intermediate points into an
existing cluster rather than initiating a new one (called chaining).
- It leads to the formation of long ‘straggly’ clusters.
- Ex) Everitt (2010)
6.2.3. Some properties of agglomerative
hierarchical clustering techniques

 Hierarchical techniques using complete linkage and group average tend to
produce solutions in which the clusters are ‘spherical’, even when the data
contain relatively well-separated clusters of other shapes.

- Ex) Everitt (2010)


6.2.3. Some properties of agglomerative
hierarchical clustering techniques

 Empirical investigations indicate that no single method can be claimed
superior for all types of data.
- The presence of outliers led to very poor performance by group average
but left single linkage virtually unaffected.
- When the data contained a true cluster structure masked by the addition of
‘noise’, single linkage gave poor results.
6.2.5. Partitions from a hierarchy:
the number-of-groups problem

 We are not interested in the complete hierarchy but only in one or two
partitions obtained from it.
- Determining an appropriate number of groups is not straightforward.
- In hierarchical clustering, partitions are obtained by ‘cutting’ a dendrogram.

 One informal approach
- Examine the sizes of the differences between successive fusion levels
in the dendrogram.
- A large change (jump) in fusion level may indicate a particular number
of clusters.
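With a linkage matrix in hand, this informal approach is easy to apply: the fusion levels are stored in the third column, and fcluster extracts the partition obtained by cutting the tree. A sketch (assuming SciPy and the five-individual matrix D from the sketch in Section 6.2.2):

```python
import numpy as np
from scipy.spatial.distance import squareform
from scipy.cluster.hierarchy import linkage, fcluster

Z = linkage(squareform(D), method="complete")   # complete linkage on the toy example

heights = Z[:, 2]                         # fusion levels: 2, 3, 5, 10
gaps = np.diff(heights)                   # differences between successive fusion levels
k = len(heights) - int(np.argmax(gaps))   # cut just below the largest jump
labels = fcluster(Z, t=k, criterion="maxclust")
print(k, labels)                          # k = 2: clusters {1, 2} and {3, 4, 5}
```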
6.2.6. Examples of the application of
agglomerative hierarchical clustering techniques

 Example 1. Chest, waist, and hip measurements for 20 individuals (Everitt, 2010).
6.2.6. Examples of the application of
agglomerative hierarchical clustering techniques

 Example 2. Skull data (Everitt, 2010)
- 32 skulls found in the south-western and eastern districts of Tibet.
- It was known that the data could be divided into two groups:
observations #1-17 vs. observations #18-32.
- Five variables are measured as follows:
Greatest length of skull (X1),
Greatest horizontal breadth of skull (X2),
Height of skull (X3),
Upper face height (X4),
Face breadth (X5)
- Ignore the a priori grouping and investigate how a cluster analysis solution
reproduces the prior partition (1-17 vs. 18-32).
6.3. Optimization Methods

 Clustering techniques based on optimization methods
- For a chosen number of groups, these methods produce a partition of the
observations by optimizing a numerical criterion.

 With a single variable, the partition may be chosen by minimizing the
within-group sum of squares of the variable.
6.3. Optimization Methods

 For multivariate data, the commonly used criteria consider three matrices
that can be calculated for each partition of the data into g groups:

$$T = \sum_{i=1}^{g} \sum_{j=1}^{n_i} (\mathbf{x}_{ij} - \bar{\mathbf{x}})(\mathbf{x}_{ij} - \bar{\mathbf{x}})',$$

$$W = \sum_{i=1}^{g} \sum_{j=1}^{n_i} (\mathbf{x}_{ij} - \bar{\mathbf{x}}_i)(\mathbf{x}_{ij} - \bar{\mathbf{x}}_i)',$$

$$B = \sum_{i=1}^{g} n_i (\bar{\mathbf{x}}_i - \bar{\mathbf{x}})(\bar{\mathbf{x}}_i - \bar{\mathbf{x}})',$$

where $\mathbf{x}_{ij}$ is the vector of variable values for the jth observation in the ith group,
$\bar{\mathbf{x}}$ is the mean vector of all n observations,
$\bar{\mathbf{x}}_i$ is the mean vector of the observations in group i,
and $n_i$ is the number of observations in group i.
- Note that T = W + B,
where T represents total dispersion, W represents within-group dispersion, and
B represents between-group dispersion.
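A small numerical sketch (using NumPy; the helper name scatter_matrices is illustrative) that builds T, W, and B for a partitioned data set and confirms the identity T = W + B. The data are the four individuals used in the K-means example later in this section, split into the groups (1) and (234).

```python
import numpy as np

def scatter_matrices(X, labels):
    """Total (T), within-group (W), and between-group (B) dispersion matrices
    for data X (n x p) partitioned according to `labels`."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    xbar = X.mean(axis=0)                              # overall mean vector
    T = (X - xbar).T @ (X - xbar)
    W = np.zeros_like(T)
    B = np.zeros_like(T)
    for g in np.unique(labels):
        Xg = X[labels == g]
        xbar_g = Xg.mean(axis=0)                       # group mean vector
        W += (Xg - xbar_g).T @ (Xg - xbar_g)
        B += len(Xg) * np.outer(xbar_g - xbar, xbar_g - xbar)
    return T, W, B

X = np.array([[5, 3], [-1, 1], [1, -2], [-3, -2]], dtype=float)
T, W, B = scatter_matrices(X, [0, 1, 1, 1])            # partition (1) vs (234)
print(np.allclose(T, W + B))                           # True: T = W + B
```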
6.3. Optimization Methods

 For p = 1, the identity T = W + B represents the separation of the total
sum of squares of a variable into the within-group and between-group sums
of squares (as in ANOVA).
- A natural criterion is to choose the partition corresponding to the minimum
value of the within-group sum of squares (or equivalently, the maximum
value of the between-group sum of squares).

 For p > 1, a number of criteria have been suggested. Among them:
(1) Minimization of tr(W): minimize the sum, over variables, of the
within-group sums of squares.
(2) Minimization of det(W): minimize the determinant of the within-group
dispersion matrix W.
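Continuing the sketch above (and reusing its scatter_matrices helper and the 4 x 2 data X), the two criteria can be compared across candidate partitions; note that they need not favour the same partition.

```python
import numpy as np

# Candidate 2-group partitions of the four individuals, as label vectors.
candidates = {"(12)(34)": [0, 0, 1, 1],
              "(1)(234)": [0, 1, 1, 1],
              "(134)(2)": [0, 1, 0, 0]}
for name, labels in candidates.items():
    _, W, _ = scatter_matrices(X, labels)
    print(name, "tr(W) =", np.trace(W), " det(W) =", round(np.linalg.det(W), 2))
# For these data, tr(W) is smallest for (1)(234), which is also the K-means
# solution found later in this section; det(W) happens to be smallest for
# (12)(34), showing that the two criteria can disagree.
```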
6.3. Optimization Methods

$$W = \sum_{i=1}^{g}\sum_{j=1}^{n_i} (\mathbf{x}_{ij}-\bar{\mathbf{x}}_i)(\mathbf{x}_{ij}-\bar{\mathbf{x}}_i)'
= \sum_{i=1}^{g}\sum_{j=1}^{n_i}
\begin{pmatrix} x_{ij1}-\bar{x}_{i1} \\ x_{ij2}-\bar{x}_{i2} \\ \vdots \\ x_{ijp}-\bar{x}_{ip} \end{pmatrix}
\begin{pmatrix} x_{ij1}-\bar{x}_{i1}, & x_{ij2}-\bar{x}_{i2}, & \ldots, & x_{ijp}-\bar{x}_{ip} \end{pmatrix}$$

$$= \begin{pmatrix}
\sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ij1}-\bar{x}_{i1})^2 & \sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ij1}-\bar{x}_{i1})(x_{ij2}-\bar{x}_{i2}) & \cdots & \sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ij1}-\bar{x}_{i1})(x_{ijp}-\bar{x}_{ip}) \\
 & \sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ij2}-\bar{x}_{i2})^2 & \cdots & \sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ij2}-\bar{x}_{i2})(x_{ijp}-\bar{x}_{ip}) \\
 & & \ddots & \vdots \\
 & & & \sum_{i=1}^{g}\sum_{j=1}^{n_i}(x_{ijp}-\bar{x}_{ip})^2
\end{pmatrix},$$

so the diagonal elements of W are the within-group sums of squares of the p variables, and tr(W) is their total.
6.3. Optimization Methods
 The K-means method is based on the criterion of minimizing tr(W).

 K-means method
(1) Partition the individuals into K initial clusters.
(2) Proceed through the list of individuals, assigning an individual to the
cluster whose centroid (mean) is nearest.
- Distance is usually computed using the Euclidean distance with either
standardized or unstandardized observations.
(3) Recalculate the centroid for the cluster receiving the new individual and
for the cluster losing the individual.
- If an individual is moved from the initial configuration, the cluster
centroid (means) must be updated before proceeding.
(4) Repeat Steps (2) and (3) until no more reassignments take place.

 Note that the Euclidean distance between two arbitrary points P and Q
with coordinates P = (x1, x2, …, xp) and Q = (y1, y2, …, yp) is
$$d(P, Q) = \sqrt{(x_1 - y_1)^2 + (x_2 - y_2)^2 + \cdots + (x_p - y_p)^2}.$$
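A minimal Python sketch of Steps (2)-(4), assuming the data are stored in a NumPy array and an initial labelling from Step (1) is supplied; kmeans_reassign is an illustrative name, not a library routine.

```python
import numpy as np

def kmeans_reassign(X, labels, max_pass=100):
    """Pass through the individuals, assigning each to the cluster with the
    nearest centroid (squared Euclidean distance) and recomputing the affected
    centroids immediately, until a full pass makes no reassignment."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels).copy()
    for _ in range(max_pass):
        changed = False
        for i, x in enumerate(X):
            # current centroid of every cluster (refreshed after each move)
            centroids = {g: X[labels == g].mean(axis=0) for g in np.unique(labels)}
            dists = {g: ((x - c) ** 2).sum() for g, c in centroids.items()}
            nearest = min(dists, key=dists.get)
            if nearest != labels[i]:
                labels[i] = nearest        # Steps (2)-(3): reassign and update
                changed = True
        if not changed:                    # Step (4): stop when nothing moves
            break
    return labels
```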
6.3. Optimization Methods
 Example. K-means method with K = 2
Individual   X1   X2
1             5    3
2            -1    1
3             1   -2
4            -3   -2
(1) Divide the four individuals into 2 clusters.
- Arbitrarily partition the individuals into two clusters, such as (12) and (34).
- Compute the coordinates of the cluster centroid:
Coordinates of centroid
Cluster X1 X2
(12) {5+(-1)}/2 = 2 (3+1)/2 = 2
(34) {1+(-3)}/2 = -1 {-2+(-2)}/2 = -2
6.3. Optimization Methods
 Example. K-means method with K = 2 (continued)
(2) Compute the squared distances:
d1(12)² = (5-2)² + (3-2)² = 10,
d1(34)² = (5-(-1))² + (3-(-2))² = 61.
- Since individual 1 is closer to cluster (12) than to cluster (34),
it is not reassigned.
d2(12)² = (-1-2)² + (1-2)² = 10,
d2(34)² = (-1-(-1))² + (1-(-2))² = 9.
- Since individual 2 is closer to cluster (34) than to cluster (12),
it is reassigned to cluster (34).

- If an item is moved from the initial configuration, the cluster
centroids (means) must be updated before proceeding.
- This gives clusters (1) and (234); the squared distances of individuals
3 and 4 are not checked until the centroids have been updated.
6.3. Optimization Methods
 Example. K-means method with K = 2 [figure]
6.3. Optimization Methods
 Example. K-means method with K = 2 (continued)
(3) Recalculate the coordinates of the cluster centroids:
Cluster   X1                      X2
(1)       5                       3
(234)     {(-1)+1+(-3)}/3 = -1    {1+(-2)+(-2)}/3 = -1

(4) Check each individual for reassignment.
Squared distances to group centroids:
Cluster   Individual: 1    2    3    4
(1)                   0    40   41   89
(234)                 52   4    5    5
- Each individual is assigned to the cluster with the nearest centroid, and the
process stops.
- The final K = 2 clusters are 1 and (234).
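Applying the kmeans_reassign sketch given earlier in this section to these four individuals, starting from the arbitrary partition (12), (34), reproduces this result:

```python
import numpy as np

X = np.array([[5, 3], [-1, 1], [1, -2], [-3, -2]], dtype=float)
labels = kmeans_reassign(X, [0, 0, 1, 1])      # initial clusters (12) and (34)
print(labels)                                  # [0 1 1 1]  -> clusters (1) and (234)
for g in np.unique(labels):
    print(g, X[labels == g].mean(axis=0))      # centroids (5, 3) and (-1, -1)
```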
6.3. Optimization Methods
 Example. K-means method with K = 2 [figure]
6.3. Optimization Methods

 Example. Tibetan skull data using the K-means method (revisited)
- 32 skulls found in the south-western and eastern districts of Tibet.
- It was known that the data could be divided into two groups:
Observations #1-17 vs. observations #18-32.
- Five measurements:
Greatest length of skull (X1),
Greatest horizontal breadth of skull (X2),
Height of skull (X3),
Upper face height (X4),
Face breadth (X5)
- Ignore the a priori grouping and investigate how a cluster analysis
solution reproduces the prior partition (1-17 vs. 18-32).
6.3. Optimization Methods

 K-means method
- Rather than starting with a partition of all individuals into K preliminary
groups in Step (1), one could specify K initial centroids and then proceed to
Step (2).
- The final assignment of individuals to clusters could depend on the initial
partition or the initial selection of centroids.
- To avoid biases, randomly partition the individuals into initial groups or
randomly select initial centroids.
- To check the stability of the clustering, it is desirable to rerun the algorithm
with a new initial partition.
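One convenient way to carry out such reruns is scikit-learn's KMeans, which itself restarts the algorithm from n_init random initial configurations; a sketch (assuming scikit-learn is installed, and reusing the four-individual toy data purely for illustration):

```python
import numpy as np
from sklearn.cluster import KMeans

X = np.array([[5, 3], [-1, 1], [1, -2], [-3, -2]], dtype=float)

# Rerun K-means from several different random initializations and compare.
for seed in range(3):
    km = KMeans(n_clusters=2, n_init=10, random_state=seed).fit(X)
    print(seed, km.labels_, round(km.inertia_, 2))   # inertia_ equals tr(W)
# Identical partitions (up to relabelling) across reruns suggest a stable clustering.
```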
6.3. Optimization Methods

 K-means method
- If two or more initial centroids inadvertently lie within a single cluster,
their resulting clusters will be poorly differentiated.
- The existence of an outlier might produce at least one group with very
dispersed individuals.
- Even if the population is known to consist of K groups, data from the rarest
group may not appear in the sample. Forcing the data into K groups would
lead to nonsensical clusters.
- It is recommended not to fix the number of clusters, K, in advance.
- In cases where a single run of the algorithm requires the user to specify K,
it is always a good idea to rerun the algorithm for several choices.
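A sketch of such reruns (again with scikit-learn and the toy data X from the previous sketch; in practice one would use the full data set): the within-group criterion tr(W), reported by scikit-learn as inertia_, can be compared across choices of K.

```python
from sklearn.cluster import KMeans

for k in range(1, 4):
    km = KMeans(n_clusters=k, n_init=10, random_state=0).fit(X)
    print(k, round(km.inertia_, 2))
# tr(W) always decreases as K grows; a pronounced flattening after some K
# is informal evidence in favour of that number of clusters.
```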
