Hierarchical clustering groups data into a tree of clusters. It begins by treating every data point as a separate cluster and then repeatedly executes the following steps:
1. Identify the two clusters that are closest together, and
2. Merge these two most similar clusters into one.
These steps continue until all the clusters are merged together. This bottom-up approach is known as Agglomerative Hierarchical clustering.
The aim of hierarchical clustering is to produce a hierarchical series of nested clusters. A diagram called a dendrogram (a tree-like diagram that records the sequence of merges or splits) graphically represents this hierarchy. It is an inverted tree that describes the order in which points are merged (bottom-up view) or clusters are split apart (top-down view).
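As a rough illustration, the short sketch below runs bottom-up clustering with scikit-learn's AgglomerativeClustering on a small made-up data set; the coordinates, the choice of three clusters, and the average-linkage criterion are assumptions for the example, not part of the original text.

import numpy as np
from sklearn.cluster import AgglomerativeClustering

# Three small, well-separated groups of points (made up for illustration).
X = np.array([
    [1.0, 1.0], [1.2, 0.8],
    [5.0, 5.0], [5.1, 4.9],
    [9.0, 1.0], [9.2, 1.1],
])

# Every point starts as its own cluster; the closest pair of clusters is
# merged repeatedly until only n_clusters remain.
model = AgglomerativeClustering(n_clusters=3, linkage="average")
labels = model.fit_predict(X)
print(labels)  # e.g. [0 0 1 1 2 2] (the label numbering may differ)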
Let's say we have six data points A, B, C, D, E, and F.
- Step 1: Treat each point as a single cluster and calculate the distance from each cluster to all the other clusters.
- Step 2: Merge the most similar clusters into a single cluster. Say cluster (B) and cluster (C) are very similar to each other, so we merge them; likewise for clusters (D) and (E). We are left with the clusters [(A), (BC), (DE), (F)].
- Step 3: Recalculate the proximities according to the algorithm and merge the two nearest clusters, (DE) and (F), to form [(A), (BC), (DEF)].
- Step 4: Repeating the same process, the clusters (DEF) and (BC) are now the closest and are merged, leaving [(A), (BCDEF)].
- Step 5: Finally, the two remaining clusters are merged into a single cluster, [(ABCDEF)].
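The same merge sequence can be reproduced with SciPy. The coordinates below are invented so that B/C and D/E are the closest pairs, F sits near D/E, and A lies far from everything, mirroring the walkthrough above; matplotlib is only needed for the dendrogram plot.

import numpy as np
import matplotlib.pyplot as plt
from scipy.cluster.hierarchy import linkage, dendrogram

points = np.array([
    [-3.0, 0.0],  # A
    [4.0, 4.0],   # B
    [4.1, 4.1],   # C
    [8.0, 0.0],   # D
    [8.1, 0.1],   # E
    [9.5, 0.5],   # F
])

# Each row of Z records one merge: the two cluster indices being joined,
# the distance between them, and the size of the newly formed cluster.
Z = linkage(points, method="single")
print(Z)  # merges appear in order: (B,C), (D,E), (DE,F), (BC,DEF), (A, rest)

dendrogram(Z, labels=["A", "B", "C", "D", "E", "F"])
plt.show()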
Divisive Hierarchical clustering
Divisive Hierarchical clustering is precisely the opposite of Agglomerative Hierarchical clustering. In Divisive Hierarchical clustering, we start by considering all of the data points as a single cluster and, in every iteration, we separate out the data points that are least similar to the rest of their cluster. In the end, we are left with N clusters, one for each data point.
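Divisive clustering is not available out of the box in scikit-learn, so the sketch below only approximates the top-down idea by repeatedly bisecting the largest remaining cluster with 2-means; the stopping rule (a target cluster count) and the use of k-means for the split are assumptions for illustration.

import numpy as np
from sklearn.cluster import KMeans

def divisive_clustering(X, n_clusters):
    # Start with every point in one big cluster (label 0).
    labels = np.zeros(len(X), dtype=int)
    while len(np.unique(labels)) < n_clusters:
        # Pick the currently largest cluster and split it in two with 2-means.
        sizes = {c: np.sum(labels == c) for c in np.unique(labels)}
        target = max(sizes, key=sizes.get)
        idx = np.where(labels == target)[0]
        halves = KMeans(n_clusters=2, n_init=10, random_state=0).fit_predict(X[idx])
        # One half keeps the old label, the other half gets a fresh label.
        labels[idx[halves == 1]] = labels.max() + 1
    return labels

X = np.random.RandomState(0).rand(30, 2)
print(divisive_clustering(X, 4))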
DBSCAN Clustering
DBSCAN is a density-based clustering algorithm that groups data
points that are closely packed together and marks outliers as
noise based on their density in the feature space. It identifies clusters as
dense regions in the data space separated by areas of lower density. Unlike
K-Means or hierarchical clustering, which assume clusters are compact and
spherical, DBSCAN performs well in handling real-world data irregularities
such as:
- Arbitrary-Shaped Clusters: Clusters can take any shape, not just circular or convex.
- Noise and Outliers: It effectively identifies and handles noise points without assigning them to any cluster.
[Figure: K-Means, hierarchical clustering, and DBSCAN applied to the same data set]
The figure above compares the algorithms on one data set: K-Means and hierarchical clustering handle compact, spherical clusters (with varying noise tolerance), while DBSCAN recovers arbitrary-shaped clusters and explicitly handles noise.
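As a rough sketch of this behaviour, the example below runs scikit-learn's DBSCAN on the classic two-moons data set, which K-Means handles poorly because the clusters are not spherical; the eps and min_samples values are illustrative choices for this particular data.

import numpy as np
from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

X, _ = make_moons(n_samples=300, noise=0.05, random_state=42)

db = DBSCAN(eps=0.2, min_samples=5).fit(X)
labels = db.labels_

# Points labelled -1 are treated as noise rather than forced into a cluster.
n_clusters = len(set(labels)) - (1 if -1 in labels else 0)
n_noise = int(np.sum(labels == -1))
print(f"clusters found: {n_clusters}, noise points: {n_noise}")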
Key Parameters in DBSCAN
1. eps: This defines the radius of the neighborhood around a data point. If the distance between two points is less than or equal to eps, they are considered neighbors. A common method to determine eps is to analyze the k-distance graph, as sketched after this list. Choosing the right eps is important:
- If eps is too small, most points will be classified as noise.
- If eps is too large, clusters may merge and the algorithm may fail to distinguish between them.
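A minimal sketch of that k-distance heuristic is shown below, assuming scikit-learn's NearestNeighbors and the same two-moons data as above; eps is then read off near the sharp bend (the "elbow") of the sorted curve.

import numpy as np
import matplotlib.pyplot as plt
from sklearn.datasets import make_moons
from sklearn.neighbors import NearestNeighbors

X, _ = make_moons(n_samples=300, noise=0.05, random_state=42)

k = 5  # match the MinPts value you intend to use
nbrs = NearestNeighbors(n_neighbors=k + 1).fit(X)  # +1: each point is its own nearest neighbor
distances, _ = nbrs.kneighbors(X)

# Sort every point's distance to its k-th real neighbor and plot the curve.
k_distances = np.sort(distances[:, -1])
plt.plot(k_distances)
plt.xlabel("points sorted by k-distance")
plt.ylabel(f"distance to {k}-th nearest neighbor")
plt.show()  # choose eps near the elbow of this curve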
2. MinPts: This is the minimum number of points required within the eps radius of a point for it to form a dense region. A general rule of thumb is to set MinPts >= D + 1, where D is the number of dimensions in the dataset. In most cases, a minimum value of MinPts = 3 is recommended.
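The snippet below simply applies that rule of thumb, again on the two-moons data; in scikit-learn, MinPts corresponds to the min_samples parameter of DBSCAN.

from sklearn.cluster import DBSCAN
from sklearn.datasets import make_moons

X, _ = make_moons(n_samples=300, noise=0.05, random_state=42)

D = X.shape[1]               # number of dimensions (2 here)
min_pts = max(D + 1, 3)      # rule of thumb: MinPts >= D + 1, and at least 3
db = DBSCAN(eps=0.2, min_samples=min_pts).fit(X)
print(f"min_samples used: {min_pts}, noise points: {int((db.labels_ == -1).sum())}")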