PRACTICAL - 8
Aim:
To identify/prepare a dataset and implement the K-means clustering algorithm in
Python.
To visualize the resulting clusters and centroids.
To analyze the clustering result and study the effect of varying the number of clusters
K.
Theory:
Clustering is an unsupervised learning task that groups data points so that points in
the same group (cluster) are more similar to each other than to those in other groups.
K-means seeks to partition n observations into K clusters {C1, …, CK} by
minimizing the within-cluster sum of squares (WCSS):
WCSS = Σ (i = 1 to K) Σ (x ∈ Ci) ||x − μi||²
where μi is the centroid of cluster Ci.
Algorithm steps:
1. Initialize K centroids (randomly select K points).
2. Assign each data point to the nearest centroid.
3. Update each centroid as the mean of points assigned to it.
4. Repeat steps 2–3 until centroids move less than a tolerance or max iterations
reached.
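A minimal from-scratch sketch of these four steps in NumPy (the function name, tolerance, and seed below are illustrative choices, not part of the practical's own code):

import numpy as np

def kmeans(X, K, max_iters=100, tol=1e-4, seed=42):
    # Step 1: initialize K centroids by picking K random data points
    rng = np.random.default_rng(seed)
    centroids = X[rng.choice(len(X), size=K, replace=False)]
    for _ in range(max_iters):
        # Step 2: assign each point to its nearest centroid
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Step 3: update each centroid as the mean of its assigned points
        new_centroids = np.array([X[labels == k].mean(axis=0) if np.any(labels == k)
                                  else centroids[k] for k in range(K)])
        # Step 4: stop once the centroids barely move
        shift = np.linalg.norm(new_centroids - centroids)
        centroids = new_centroids
        if shift < tol:
            break
    # WCSS: sum of squared distances of each point to its own centroid
    wcss = ((X - centroids[labels]) ** 2).sum()
    return labels, centroids, wcss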
Dataset
Generated synthetically using sklearn.datasets.make_blobs:
o n_samples = 300
o centers = 4
o random_state = 42
This produces four well-separated Gaussian clusters in 2D.
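For reference, a minimal snippet generating such a dataset with these parameters (variable names are illustrative):

from sklearn.datasets import make_blobs

# 300 points drawn from 4 well-separated 2-D Gaussian blobs
X, y_true = make_blobs(n_samples=300, centers=4, random_state=42)
print(X.shape)   # (300, 2)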
Program:
Step 1:
Step 2:
Step 3:
Step 4:
Step 5:
Step 6:
Step 7:
Step 8:
Step 9:
Step 10:
Step 11:
Step 12:
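A minimal end-to-end sketch of such a workflow, assuming scikit-learn's KMeans and matplotlib (the exact code behind the numbered steps may differ):

import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs

# Generate the synthetic dataset described above
X, _ = make_blobs(n_samples=300, centers=4, random_state=42)

# Fit K-means with K = 4 and obtain cluster labels and centroids
kmeans = KMeans(n_clusters=4, n_init=10, random_state=42)
labels = kmeans.fit_predict(X)
centroids = kmeans.cluster_centers_

# Plot points coloured by cluster, with centroids as red X's
plt.scatter(X[:, 0], X[:, 1], c=labels, cmap="viridis", s=30)
plt.scatter(centroids[:, 0], centroids[:, 1], c="red", marker="X", s=200)
plt.title("K-Means Clustering Result")
plt.xlabel("Feature 1")
plt.ylabel("Feature 2")
plt.show()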
Output:
The resulting plot, titled "K-Means Clustering Result", shows the four clusters in different colors and their centroids marked with red X's.
Analysis of Results
1. Cluster quality
o Clusters are compact and well-separated, reflecting the way the data were
generated.
o Centroids lie near the “centre” of each blob.
2. Effect of changing K
o Under-clustering (K<4):
e.g. K=3 merges two true blobs into one cluster → increased WCSS.
o Over-clustering (K>4):
e.g. K=5 splits a true blob into two smaller clusters → may overfit noise.
o Use the Elbow method (plot WCSS vs. K) to pick the "elbow" point where
adding another cluster yields diminishing returns (see the sketch after this list).
3. Suggested extension
o Compute and plot WCSS for K = 1 to K = 8 and identify the elbow, as sketched below.
o Compute silhouette scores for different K to assess cluster separation.
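A sketch of this suggested extension, assuming the same make_blobs dataset; inertia_ is scikit-learn's name for the WCSS, and silhouette_score requires at least two clusters:

import matplotlib.pyplot as plt
from sklearn.cluster import KMeans
from sklearn.datasets import make_blobs
from sklearn.metrics import silhouette_score

X, _ = make_blobs(n_samples=300, centers=4, random_state=42)

wcss, sil = [], {}
for k in range(1, 9):
    km = KMeans(n_clusters=k, n_init=10, random_state=42).fit(X)
    wcss.append(km.inertia_)                 # WCSS for the elbow plot
    if k >= 2:                               # silhouette needs K >= 2
        sil[k] = silhouette_score(X, km.labels_)

# Elbow plot: look for the K where the curve bends sharply
plt.plot(range(1, 9), wcss, marker="o")
plt.xlabel("Number of clusters K")
plt.ylabel("WCSS (inertia)")
plt.title("Elbow Method")
plt.show()

print("Silhouette scores:", sil)             # the highest score suggests the best K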
Conclusion
Successful implementation of K-means and visualization of four natural clusters in the
dataset.
Proper choice of K is crucial: too small merges distinct groups; too large over-splits.
Elbow and silhouette analyses help select an optimal K.