0% found this document useful (0 votes)

43 views4 pages

Data Entry

The document loads packages, reads in a CSV file containing student marks data, scales the numeric data using standardization, performs k-means clustering on the scaled data, and plots the results of the clustering to visualize the different clusters. Specifically, it loads data on student marks, scales the science, accounting, and maths columns, performs k-means clustering with k=3 clusters, and plots the clustered data with each cluster in a different color.

Uploaded by

2147033

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

43 views4 pages

Data Entry

Uploaded by

2147033

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

Assignment 7

In [1]: # Load pacakages

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
%matplotlib inline
import seaborn as sn

In [2]: #Load the csv file

marks = pd.read_csv('Marks.csv')

In [3]: # Understanding the data

marks.info()
<class 'pandas.core.frame.DataFrame'>
Rangeindex: 10 entries, 0 to 9
Data columns (total 4 columns):
# Column Non-Null Count Dtype
--------------
0 Roll no 10 non-null int64
1 Science 10 non-null int64
2 Accounting 10 non-null int64
3 Maths 10 non-null int64
dtypes: int64(4)
memory usage: 448.0 bytes

In [4]: # Scaling the data

from sklearn import preprocessing
from sklearn.preprocessing import StandardScaler
scale=StandardScaler()
scaled_marks =scale.fit_transform(marks[["Science","Accounting","Maths"]])
scaled_marks

Out[4]: array([[ 0.84964017, -0.14463921, -0.44396005],

[-0.20508556, 0.43391764, 0.89824474],
[ 0.08789381, 0.72319607, -0.75369961],
[ 0.79104429, -2.31422742, -0.96019266],
[ 0.3222773 , -0.43391764, -1.99265788],
[-1.6699824 , 0.28927843, 1.10473779],
[-1.37700303, -0.86783528, 1.41447736],
[-1.08402366, 1.59103135, 0.48525865],
[ 1.25981128, 0.57855685, -0.03097396],
[ 1.02542779, 0.14463921, 0.27876561]])

In [5]: # Creating a dataframe of the scaled data

scaledmarks = pd.DataFrame(scaled_marks, index =['1', '2', '3', '4', '5', '6', '7',
columns =['Science','Accounting','Ma1
scaledmarks

localhost:8889/nbconvert/html/Downloads/lDS/Chp 7 Clustering Marks dataset.ipynb?download=false 1/4

3/14/24, 2:13 PM Chp 7 Clustering Marks dataset
kmeans=kmeansclusters.fit_predict(scaledmarks)
kmeans

Out[10]: array([2, 0, 2, 1, 1, 0, 0, 0, 2, 2])

In [11]: # Scatter plot of the clusters

#Create seperate lists of each cluster
kmeans0 = scaledmarks[kmeans -- 0]
kmeansl = scaledmarks[kmeans -- 1]
kmeans2 = scaledmarks[kmeans -- 2]

#Scatter plot of each cluster

plt.scatter(kmeans0.iloc[:,0] , kmeans0.iloc[:,1], color='blue')
plt.scatter(kmeansl.iloc[:,0] , kmeansl.iloc[:,1], color='red')
plt.scatter(kmeans2.iloc[:,0] , kmeans2.iloc[:,1], color='green')
plt.show()

l'S •
• •
lO

• •
••
0.'5

0.0
•
•
-0.5

-1.0

-1.5

-2.0

-2.5 '-----.----,-------.----.-------.----.-----'
-1.5 -1.0 -0.5 0.0 0.5 HI

In [ ]:

4/4

Exp2 - Data Visualization and Cleaning and Feature Selection
No ratings yet
Exp2 - Data Visualization and Cleaning and Feature Selection
13 pages
Mlda - Lab
No ratings yet
Mlda - Lab
35 pages
SOLUTION ONLY CODE DWDM - Lab - All
No ratings yet
SOLUTION ONLY CODE DWDM - Lab - All
8 pages
DADV Exp-5
No ratings yet
DADV Exp-5
3 pages
Untitled Document
No ratings yet
Untitled Document
6 pages
Mini 4
No ratings yet
Mini 4
9 pages
Tugas Clustering - 132021012 - Kevin Gazkia Naufal
No ratings yet
Tugas Clustering - 132021012 - Kevin Gazkia Naufal
6 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
Vid 4
No ratings yet
Vid 4
6 pages
Python K-Means Clustering Guide
No ratings yet
Python K-Means Clustering Guide
6 pages
Elbow Method
No ratings yet
Elbow Method
2 pages
ML Lab Exam Document
No ratings yet
ML Lab Exam Document
14 pages
Practical 5
No ratings yet
Practical 5
6 pages
K-Means Clustering Guide
No ratings yet
K-Means Clustering Guide
26 pages
Exp. 1
No ratings yet
Exp. 1
4 pages
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
No ratings yet
Mall Customer Segmentation Using KMeans Clustering Algorithm and Classification Algorithm
40 pages
Lab Report 1
No ratings yet
Lab Report 1
6 pages
Experiment 3.1 K-Mean
No ratings yet
Experiment 3.1 K-Mean
8 pages
ML 2.3 Prashant
No ratings yet
ML 2.3 Prashant
4 pages
Week 01.a
No ratings yet
Week 01.a
4 pages
DA Programs
No ratings yet
DA Programs
44 pages
Python ML Algorithms Guide
No ratings yet
Python ML Algorithms Guide
7 pages
Lec 2 Unit 1
No ratings yet
Lec 2 Unit 1
89 pages
Parth ML
No ratings yet
Parth ML
24 pages
Data Pre Processing
No ratings yet
Data Pre Processing
2 pages
Kmeansclustering Sales Dataset
No ratings yet
Kmeansclustering Sales Dataset
6 pages
Program 7
No ratings yet
Program 7
3 pages
MLT 8 KK
No ratings yet
MLT 8 KK
2 pages
ML Lab
No ratings yet
ML Lab
29 pages
AAM 7th Prac
No ratings yet
AAM 7th Prac
4 pages
AdityaGaur BDA Exp8
No ratings yet
AdityaGaur BDA Exp8
4 pages
Aiml Lab
No ratings yet
Aiml Lab
37 pages
Data Science Practicals
No ratings yet
Data Science Practicals
47 pages
Ai 28-01-25
No ratings yet
Ai 28-01-25
18 pages
Unit 3 Unsupervised Learning
No ratings yet
Unit 3 Unsupervised Learning
9 pages
ML Lab
No ratings yet
ML Lab
20 pages
KMeans Clustering for Universities
No ratings yet
KMeans Clustering for Universities
9 pages
DVT Exp 3
No ratings yet
DVT Exp 3
1 page
Reading Data: #Importing Required Libraries
No ratings yet
Reading Data: #Importing Required Libraries
16 pages
Assignmnet 5
No ratings yet
Assignmnet 5
11 pages
Pattern Recognition Practicals
No ratings yet
Pattern Recognition Practicals
8 pages
ML Algorithms for Data Scientists
100% (1)
ML Algorithms for Data Scientists
148 pages
Data Preparation
No ratings yet
Data Preparation
11 pages
Dbscan Code Python
No ratings yet
Dbscan Code Python
1 page
Project Data Mining (AMAN YADAV)
No ratings yet
Project Data Mining (AMAN YADAV)
12 pages
BHMC17 P5.ipynb - Colaboratory
No ratings yet
BHMC17 P5.ipynb - Colaboratory
4 pages
Pa66 ML Exp6
No ratings yet
Pa66 ML Exp6
9 pages
7CSE A1 IU2141230116 Kevin Mevada-Practical8
No ratings yet
7CSE A1 IU2141230116 Kevin Mevada-Practical8
3 pages
DM ML Practical
No ratings yet
DM ML Practical
13 pages
Slip
No ratings yet
Slip
5 pages
Yogesh Siddiq Edited
No ratings yet
Yogesh Siddiq Edited
6 pages
Data Science
No ratings yet
Data Science
15 pages
Lecture Material 3
No ratings yet
Lecture Material 3
7 pages
DMA Flask
No ratings yet
DMA Flask
14 pages
Machine Learning Algorithms Guide
No ratings yet
Machine Learning Algorithms Guide
34 pages
Dav Lab Manual
No ratings yet
Dav Lab Manual
28 pages
Sales Data Clustering
No ratings yet
Sales Data Clustering
15 pages
DBMS Lab 18 SQL Constraints Combine
No ratings yet
DBMS Lab 18 SQL Constraints Combine
8 pages
MIC Chap 2 Notes
100% (2)
MIC Chap 2 Notes
9 pages
Vxworks Architecture Supplement 6.2
No ratings yet
Vxworks Architecture Supplement 6.2
252 pages
OS Lab Manual
No ratings yet
OS Lab Manual
37 pages
Build An Internet Infrastructure Final Exam
100% (4)
Build An Internet Infrastructure Final Exam
2 pages
Multilevel Indexing and B+ Trees
No ratings yet
Multilevel Indexing and B+ Trees
33 pages
8051 Programs With Opcode
100% (4)
8051 Programs With Opcode
80 pages
521 - The Future of OpenZFS and FreeBSD PDF
No ratings yet
521 - The Future of OpenZFS and FreeBSD PDF
27 pages
TELE9782 Latest Advances in Networking
No ratings yet
TELE9782 Latest Advances in Networking
2 pages
Unit 18 Computer Systems Hardware
No ratings yet
Unit 18 Computer Systems Hardware
11 pages
SS3 Data Processing Guide
No ratings yet
SS3 Data Processing Guide
27 pages
CD BSRpeb J2 W T96
No ratings yet
CD BSRpeb J2 W T96
2 pages
CF R16 Final Descriptive Suggestions
No ratings yet
CF R16 Final Descriptive Suggestions
2 pages
LMAX API Specification
100% (1)
LMAX API Specification
56 pages
6.2.3 Release Notes
No ratings yet
6.2.3 Release Notes
28 pages
Dimensional Modeling
100% (1)
Dimensional Modeling
12 pages
Product Selector Guide Storage: Description
No ratings yet
Product Selector Guide Storage: Description
2 pages
Class 10 DBMS
No ratings yet
Class 10 DBMS
41 pages
16 F 506
No ratings yet
16 F 506
22 pages
Java Script Objects
No ratings yet
Java Script Objects
24 pages
Data Communications Basics
No ratings yet
Data Communications Basics
7 pages
Database and SQL-2020
No ratings yet
Database and SQL-2020
19 pages
Release Strategy For Purchase Orders
100% (14)
Release Strategy For Purchase Orders
28 pages
Database Connections: From Processmaker
No ratings yet
Database Connections: From Processmaker
10 pages
1cp1 01 Que 20211116
No ratings yet
1cp1 01 Que 20211116
16 pages
Memory RAM & SSD Upgrades - Acer - Aspire Es1 - Aspire ES1-420
No ratings yet
Memory RAM & SSD Upgrades - Acer - Aspire Es1 - Aspire ES1-420
9 pages
Codingza DLL Inject
No ratings yet
Codingza DLL Inject
3 pages
Intro To MySQL
No ratings yet
Intro To MySQL
50 pages
Smart Traffic Management SRS
No ratings yet
Smart Traffic Management SRS
9 pages
White Paper: Business Analytics and The Data Complexity Matrix
No ratings yet
White Paper: Business Analytics and The Data Complexity Matrix
6 pages

Data Entry

Uploaded by

Data Entry

Uploaded by

Assignment 7

In [1]: # Load pacakages

In [2]: #Load the csv file

In [3]: # Understanding the data

In [4]: # Scaling the data

Out[4]: array([[ 0.84964017, -0.14463921, -0.44396005],

In [5]: # Creating a dataframe of the scaled data

localhost:8889/nbconvert/html/Downloads/lDS/Chp 7 Clustering Marks dataset.ipynb?download=false 1/4

Out[10]: array([2, 0, 2, 1, 1, 0, 0, 0, 2, 2])

In [11]: # Scatter plot of the clusters

#Scatter plot of each cluster

You might also like