0% found this document useful (0 votes)

60 views9 pages

KNN Age Prediction for Asian Dataset

The document describes the steps to create KNN age prediction models for Asian datasets based on bone length measurements. It involves importing libraries like pandas and sklearn, loading CSV datasets, splitting data into training and testing sets, defining input and target variables, training a KNN classifier on the training set and predicting ages on the testing set. Model performance is evaluated using mean squared error. Advantages of the train-test-split library include less time consumed and easier implementation while doing it manually allows consistent results but takes more time.

Uploaded by

ARINA SYAKIRAH MUHAIYUDDIN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

60 views9 pages

KNN Age Prediction for Asian Dataset

Uploaded by

ARINA SYAKIRAH MUHAIYUDDIN

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

KNN Age Prediction Model for Asian Dataset Based on 19 lengths of left-hand bones

Male Dataset
Using Library Train_Test_Split

Step 1:
Open anaconda navigator and launch Jupyter Notebook
In the csv file, put =RANDBETWEEN(x,y) with x and y is 1 until the last number
Sort and split into two for training data and testing data
Save the datasets in the same folder as your coding

Step 2:
Import library pandas in the coding.
This library is used to read the CSV file from the same folder
Load the dataset into the coding

import pandas as pd
f_train = pd.read_csv('C:/Users/ariny/OneDrive/Documents/female_train.csv')
f_test = pd.read_csv('C:/Users/ariny/OneDrive/Documents/female_test.csv')

Step 3:
Divide the datasets into input and target variables
Drop unrelated columns in input training and target training

from sklearn.model_selection import train_test_split

input_training = f_train.drop(['No', 'Race', 'Gender', 'DOB', 'Exam Date', 'Tanner', 'Weight(kg)',
'Height(cm)', 'Trunk HT (cm)', 'ChrAge'], axis=1)
target_training = f_train['ChrAge']
input_testing = f_test.drop(['No', 'Race', 'Gender', 'DOB', 'Exam Date', 'Tanner', 'Weight(kg)',
'Height(cm)', 'Trunk HT (cm)', 'ChrAge'], axis=1)
target_testing = f_test['ChrAge']

Step 4:
Import library KNN from sklearn.neighbors
Use Classifier to determine false or true
Change n_neighbors according to number of nearest neighbors
Train the model using knn.fit and predict using knn.predict

from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors=1)

knn.fit(input_training, target_training)
y_pred = knn.predict(input_testing)

Step 5:
Import metrics from sklearn to calculate mean squared error

from sklearn import metrics

mse = metrics.mean_squared_error(target_testing, y_pred)
print("Mean Squared Error:", mse)

Output:
Using Manually Random Ordered

Step 2:
Import library pandas in the coding.
This library is used to read the CSV file from the same folder
Load the dataset into the coding

import pandas as pd
m_train = pd.read_csv('C:/Users/ariny/OneDrive/Documents/male_train.csv')
m_test = pd.read_csv('C:/Users/ariny/OneDrive/Documents/male_test.csv')

Step 3:
Divide the datasets into input and target variables
Drop unrelated columns in input training and target training

from sklearn.model_selection import train_test_split

input_training = m_train.drop(['Bil', 'Race', 'Gender', 'DOB', 'Exam Date', 'Tanner', 'Weightkg',
'Heightcm', 'Trunk HTcm', 'ChrAge'], axis=1)
target_training = m_train['ChrAge']
input_testing = m_test.drop(['Bil', 'Race', 'Gender', 'DOB', 'Exam Date', 'Tanner', 'Weightkg',
'Heightcm', 'Trunk HTcm', 'ChrAge'], axis=1)
target_testing = m_test['ChrAge']

from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors=1)

knn.fit(input_training, target_training)
y_pred = knn.predict(input_testing)

Step 5:
Import metrics from sklearn to calculate mean squared error
from sklearn import metrics
mse = metrics.mean_squared_error(target_testing, y_pred)
print("Mean Squared Error:", mse)

Output:

Jupyter screenshot:
Female Dataset
Using Library Train_Test_Split

Step 1:
Open anaconda navigator and launch Jupyter Notebook
Save the xray_image_dataset_female.csv dataset in the same folder as your coding
Create a new phyton file and start doing the coding

Step 2:
Import library pandas in the coding.
This library is used to read the CSV file from the same folder
Load the dataset into the coding

import pandas as pd
f_data = pd.read_csv('C:/Users/ariny/OneDrive/Documents/xray_image_dataset_female.csv')
f_data

Step 3:
Split the dataset into training and testing datasets using the function train_test_split.
This function is used to select training data and testing data randomly.
Import function train_test_split using library sklearn.model_selection
Drop unrelated columns from the dataset excel
Set y as the prediction age
Test size is 0.3 because 70% for training and 30% for testing

from sklearn.model_selection import train_test_split

X = f_data.drop(['No', 'Race', 'Gender', 'DOB', 'Exam Date', 'Tanner', 'Weight(kg)', 'Height(cm)',
'Trunk HT (cm)', 'ChrAge'], axis=1)
y = f_data['ChrAge']
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3)

from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors=1)

knn.fit(X_train, y_train)
y_pred = knn.predict(X_test)
Step 5:
Import metrics to calculate accuracy from sklearn
Calculate mean squared error for the dataset

from sklearn import metrics

mse = metrics.mean_squared_error(y_test, y_pred)
print("Mean Squared Error:", mse)

Output:
Using Manually Random Ordered

Step 2:
Import library pandas in the coding.
This library is used to read the CSV file from the same folder
Load the dataset into the coding

import pandas as pd
f_train = pd.read_csv('C:/Users/ariny/OneDrive/Documents/female_train.csv')
f_test = pd.read_csv('C:/Users/ariny/OneDrive/Documents/female_test.csv')

Step 3:
Divide the datasets into input and target variables
Drop unrelated columns in input training and target training

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

knn = KNeighborsClassifier(n_neighbors=1)

knn.fit(input_training, target_training)
y_pred = knn.predict(input_testing)

Step 5:
Import metrics from sklearn to calculate mean squared error
from sklearn import metrics
mse = metrics.mean_squared_error(target_testing, y_pred)
print("Mean Squared Error:", mse)

Output:

Jupyter screenshot:
Advantages and disadvantages

Library Train_Test_Split

Advantages Disadvantages

Less time consumed Inconsistent result

Easier to implement and code because the Re-train and re-test dataset every time the
library already existed code executed

Manually Random Ordered

Advantages Disadvantages

Consistent result Time consuming

Can custom split based on specific conditions Model performance can be biased

Decision Tree
No ratings yet
Decision Tree
10 pages
Atul MLT Exp 4-11
No ratings yet
Atul MLT Exp 4-11
17 pages
Data Mining Assignment No. 1
No ratings yet
Data Mining Assignment No. 1
7 pages
Aiml Ex 4-7
No ratings yet
Aiml Ex 4-7
8 pages
Iii Aid - ML
No ratings yet
Iii Aid - ML
30 pages
Task 2
No ratings yet
Task 2
4 pages
ML External Xerox
No ratings yet
ML External Xerox
1 page
ML Practical 205160694034
No ratings yet
ML Practical 205160694034
33 pages
Naive Bayes
No ratings yet
Naive Bayes
5 pages
AI Assignment-6
No ratings yet
AI Assignment-6
7 pages
MlLabManualdocx 2024 09 04 22 02 58
No ratings yet
MlLabManualdocx 2024 09 04 22 02 58
19 pages
Titanic Data Analysis & Modeling
No ratings yet
Titanic Data Analysis & Modeling
11 pages
ML Lab 6-9
No ratings yet
ML Lab 6-9
15 pages
Prakhar - Week 5
No ratings yet
Prakhar - Week 5
8 pages
Real-Time Calorie Burn Prediction
No ratings yet
Real-Time Calorie Burn Prediction
27 pages
Record
No ratings yet
Record
22 pages
LAB-4 Report
No ratings yet
LAB-4 Report
21 pages
DM Final
No ratings yet
DM Final
79 pages
DSBDA Practicals
No ratings yet
DSBDA Practicals
16 pages
ML Lab Programs PDF
No ratings yet
ML Lab Programs PDF
15 pages
ML Lab: Healthcare Data Analysis
No ratings yet
ML Lab: Healthcare Data Analysis
16 pages
AI ML - Cycle 2 Programs
No ratings yet
AI ML - Cycle 2 Programs
15 pages
Practical 4
No ratings yet
Practical 4
2 pages
Aiml Programs
No ratings yet
Aiml Programs
12 pages
P 7
No ratings yet
P 7
5 pages
Exp 5
No ratings yet
Exp 5
4 pages
1st PGM
No ratings yet
1st PGM
10 pages
Machine Learning Evaluation Guide
100% (1)
Machine Learning Evaluation Guide
504 pages
Pattern Recognition
No ratings yet
Pattern Recognition
26 pages
ML LAb Task
No ratings yet
ML LAb Task
4 pages
Final
No ratings yet
Final
13 pages
Data Science Code Implementations
No ratings yet
Data Science Code Implementations
274 pages
Machine Learning Lab New
No ratings yet
Machine Learning Lab New
14 pages
AI and ML Lab Manual
No ratings yet
AI and ML Lab Manual
29 pages
ML Lab Programs For Exam
No ratings yet
ML Lab Programs For Exam
10 pages
ML Manual
No ratings yet
ML Manual
53 pages
IEEE Conference Team ATOM
No ratings yet
IEEE Conference Team ATOM
5 pages
ML Complete Notes Hridoy
No ratings yet
ML Complete Notes Hridoy
5 pages
Titanic Akshaya
No ratings yet
Titanic Akshaya
12 pages
1 10
No ratings yet
1 10
4 pages
Aml Lab
No ratings yet
Aml Lab
6 pages
Untitled32.Ipynb - Colab
No ratings yet
Untitled32.Ipynb - Colab
1 page
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
No ratings yet
VND - Openxmlformats Officedocument - Wordprocessingml.document&rendition 1
24 pages
Diabetes Prediction with KNN
No ratings yet
Diabetes Prediction with KNN
2 pages
Decision Tree Classification Guide
No ratings yet
Decision Tree Classification Guide
4 pages
Cardiovascular Disease Prediction
No ratings yet
Cardiovascular Disease Prediction
2 pages
Advance Machine Learning
No ratings yet
Advance Machine Learning
28 pages
Bacdeaf 23032025 115708 Split 1
No ratings yet
Bacdeaf 23032025 115708 Split 1
37 pages
KNN Model
No ratings yet
KNN Model
5 pages
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
No ratings yet
Import As From Import From Import From Import From Import From Import From Import From Import From Import From Import From Import Import As
8 pages
Medical Data ML
No ratings yet
Medical Data ML
6 pages
Python ML Algorithms Guide
No ratings yet
Python ML Algorithms Guide
7 pages
Machine Learning With Titanic Dataset Tutorial
No ratings yet
Machine Learning With Titanic Dataset Tutorial
7 pages
Machine Learning Lab Manual
No ratings yet
Machine Learning Lab Manual
44 pages
Basic ML Algo
No ratings yet
Basic ML Algo
10 pages
Data Mining Lab Manual CSE VII Sem
No ratings yet
Data Mining Lab Manual CSE VII Sem
63 pages
Ex 3
No ratings yet
Ex 3
5 pages
ML Experiments
No ratings yet
ML Experiments
22 pages
ML Lab Programs
No ratings yet
ML Lab Programs
9 pages
ELC650
No ratings yet
ELC650
1 page
Stack and Queue Operations Guide
No ratings yet
Stack and Queue Operations Guide
25 pages
Aladdin's Lessons on Friendship
No ratings yet
Aladdin's Lessons on Friendship
13 pages
Assessment 2
No ratings yet
Assessment 2
6 pages
508 Group Presentation
No ratings yet
508 Group Presentation
11 pages
Gauhati Univ Exam Form Fill-Up 2024
No ratings yet
Gauhati Univ Exam Form Fill-Up 2024
2 pages
Installation
No ratings yet
Installation
13 pages
MT6735 Android Scatter
No ratings yet
MT6735 Android Scatter
7 pages
Unix Case Study
88% (8)
Unix Case Study
5 pages
Automotive TCXO Specifications
No ratings yet
Automotive TCXO Specifications
1 page
DEEPTECH 2M Schedule
No ratings yet
DEEPTECH 2M Schedule
8 pages
ALY (R-410A) Series
No ratings yet
ALY (R-410A) Series
41 pages
Java Interview Guide - 200+ Interview Questions and Answers (Video)
No ratings yet
Java Interview Guide - 200+ Interview Questions and Answers (Video)
5 pages
Brain Music System Assisted Living Update
No ratings yet
Brain Music System Assisted Living Update
24 pages
Engine Parts Catalogue 2012
100% (1)
Engine Parts Catalogue 2012
49 pages
Saudi Arabia Mining Project Overview
No ratings yet
Saudi Arabia Mining Project Overview
11 pages
11.4.2.7 Lab - Managing Device Configuration Files Using TFTP, Flash, and USB
0% (1)
11.4.2.7 Lab - Managing Device Configuration Files Using TFTP, Flash, and USB
14 pages
Pci 8000 (MKT-0430)
No ratings yet
Pci 8000 (MKT-0430)
3 pages
7430 - Sigmacover 630
No ratings yet
7430 - Sigmacover 630
0 pages
Cphs4 2 Generic-69969
No ratings yet
Cphs4 2 Generic-69969
31 pages
Mysql High Availability Solutions
No ratings yet
Mysql High Availability Solutions
41 pages
60+vip+xtream+codes+2025 01 01-1
No ratings yet
60+vip+xtream+codes+2025 01 01-1
9 pages
Perforating Techniques Explained
100% (2)
Perforating Techniques Explained
65 pages
Application of Pentron Prod
No ratings yet
Application of Pentron Prod
3 pages
Solar Batteries & Panels Nigeria 2024
No ratings yet
Solar Batteries & Panels Nigeria 2024
1 page
Engineering Fluid Dynamics Study
No ratings yet
Engineering Fluid Dynamics Study
10 pages
Cornell VC Directory
No ratings yet
Cornell VC Directory
186 pages
SQL Checkboxes in Apex Report
No ratings yet
SQL Checkboxes in Apex Report
3 pages
IVRCL Job Application Form
No ratings yet
IVRCL Job Application Form
5 pages
Environmental Management Systems
No ratings yet
Environmental Management Systems
33 pages
Ieee Pes Project: Solar Power Inverter
No ratings yet
Ieee Pes Project: Solar Power Inverter
7 pages
Convert Source Data For Pivot Table
No ratings yet
Convert Source Data For Pivot Table
7 pages
Biogas Brochure Uniflip
No ratings yet
Biogas Brochure Uniflip
8 pages
Srinivas Resume
No ratings yet
Srinivas Resume
2 pages
SOAP Tutorial
100% (2)
SOAP Tutorial
23 pages

KNN Age Prediction for Asian Dataset

Uploaded by

KNN Age Prediction for Asian Dataset

Uploaded by

KNN Age Prediction Model for Asian Dataset Based on 19 lengths of left-hand bones

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

from sklearn import metrics

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

from sklearn import metrics

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

Less time consumed Inconsistent result

Manually Random Ordered

Consistent result Time consuming

You might also like