ML Lab Manual

The document outlines various Python programming experiments focused on statistical analysis, machine learning, and data visualization. It includes implementations of central tendency measures, linear regression, decision trees, KNN, logistic regression, and K-Means clustering, utilizing libraries such as NumPy, Pandas, and Scikit-learn. Each section provides code examples and expected outputs for better understanding of the concepts.

Uploaded by

Sofia tarannum


NAME OF THE EXPERIMENT

1. Python program to compute Central Tendency Measures: Mean, Median, Mode; Measures of Dispersion: Variance, Standard Deviation
2. Study of Python Basic Libraries such as Statistics, Math, NumPy and SciPy
3. Study of Python Libraries for ML applications such as Pandas and Matplotlib
4. Python Program for Simple Linear Regression
5. Implementation of Multiple Linear Regression for House Price Prediction using sklearn
6. Implementation of Decision Tree using sklearn and its parameter tuning
7. Implementation of KNN using sklearn
8. Implementation of Logistic Regression using sklearn
9. Implementation of K-Means Clustering
10. Performance analysis of Classification Algorithms

Program 1: Python program to compute Central Tendency Measures: Mean, Median, Mode; Measures of Dispersion: Variance, Standard Deviation

import statistics as stats


def central_tendency_dispersion(data):
    # Central Tendency Measures
    mean = stats.mean(data)
    median = stats.median(data)
    try:
        # On Python 3.8+, mode() returns the first of several equally common
        # values instead of raising StatisticsError.
        mode = stats.mode(data)
    except stats.StatisticsError:
        mode = "No unique mode found"

    # Measures of Dispersion (sample variance and sample standard deviation)
    variance = stats.variance(data)
    std_dev = stats.stdev(data)

    # Display results
    print(f"Mean: {mean}")
    print(f"Median: {median}")
    print(f"Mode: {mode}")
    print(f"Variance: {variance}")
    print(f"Standard Deviation: {std_dev}")


# Example data
data = [10, 15, 14, 10, 15, 18, 20, 25, 30]

central_tendency_dispersion(data)

OUTPUT:

Mean: 17.444444444444443
Median: 15
Mode: 10
Variance: 44.52777777777778
Standard Deviation: 6.672913739722534
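
The example data in Program 1 has two values tied for most frequent (10 and 15), yet the output reports only 10. A minimal sketch, assuming Python 3.8 or later (where `statistics.multimode` is available), that lists every mode rather than only the first:

```python
import statistics as stats

data = [10, 15, 14, 10, 15, 18, 20, 25, 30]

# multimode() returns all values that share the highest frequency,
# in the order they first appear in the data.
modes = stats.multimode(data)
print("Modes:", modes)
```

For this data the result contains both 10 and 15.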

2. Study of Python Basic Libraries such as Statistics, Math, NumPy and SciPy

Python provides a wide range of basic libraries that are essential for various computational
tasks. These libraries offer functionality to handle statistical calculations, mathematical
operations, and scientific computing. Here is an overview:

Statistics Module

• Used for statistical computations such as mean, median, mode, and variance.
• Example:

import statistics

data = [1, 2, 2, 3, 4]

print("Mean:", statistics.mean(data))

print("Median:", statistics.median(data))

print("Mode:", statistics.mode(data))
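
The Statistics example above covers mean, median and mode but not variance. As a small sketch on the same list, the module provides both a sample variance (dividing by n - 1) and a population variance (dividing by n); which one applies depends on whether the data is a sample or the whole population:

```python
import statistics

data = [1, 2, 2, 3, 4]

# Sample variance: sum of squared deviations divided by n - 1
sample_var = statistics.variance(data)

# Population variance: sum of squared deviations divided by n
population_var = statistics.pvariance(data)

print("Sample variance:", sample_var)
print("Population variance:", population_var)
```

`statistics.stdev` and `statistics.pstdev` are the corresponding square roots.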

Math Module

• Provides mathematical functions such as trigonometric calculations, logarithms, factorials, and more.
• Example:

import math

print("Square root of 16:", math.sqrt(16))

print("Factorial of 5:", math.factorial(5))

print("Cosine of 45 degrees:", math.cos(math.radians(45)))

NumPy Library

• Widely used for numerical computations with arrays, matrices, and linear algebra functions.
• Example:

import numpy as np

array = np.array([1, 2, 3, 4, 5])

print("Mean of array:", np.mean(array))

print("Sum of array:", np.sum(array))
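
The NumPy example above shows array reductions only. A short sketch of the linear algebra side mentioned in the description, using a small system of equations chosen purely for illustration:

```python
import numpy as np

# Solve the linear system  2x + y = 3,  x + 3y = 5
A = np.array([[2.0, 1.0], [1.0, 3.0]])
b = np.array([3.0, 5.0])

solution = np.linalg.solve(A, b)
print("Solution:", solution)

# Check the result by multiplying back, and show the determinant of A
print("A @ solution:", A @ solution)
print("det(A):", np.linalg.det(A))
```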

SciPy Library

• Built on NumPy, it provides additional functionality for optimization, integration, and other scientific computations.
• Example:

from scipy import integrate

# Define a function to integrate

result, _ = integrate.quad(lambda x: x**2, 0, 1)

print("Integral of x^2 from 0 to 1:", result)
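
The SciPy example above covers integration. For the optimization part of the description, a minimal sketch using `scipy.optimize.minimize_scalar` on a one-variable function (the function `f` is an arbitrary illustration, not part of the manual):

```python
from scipy import optimize


def f(x):
    # A parabola whose minimum value 1 is reached at x = 2
    return (x - 2) ** 2 + 1


result = optimize.minimize_scalar(f)
print("Minimizer:", result.x)
print("Minimum value:", result.fun)
```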

3. Study of Python Libraries for ML applications such as Pandas and Matplotlib

For machine learning and data analysis, Python libraries like Pandas and Matplotlib are
essential for data manipulation and visualization.

Pandas

• Provides data structures like Series and DataFrame for handling and analyzing data efficiently.
• Example:

import pandas as pd

data = {'Name': ['Alice', 'Bob', 'Charlie'], 'Age': [25, 30, 35]}

df = pd.DataFrame(data)

print(df)

print("Mean Age:", df['Age'].mean())
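
The Pandas example above builds a DataFrame only. A brief sketch of the other structure named in the description, a Series, together with row filtering on a DataFrame; the names and ages are the same illustrative values used above:

```python
import pandas as pd

# A Series is a one-dimensional labeled array.
ages = pd.Series([25, 30, 35], index=['Alice', 'Bob', 'Charlie'], name='Age')
print(ages['Bob'])

# A DataFrame is a table of columns; a boolean mask selects rows.
df = pd.DataFrame({'Name': ['Alice', 'Bob', 'Charlie'], 'Age': [25, 30, 35]})
older = df[df['Age'] > 28]
print(older)
```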

Matplotlib

• A visualization library used for creating static, interactive, and animated plots.
• Example:

import matplotlib.pyplot as plt

x = [1, 2, 3, 4, 5]

y = [10, 20, 25, 30, 35]

plt.plot(x, y, marker='o', linestyle='--', color='r')

plt.title("Sample Line Plot")

plt.xlabel("X-axis")

plt.ylabel("Y-axis")

plt.show()

Program 4: Python Program for Simple Linear Regression

import numpy as np

import matplotlib.pyplot as plt

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Generate some example data

np.random.seed(0)

X = 2 * np.random.rand(100, 1)

y = 4 + 3 * X + np.random.randn(100, 1)

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)

# Create and train the model

model = LinearRegression()

model.fit(X_train, y_train)

# Make predictions

y_pred = model.predict(X_test)

# Evaluate the model

mse = mean_squared_error(y_test, y_pred)

r2 = r2_score(y_test, y_pred)

print(f"Mean Squared Error: {mse:.2f}")

print(f"R-squared: {r2:.2f}")

# Plotting the results

plt.scatter(X_test, y_test, color="black", label="Actual data")

plt.plot(X_test, y_pred, color="blue", linewidth=2, label="Fitted line")

plt.xlabel("X")

plt.ylabel("y")

plt.title("Simple Linear Regression")

plt.legend()

plt.show()

OUTPUT: (printed metrics and plot not reproduced)

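
`LinearRegression` fits an ordinary least squares line, which for a single feature has a closed form: the slope is the covariance of x and y divided by the variance of x, and the intercept follows from the means. A minimal sketch, assuming data generated the same way as in Program 4 but fitted on all 100 points rather than on the training split, so the numbers will differ slightly from the sklearn model:

```python
import numpy as np

# Synthetic data generated the same way as in Program 4 (1-D arrays here)
np.random.seed(0)
x = 2 * np.random.rand(100)
y = 4 + 3 * x + np.random.randn(100)

# Closed-form least squares estimates for one feature
x_mean, y_mean = x.mean(), y.mean()
slope = np.sum((x - x_mean) * (y - y_mean)) / np.sum((x - x_mean) ** 2)
intercept = y_mean - slope * x_mean

print(f"slope: {slope:.3f}, intercept: {intercept:.3f}")
```

The estimates should land near the values 3 and 4 used to generate the data, with the gap coming from the added noise.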
Program 5: Implementation of Multiple Linear Regression for House Price Prediction using sklearn

import numpy as np

import pandas as pd

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LinearRegression

from sklearn.metrics import mean_squared_error, r2_score

# Load the dataset

data = pd.read_csv('house_prices.csv')

# Display the first few rows of the dataset

print(data.head())

# Selecting features and target variable

X = data[['Size', 'Bedrooms', 'Age']]

y = data['Price']

# Handling missing data

X = X.fillna(X.mean())

y = y.fillna(y.mean())

# Splitting the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
random_state=42)

# Creating and training the model

model = LinearRegression()

model.fit(X_train, y_train)

# Making predictions on the testing set

y_pred = model.predict(X_test)

# Evaluating the model's performance

mse = mean_squared_error(y_test, y_pred)

r2 = r2_score(y_test, y_pred)

print(f'Mean Squared Error: {mse}')

print(f'R-squared: {r2}')

# Model coefficients

print("Intercept:", model.intercept_)

print("Coefficients:", model.coef_)

coefficients = pd.DataFrame(model.coef_, X.columns, columns=['Coefficient'])

print(coefficients)
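
Program 5 reads `house_prices.csv`, which is not included with this manual. A minimal sketch of the same least squares fit on invented, noise-free data with the same three feature names, so that it runs without the file; every number below is hypothetical, and `np.linalg.lstsq` is used here in place of sklearn:

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50

# Hypothetical features (values invented purely for illustration)
size = rng.uniform(50, 250, n)
bedrooms = rng.integers(1, 6, n).astype(float)
age = rng.uniform(0, 40, n)

# Hypothetical "true" relationship, with no noise added
true_intercept = 50000.0
true_coef = np.array([1200.0, 8000.0, -500.0])
X = np.column_stack([size, bedrooms, age])
price = true_intercept + X @ true_coef

# Prepend a column of ones so the intercept is estimated with the coefficients
X_design = np.column_stack([np.ones(n), X])
beta, *_ = np.linalg.lstsq(X_design, price, rcond=None)

print("Intercept:", beta[0])
print("Coefficients (Size, Bedrooms, Age):", beta[1:])
```

Because the data contains no noise, the fit recovers the chosen intercept and coefficients up to floating-point error; real data would not behave this cleanly.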

6. Implementation of Decision Tree using sklearn and its parameter tuning

# Importing necessary libraries

import numpy as np

import pandas as pd

from sklearn.model_selection import train_test_split, GridSearchCV

from sklearn.tree import DecisionTreeClassifier

from sklearn.metrics import accuracy_score, classification_report

from sklearn.datasets import load_iris

# Load dataset (for example, the Iris dataset)

data = load_iris()

X = data.data

y = data.target

# Split dataset into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,

random_state=42)

# Initialize a basic DecisionTreeClassifier

clf = DecisionTreeClassifier(random_state=42)

# Fit the model with the training data

clf.fit(X_train, y_train)

# Predict on the test set

y_pred = clf.predict(X_test)

# Evaluate model performance

print("Accuracy without tuning: ", accuracy_score(y_test, y_pred))

print("Classification Report:\n", classification_report(y_test, y_pred))

# Parameter tuning using GridSearchCV

param_grid = {
    'criterion': ['gini', 'entropy'],     # Different criteria for splitting
    'splitter': ['best', 'random'],       # Split strategy
    'max_depth': [None, 10, 20, 30],      # Depth of tree
    'min_samples_split': [2, 5, 10],      # Minimum number of samples to split a node
    'min_samples_leaf': [1, 2, 4],        # Minimum number of samples to be at a leaf node
    # Number of features to consider for the best split
    # ('auto' is omitted: recent scikit-learn releases no longer accept it)
    'max_features': [None, 'sqrt', 'log2'],
}

# Using GridSearchCV for parameter tuning
grid_search = GridSearchCV(estimator=clf, param_grid=param_grid, cv=5,
                           n_jobs=-1, verbose=1)

# Fit GridSearchCV

grid_search.fit(X_train, y_train)

# Best parameters from GridSearchCV

print("Best Parameters: ", grid_search.best_params_)

# Predict with the best estimator from grid search

best_clf = grid_search.best_estimator_

y_pred_best = best_clf.predict(X_test)

# Evaluate performance with the tuned model

print("Accuracy with tuning: ", accuracy_score(y_test, y_pred_best))

print("Classification Report:\n", classification_report(y_test, y_pred_best))

OUTPUT:

Accuracy with tuning: 1.0

Classification Report:
               precision    recall  f1-score   support

           0       1.00      1.00      1.00        10
           1       1.00      1.00      1.00         9
           2       1.00      1.00      1.00        11

    accuracy                           1.00        30
   macro avg       1.00      1.00      1.00        30
weighted avg       1.00      1.00      1.00        30
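
The `criterion` values searched above, 'gini' and 'entropy', are two ways of scoring how mixed the class labels in a node are. A minimal sketch of both measures, assuming labels are non-negative integers; the helper names `gini` and `entropy` are illustrative and not part of sklearn:

```python
import numpy as np


def gini(labels):
    """Gini impurity: 1 - sum(p_k^2) over the class proportions p_k."""
    counts = np.bincount(labels)
    p = counts[counts > 0] / len(labels)
    return 1.0 - np.sum(p ** 2)


def entropy(labels):
    """Shannon entropy in bits: -sum(p_k * log2(p_k))."""
    counts = np.bincount(labels)
    p = counts[counts > 0] / len(labels)
    return -np.sum(p * np.log2(p))


mixed = np.array([0, 0, 1, 1])   # two classes in equal proportion
pure = np.array([1, 1, 1, 1])    # a single class

print("mixed:", gini(mixed), entropy(mixed))
print("pure: ", gini(pure), entropy(pure))
```

Both measures are zero for a pure node and largest when the classes are evenly mixed, which is why a split that lowers them is preferred.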

7. Implementation of KNN using sklearn

# Import necessary libraries

from sklearn.datasets import load_iris

from sklearn.model_selection import train_test_split

from sklearn.neighbors import KNeighborsClassifier

from sklearn.metrics import accuracy_score

# Load the dataset (Iris dataset)

iris = load_iris()

X = iris.data # Features

y = iris.target # Target labels

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,

random_state=42)

# Create the KNN model with k=3

knn = KNeighborsClassifier(n_neighbors=3)

# Train the model

knn.fit(X_train, y_train)

# Make predictions

y_pred = knn.predict(X_test)

# Evaluate the model's performance

accuracy = accuracy_score(y_test, y_pred)

print(f"Accuracy: {accuracy * 100:.2f}%")

OUTPUT:

Accuracy: 100.00%
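
To make the idea behind `KNeighborsClassifier` concrete, here is a from-scratch sketch of a k-nearest-neighbours prediction with Euclidean distance and a plain majority vote. The function `knn_predict` and the toy points are hypothetical; this illustrates the technique and is not how sklearn implements it internally:

```python
import numpy as np


def knn_predict(X_train, y_train, x_query, k=3):
    # Euclidean distance from the query point to every training point
    distances = np.linalg.norm(X_train - x_query, axis=1)
    # Indices of the k closest training points
    nearest = np.argsort(distances)[:k]
    # Majority vote among their labels (ties go to the smaller label)
    return np.bincount(y_train[nearest]).argmax()


# Toy data: two well-separated groups
X_train = np.array([[1, 1], [1, 2], [2, 1], [6, 6], [6, 7], [7, 6]], dtype=float)
y_train = np.array([0, 0, 0, 1, 1, 1])

print(knn_predict(X_train, y_train, np.array([1.5, 1.5])))
print(knn_predict(X_train, y_train, np.array([6.5, 6.5])))
```

The first query lies among the class 0 points and the second among the class 1 points, so the votes are unanimous in both cases.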

8. Implementation of Logistic Regression using sklearn

# Import necessary libraries

import numpy as np

from sklearn.model_selection import train_test_split

from sklearn.linear_model import LogisticRegression

from sklearn.metrics import accuracy_score, confusion_matrix, classification_report

from sklearn.datasets import load_iris

# Load a sample dataset

# Here, we're using the Iris dataset for simplicity.

# We'll use only two classes (binary classification) for logistic regression.

iris = load_iris()

X = iris.data

y = iris.target

# For binary classification, we'll select only two classes (e.g., class 0 and 1)

X = X[y != 2] # Select only class 0 and 1

y = y[y != 2] # Select only class 0 and 1

# Split the data into training and testing sets

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3,

random_state=42)

# Create a Logistic Regression model

log_reg = LogisticRegression()

# Train the model

log_reg.fit(X_train, y_train)

# Make predictions on the test set

y_pred = log_reg.predict(X_test)

# Evaluate the model

accuracy = accuracy_score(y_test, y_pred)

conf_matrix = confusion_matrix(y_test, y_pred)

class_report = classification_report(y_test, y_pred)

# Print the results

print("Accuracy:", accuracy)

print("\nConfusion Matrix:\n", conf_matrix)

print("\nClassification Report:\n", class_report)

OUTPUT:

Accuracy: 1.0

Confusion Matrix:
 [[17  0]
 [ 0 13]]

Classification Report:
               precision    recall  f1-score   support

           0       1.00      1.00      1.00        17
           1       1.00      1.00      1.00        13

    accuracy                           1.00        30
   macro avg       1.00      1.00      1.00        30
weighted avg       1.00      1.00      1.00        30
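
A binary logistic regression model turns a linear score into a probability with the sigmoid function and predicts class 1 when that probability is at least 0.5. A minimal sketch of this decision rule; the weights `w` and bias `b` are hypothetical values chosen for illustration, not the ones learned by the model above:

```python
import numpy as np


def sigmoid(z):
    # Maps any real number to the interval (0, 1)
    return 1.0 / (1.0 + np.exp(-z))


# Hypothetical weights and bias (not taken from the fitted model above)
w = np.array([1.0, -2.0])
b = 0.5

X = np.array([[2.0, 0.0], [0.0, 2.0]])
probabilities = sigmoid(X @ w + b)               # P(class 1) for each row
predictions = (probabilities >= 0.5).astype(int)

print(probabilities)
print(predictions)
```

The first row has a positive score and is assigned class 1; the second has a negative score and is assigned class 0.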

9. Implementation of K-Means Clustering

import numpy as np

from sklearn.cluster import KMeans

from sklearn.datasets import make_blobs

import matplotlib.pyplot as plt

# Generate synthetic data with 4 clusters

X, y_true = make_blobs(n_samples=300, centers=4, cluster_std=0.60,

random_state=0)

# Create a KMeans model with the number of clusters set to 4

kmeans = KMeans(n_clusters=4, random_state=0)

# Fit the model to the data

kmeans.fit(X)

# Predict the cluster labels for each data point

y_kmeans = kmeans.predict(X)

# Plotting the clusters and their centroids

plt.scatter(X[:, 0], X[:, 1], c=y_kmeans, s=50, cmap='viridis')

# Marking the centroids

centers = kmeans.cluster_centers_

plt.scatter(centers[:, 0], centers[:, 1], c='red', s=200, alpha=0.75, marker='X')

plt.title("K-Means Clustering")

plt.xlabel("Feature 1")

plt.ylabel("Feature 2")

plt.show()

OUTPUT: (plot not reproduced)
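
K-Means alternates two steps until the centroids stop moving: assign each point to its nearest centroid, then move each centroid to the mean of its assigned points. A minimal NumPy sketch of that loop on toy data, assuming the initial centroids are simply two of the data points and that no cluster ever becomes empty; sklearn's `KMeans` uses a more careful initialization:

```python
import numpy as np

points = np.array([[0, 0], [0, 1], [1, 0],
                   [10, 10], [10, 11], [11, 10]], dtype=float)

# Assumption: start from two of the data points as initial centroids
centroids = points[[0, 3]].copy()

for _ in range(10):
    # Assignment step: index of the nearest centroid for every point
    distances = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    labels = distances.argmin(axis=1)
    # Update step: each centroid moves to the mean of its assigned points
    new_centroids = np.array([points[labels == j].mean(axis=0)
                              for j in range(len(centroids))])
    if np.allclose(new_centroids, centroids):
        break
    centroids = new_centroids

print(labels)
print(centroids)
```

On this data the loop settles after two passes, with each centroid at the mean of one group of three points.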
