0% found this document useful (0 votes)

29 views11 pages

Week 8

The document outlines a week 8 course on anomaly detection for predictive maintenance in mechatronics engineering, focusing on unsupervised learning methods like Isolation Forest and Autoencoders. It includes steps for data preprocessing, visualization, model training, and performance evaluation using metrics such as precision and recall. Key skills developed include understanding predictive maintenance, applying machine learning techniques, and visualizing data trends.

Uploaded by

sakshijariwala2712

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views11 pages

Week 8

Uploaded by

sakshijariwala2712

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 11

Week 8: Introduction to Artificial intelligence for

the Mechatronics Engineering

Title: Anomaly Detection for Predictive Maintenance

Objective:
1. Understand the concept of anomaly detection and its application in predictive
maintenance.
2. Implement unsupervised learning methods to detect anomalies in sensor
data.
3. Apply Isolation Forest and Autoencoder neural networks for anomaly
detection.
4. Visualize normal and anomalous sensor readings using Matplotlib.
5. Evaluate model performance using precision, recall, and F1-score.

Key Skills:

 Understanding predictive maintenance and its role in industrial automation

 Applying machine learning for anomaly detection in sensor data.
 Using Isolation Forest and Autoencoders for anomaly detection.
 Evaluating model performance using classification metrics.
 Visualizing sensor data trends and anomalies using Matplotlib and Seaborn.

Problem Statement:

 A factory floor has multiple vibration sensors installed on different machines.

 These sensors monitor the health of rotating machinery by measuring vibration
levels over time.
 Normal machines show vibration levels between 5 and 15 mm/s.
 Faulty machines exhibit outliers (anomalies) with higher vibration levels.
Instructions

Step 1: Import Required Libraries (5 minutes)

 Start by importing the necessary Python libraries.

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
from sklearn.ensemble import IsolationForest
from sklearn.preprocessing import StandardScaler
from sklearn.metrics import classification_report
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense
from tensorflow.keras.optimizers import Adam

Step 2: Load and Preprocess the Dataset (15 minutes)

 Generate Sample Sensor Data

 We will simulate a dataset representing normal and faulty machine conditions.

# Set random seed for reproducibility

np. random.seed (42)

# Generate normal sensor readings (500 samples)

normal_data = np.random.normal (loc=10, scale=2, size=500) #Mean = 10

# Generate abnormal sensor readings (20 samples)

anomaly_data = np.random.normal(loc=25, scale=5, size=20) #Mean = 25

# Combine data into a single dataset

vibration_data = np.concatenate((normal_data, anomaly_data))

# Create labels (0=Normal, 1=Anomaly)

labels =np.concatenate((np.zeros(len(normal_data)),np.ones(len (anoma

# Convert to DataFrame
df = pd.DataFrame({'Vibration': vibration_data, 'Label': labels})
# Shuffle the dataset
df = df.sample(frac=1, random_state=42).reset_index(drop=True)

# Display dataset summary

print(df.head())

Step 3: Visualizing the Data (10 minutes)

# Plot histogram of vibration levels

plt.figure(figsize=(8, 5))
sns.histplot(df['Vibration'], bins=30, kde=True, hue=df ['Label'], pale*
plt.xlabel("Vibration Level (mm/s)"')
plt.ylabel ("Frequency")
plt.title("Distribution of Sensor Readings")
plt. legend ( ["Normal", "Anomaly"])
plt. show()

Step 4: Anomaly Detection Using Isolation Forest (20 minutes)

 Train the Isolation Forest Model

# # Standardize the data

scaler = StandardScaler()
df ['Vibration_Scaled'] = scaler.fit_transform(df[['Vibration']])
python
⑦
# Train Isolation Forest model
iso_forest = IsolationForest(contamination=0.05, random_state=42)
df ['Anomaly_Score'] =
iso_forest.fit_predict(df[['Vibration_Scaled']])
# Convert anomaly scores (-1= anomaly, 1 = normal) to binary labels (.
df ['Anomaly_Detected'] = (df ['Anomaly_Score'] = -1) .astype(int)
# Print classification report
print(classification_report(df['Label'], df ['Anomaly_Detected']) )
* Visualizing Isolation Forest Predictions
python
plt.figure(figsize=(8, 5))
sns.scatterplot(x=df.index, y=df['Vibration'], hue=df I'Anomaly_DetecE
plt.xlabel ("Sample Index")
plt.ylabel("Vibration Level (mm/s)"')
plt.title("Anomaly Detection with Isolation Forest") plt. legend ( ["Normal",
"Anomaly"])
plt.show()

Step 5: Anomaly Detection Using Autoencoder Neural Network (25 minutes)

 Define and Train an Autoencoder Model

# Define Autoencoder model

autoencoder = Sequential([
Dense(8, activation='relu', input_shape=(1,)),
Dense(4, activation='relu'),
Dense(8, activation='relu'),
Dense(1, activation='linear')
# Compile the model
autoencoder.compile(optimizer=Adam(learning_rate=0.01), loss='mse')
# Train the Autoencoder
history = autoencoder.fit(df[ 'Vibration_Scaled'], df['Vibration_Scaled

 Reconstruction Error and Anomaly Detection

#Compute reconstruction errors

reconstructed _data = autoencoder-predict(df[ 'Vibration_Scaled'])
reconstruction_error = np.abs(df['Vibration_Scaled'] - reconstructed_di
# Set threshold for anomaly detection (Mean + 2 * Std Dev)
threshold = np-mean (reconstruction_error) + 2 * np.std (reconstruction_‹ df
['Anomaly_Autoencoder'] = (reconstruction_error › threshold).astype(:
# Print classification report
print(classification_report(df['Label'], df['Anomaly_Autoencoder']))
* Visualizing Autoencoder-Based Anomaly Detection
python
plt.figure(figsize=(8, 5))
SnS« Scatterplot(x=dfindex, yedf['Vibration'], hue=dfl'Anomaly_AutoefE
plt.xlabel("Sample Index"')
plt.ylabel("Vibration Level (mm/s)"')
plt.title("Anomaly Detection with Autoencoder")
plt. legend ( ["Normal", "Anomaly"]) plt.show()

Step 6: Compare Model Performance (10 minutes)

 Compare the Isolation Forest and Autoencoder models based on:

 Precision (accuracy of detecting actual anomalies).
 Recall (ability to find all anomalies).
 F1-score (harmonic mean of precision and recall).
#print("Isolation Forest Performance:")
print(classification_report(df['Label'], df['Anomaly_Detected']))
print ("Autoencoder Performance:")
print(classification_report(df['Label'], df['Anomaly_Autoencoder']))

Evaluation Questions:

1. What are the key differences between Isolation Forest and Autoencoder
for anomaly detection?
 Algorithm Type: Isolation Forest is a tree-based model, while an
Autoencoder is a neural network-based model.
 Training Method: Isolation Forest isolates anomalies using decision trees
without requiring labeled data, whereas an Autoencoder learns to
reconstruct normal data patterns and uses reconstruction error as an
anomaly score.
 Feature Dependencies: Isolation Forest works well with tabular data and
high-dimensional data without assuming feature dependencies.
Autoencoders, on the other hand, learn complex feature relationships,
making them better suited for structured or sequential data.
 Computational Complexity: Isolation Forest is computationally efficient
and scalable, whereas Autoencoders require significant computational
resources, especially for deep networks.
 Interpretability: Isolation Forest is more interpretable since it provides a
direct anomaly score based on tree depth, while Autoencoders are more of
a "black-box" requiring additional interpretation techniques.

2. How does contamination level in Isolation Forest affect the results?

Effect of Contamination Level in Isolation Forest
The contamination level is a hyperparameter that represents the expected
proportion of anomalies in the dataset.
 High Contamination: More points are classified as anomalies, increasing
recall but reducing precision, as more normal points might be falsely
labeled as outliers.
 Low Contamination: Fewer points are labeled as anomalies, increasing
precision but possibly missing actual outliers (lower recall).
 Incorrect Contamination Estimate: If the true anomaly ratio is different
from the contamination parameter, the model might overfit or under-
detect anomalies.

3. What happens when we change the threshold for anomaly detection in

the Autoencoder?
Effect of Changing the Threshold in Autoencoder for Anomaly Detection
 Higher Threshold: Fewer anomalies detected, reducing false positives
but potentially missing real anomalies.
 Lower Threshold: More anomalies detected, increasing recall but also
increasing false positives.
 Tuning Strategy: The threshold is usually set based on validation data or
domain knowledge, and sometimes through statistical methods like using
the mean + k standard deviations of reconstruction error.

4. How could this method be applied to real-time predictive maintenance in

factories?
Application to Real-Time Predictive Maintenance in Factories
 Data Collection: Use IoT sensors to collect machine operation data
(vibration, temperature, pressure, sound, etc.).
 Model Training: Train an Autoencoder on normal operating conditions to
learn normal behavior patterns.
 Real-Time Monitoring: Continuously feed live data into the trained
Autoencoder and calculate the reconstruction error.
 Anomaly Detection: If the reconstruction error exceeds a threshold, flag
the instance as an anomaly.
 Predictive Action: Trigger alerts for maintenance before failures occur,
reducing downtime and costs.
Complete Code And Obtained Output:

# Set random seed for reproducibility

np.random.seed(42)

# Generate normal sensor readings (500 samples)

normal_data = np.random.normal(loc=10, scale=2, size=500) # Mean = 10

# Generate abnormal sensor readings (20 samples)

anomaly_data = np.random.normal(loc=25, scale=5, size=20) # Mean = 25

# Combine data into a single dataset

vibration_data = np.concatenate((normal_data, anomaly_data))

# Create labels (0 = Normal, 1 = Anomaly)

labels = np.concatenate((np.zeros(len(normal_data)), np.ones(len(anomaly_data))))

# Convert to DataFrame
df = pd.DataFrame({'Vibration': vibration_data, 'Label': labels})

# Shuffle the dataset

df = df.sample(frac=1, random_state=42).reset_index(drop=True)

# Display dataset summary

print(df.head())

# Visualizing the Data

plt.figure(figsize=(8, 5))
sns.histplot(data=df, x='Vibration', bins=30, kde=True, hue='Label', palette=['green', 'red'])
plt.xlabel("Vibration Level (mm/s)")
plt.ylabel("Frequency")
plt.title("Distribution of Sensor Readings")
plt.legend(["Normal", "Anomaly"])
plt.show()

# Anomaly Detection Using Isolation Forest

# Standardize the data
scaler = StandardScaler()
df['Vibration_Scaled'] = scaler.fit_transform(df[['Vibration']])

# Train Isolation Forest model

iso_forest = IsolationForest(contamination=0.05, random_state=42)
df['Anomaly_Score'] = iso_forest.fit_predict(df[['Vibration_Scaled']])
# Convert anomaly scores (-1= anomaly, 1 = normal) to binary labels (0 = normal, 1 = anomaly)
df['Anomaly_Detected'] = (df['Anomaly_Score'] == -1).astype(int)

# Print classification report

print("Isolation Forest Classification Report:")
print(classification_report(df['Label'], df['Anomaly_Detected']))

# Visualizing Isolation Forest Predictions

plt.figure(figsize=(8, 5))
sns.scatterplot(x=df.index, y=df['Vibration'], hue=df['Anomaly_Detected'], palette=['green', 'red'])
plt.xlabel("Sample Index")
plt.ylabel("Vibration Level (mm/s)")
plt.title("Anomaly Detection with Isolation Forest")
plt.legend(["Normal", "Anomaly"])
plt.show()

# Anomaly Detection Using Autoencoder Neural Network

# Define Autoencoder model
autoencoder = Sequential([
Dense(8, activation='relu', input_shape=(1,)),
Dense(4, activation='relu'),
Dense(8, activation='relu'),
Dense(1, activation='linear')
])

# Compile the model

autoencoder.compile(optimizer=Adam(learning_rate=0.01), loss='mse')

# Train the Autoencoder

history = autoencoder.fit(df['Vibration_Scaled'], df['Vibration_Scaled'],
epochs=50, batch_size=32, validation_split=0.2, verbose=0)
# Reconstruction Error and Anomaly Detection
# Compute reconstruction errors
reconstructed_data = autoencoder.predict(df['Vibration_Scaled'], verbose=0)
reconstruction_error = np.abs(df['Vibration_Scaled'] - reconstructed_data.flatten())

# Set threshold for anomaly detection (Mean + 2 * Std Dev)

threshold = np.mean(reconstruction_error) + 2 * np.std(reconstruction_error)
df['Anomaly_Autoencoder'] = (reconstruction_error > threshold).astype(int)

# Print classification report

print("Autoencoder Classification Report:")
print(classification_report(df['Label'], df['Anomaly_Autoencoder']))

# Visualizing Autoencoder-Based Anomaly Detection

plt.figure(figsize=(8, 5))
sns.scatterplot(x=df.index, y=df['Vibration'], hue=df['Anomaly_Autoencoder'], palette=['green',
'red'])
plt.xlabel("Sample Index")
plt.ylabel("Vibration Level (mm/s)")
plt.title("Anomaly Detection with Autoencoder")
plt.legend(["Normal", "Anomaly"])
plt.show()
# Compare Model Performance
print("Isolation Forest Performance:")
print(classification_report(df['Label'], df['Anomaly_Detected']))
print("\nAutoencoder Performance:")
print(classification_report(df['Label'], df['Anomaly_Autoencoder']))

Khiêm
No ratings yet
Khiêm
7 pages
Experiment 8: Aim: Objective: Tools Used: Theory
No ratings yet
Experiment 8: Aim: Objective: Tools Used: Theory
10 pages
Anomaly ND Condition Monitoring 2
No ratings yet
Anomaly ND Condition Monitoring 2
18 pages
Anomaly Detection
No ratings yet
Anomaly Detection
4 pages
Predictive Maintenance For AirProductionUnit in EuroTram Vehicles MarianaBarros
No ratings yet
Predictive Maintenance For AirProductionUnit in EuroTram Vehicles MarianaBarros
110 pages
10 - Anomaly Detection
No ratings yet
10 - Anomaly Detection
12 pages
Research Project On
No ratings yet
Research Project On
21 pages
1 s2.0 S2215016125000299 Main
No ratings yet
1 s2.0 S2215016125000299 Main
11 pages
Isolation Forest Made Easy & How To Tutorial
No ratings yet
Isolation Forest Made Easy & How To Tutorial
18 pages
Multivariate Time Series Anomaly Detection
No ratings yet
Multivariate Time Series Anomaly Detection
4 pages
Anomaly Detection Time Series Final PDF
No ratings yet
Anomaly Detection Time Series Final PDF
12 pages
5.1.1 Objective and Scope: Jyenis 2020
No ratings yet
5.1.1 Objective and Scope: Jyenis 2020
8 pages
Phase 2.1
No ratings yet
Phase 2.1
9 pages
Ajayi Oluwaniyi Oluwafemi Final Defence
No ratings yet
Ajayi Oluwaniyi Oluwafemi Final Defence
39 pages
Predictive Maintenance Using Isolation Forest - PyImageSearch
No ratings yet
Predictive Maintenance Using Isolation Forest - PyImageSearch
14 pages
Isolation Forest Anomaly Detection
No ratings yet
Isolation Forest Anomaly Detection
3 pages
Naan Mudhalvan
No ratings yet
Naan Mudhalvan
43 pages
Isolation Forest for Anomaly Detection
No ratings yet
Isolation Forest for Anomaly Detection
16 pages
Summarize and Help Me To Write The Paper Complete...
No ratings yet
Summarize and Help Me To Write The Paper Complete...
9 pages
Project Report Based On AI For Predictive Maintenace Using IoT
No ratings yet
Project Report Based On AI For Predictive Maintenace Using IoT
11 pages
Autoencoder Forest for IoT Anomaly Detection
No ratings yet
Autoencoder Forest for IoT Anomaly Detection
27 pages
Report Combined
No ratings yet
Report Combined
11 pages
Isolation & Random Cut Forests Review
No ratings yet
Isolation & Random Cut Forests Review
20 pages
Review 1
No ratings yet
Review 1
8 pages
Bioengineering 10 00405 v2
No ratings yet
Bioengineering 10 00405 v2
30 pages
Machine Failure Prediction
No ratings yet
Machine Failure Prediction
11 pages
Predictive Maintenance Model Based On Anomaly Detection in Induction Motors: A Machine Learning Approach Using Real-Time Iot Data
No ratings yet
Predictive Maintenance Model Based On Anomaly Detection in Induction Motors: A Machine Learning Approach Using Real-Time Iot Data
8 pages
Isolationforest1 Python
No ratings yet
Isolationforest1 Python
7 pages
Matlab La-2
No ratings yet
Matlab La-2
10 pages
Predictivemaintenance FaultDetection
No ratings yet
Predictivemaintenance FaultDetection
12 pages
Ashwath Thesis PDF
No ratings yet
Ashwath Thesis PDF
90 pages
Aetsam Javed Thesis Slides-1
No ratings yet
Aetsam Javed Thesis Slides-1
24 pages
IOT Project Report
No ratings yet
IOT Project Report
13 pages
Manufacturing Machine Learning Tool Mechanical
No ratings yet
Manufacturing Machine Learning Tool Mechanical
13 pages
Final Report
No ratings yet
Final Report
50 pages
Knime Anomaly Detection Visualization
No ratings yet
Knime Anomaly Detection Visualization
13 pages
01 Autoencoder Anomaly Detection Cooling Systems
No ratings yet
01 Autoencoder Anomaly Detection Cooling Systems
11 pages
Anomaly Detection On Industrial Electrical Systems Using Deep Learning
No ratings yet
Anomaly Detection On Industrial Electrical Systems Using Deep Learning
6 pages
AI-Driven Anomaly Detection
No ratings yet
AI-Driven Anomaly Detection
2 pages
Minor Project
No ratings yet
Minor Project
21 pages
Ai-Based Anomaly Detection in Power Electronics
No ratings yet
Ai-Based Anomaly Detection in Power Electronics
25 pages
CCN Presentation
No ratings yet
CCN Presentation
13 pages
Practical No. 5
No ratings yet
Practical No. 5
12 pages
Predictive Maintenance Project Milestone Report
No ratings yet
Predictive Maintenance Project Milestone Report
7 pages
Anomaly Detection in Electricity Consumption Data of Buildings Using Predictive Models
No ratings yet
Anomaly Detection in Electricity Consumption Data of Buildings Using Predictive Models
20 pages
Group4 AutoencodersforIoT
No ratings yet
Group4 AutoencodersforIoT
10 pages
Functional Isolation Forest
No ratings yet
Functional Isolation Forest
16 pages
Temp
No ratings yet
Temp
17 pages
Enhanced Sensor Fault Detection in Aquatic Monitoring Using Deep BALAJI PRESENTATION
No ratings yet
Enhanced Sensor Fault Detection in Aquatic Monitoring Using Deep BALAJI PRESENTATION
14 pages
Enhancing Cybersecurity With Machine Learning
No ratings yet
Enhancing Cybersecurity With Machine Learning
5 pages
Artificial Intelligence Techniques For Predictive Maintenance
No ratings yet
Artificial Intelligence Techniques For Predictive Maintenance
47 pages
What Features in The Dataset Are Most Important For Predicting Equipment Failures?
No ratings yet
What Features in The Dataset Are Most Important For Predicting Equipment Failures?
25 pages
Part III
No ratings yet
Part III
15 pages
Research Article Final Year Project
No ratings yet
Research Article Final Year Project
10 pages
NF Assighment4
No ratings yet
NF Assighment4
5 pages
Mobile Robotics Fault Detection
No ratings yet
Mobile Robotics Fault Detection
42 pages
Review 3
No ratings yet
Review 3
19 pages
Project Title
No ratings yet
Project Title
4 pages
1-S2.0-S2352484723010454-Main 2022
No ratings yet
1-S2.0-S2352484723010454-Main 2022
11 pages
Recursive-Functions A4
No ratings yet
Recursive-Functions A4
21 pages
Laboratory Report For The Experiment On Absorption and Water Content of Aggregates
No ratings yet
Laboratory Report For The Experiment On Absorption and Water Content of Aggregates
8 pages
Important Question For Class 10 Science Light Reflection and Refraction
No ratings yet
Important Question For Class 10 Science Light Reflection and Refraction
62 pages
Reverse Translation
No ratings yet
Reverse Translation
10 pages
Strukturdaten K 2022 GB
No ratings yet
Strukturdaten K 2022 GB
1 page
Business Research CH 5
No ratings yet
Business Research CH 5
10 pages
German: Unit 4 Writing Mark Scheme
No ratings yet
German: Unit 4 Writing Mark Scheme
19 pages
Numerical Study and Optimization of Parabolic Trough Solar Collector Receiver Tube
No ratings yet
Numerical Study and Optimization of Parabolic Trough Solar Collector Receiver Tube
10 pages
Strongest Bolt Bumax
No ratings yet
Strongest Bolt Bumax
12 pages
Revisi 2
No ratings yet
Revisi 2
12 pages
Unit 6 Different Strokes Terminado
No ratings yet
Unit 6 Different Strokes Terminado
32 pages
Transistor Hitachi
No ratings yet
Transistor Hitachi
6 pages
Iso 05006-2006
No ratings yet
Iso 05006-2006
28 pages
Grade 7 PHIL-IRI Mock Post-Test Answers Booklet
No ratings yet
Grade 7 PHIL-IRI Mock Post-Test Answers Booklet
6 pages
Analemmatic Sundial PDF Generator
0% (1)
Analemmatic Sundial PDF Generator
37 pages
TRUMPF Technical Data Sheet TruPulse
No ratings yet
TRUMPF Technical Data Sheet TruPulse
3 pages
ABC vs Traditional Costing Analysis
No ratings yet
ABC vs Traditional Costing Analysis
3 pages
Corporate Video Script
No ratings yet
Corporate Video Script
4 pages
PFR Ujh Multipurpose Project
100% (1)
PFR Ujh Multipurpose Project
64 pages
Materi Descriptive Text Kelas Xi Offline and Online
80% (5)
Materi Descriptive Text Kelas Xi Offline and Online
5 pages
4th Grade Homework Policy
100% (1)
4th Grade Homework Policy
6 pages
VOL II - Zoning Ordinance
No ratings yet
VOL II - Zoning Ordinance
45 pages
Adaptive TD PLL
No ratings yet
Adaptive TD PLL
7 pages
Binnie Solution Practice Answers PDF
No ratings yet
Binnie Solution Practice Answers PDF
2 pages
CE 4269 River Pollution
No ratings yet
CE 4269 River Pollution
17 pages
Assessment 2: - (First Name) (Middle Name) (Last Name)
No ratings yet
Assessment 2: - (First Name) (Middle Name) (Last Name)
4 pages
WameedMUCLecture 2021 92831536
No ratings yet
WameedMUCLecture 2021 92831536
8 pages
Environmental Impact on Species
No ratings yet
Environmental Impact on Species
2 pages
Body Language: School of Management Studies
No ratings yet
Body Language: School of Management Studies
18 pages
Exp 04 & 05
No ratings yet
Exp 04 & 05
5 pages