0% found this document useful (0 votes)

16 views22 pages

Idps-UNIT 1 - Notes

The document provides an overview of Intrusion Detection Systems (IDS) and Intrusion Prevention Systems (IPS), detailing their functions, classifications, and detection methods. It discusses various detection approaches including signature-based, anomaly-based, and hybrid detection, as well as the benefits of implementing IDS in network security. Additionally, it explores the theoretical foundations of detection, including fuzzy logic and machine learning techniques, and categorizes anomaly detection systems based on detection techniques, data types, application domains, and deployment methods.

Uploaded by

austen2304

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

16 views22 pages

Idps-UNIT 1 - Notes

Uploaded by

austen2304

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

CS E 19-INTRUSION DETECTION AND PREVENTION SYSTEM

UNIT- I: INTRODUCTION: Understanding Intrusion Detection – Intrusion detection and

prevention basics – IDS and IPS analysis schemes, Attacks, Detection approaches –Misuse
detection – anamoly detection – specification based detection – hybrid detection THEORETICAL
FOUNDATIONS OF DETECTION: Taxonomy of anomaly detection system – fuzzy logic – Bayes
theory – Artificial Neural networks – Support vector machine – Evolutionary computation –
Association rules – Clustering.

UNIT I: INTRODUCTION

Understanding Intrusion Detection:

Intrusion Detection Systems (IDS) and Intrusion Prevention Systems (IPS) are security
mechanisms designed to detect and respond to unauthorized access or malicious activities within
a computer network or system.

IDS analyze incoming network traffic and identify suspicious patterns or behaviors, while IPS
takes proactive measures to block or prevent detected intrusions.

IDS and IPS Analysis Schemes:

IDS typically operates using either signature-based (misuse detection) or anomaly-based

(behavioral or statistical anomaly detection) analysis schemes.

Signature-based IDS compares incoming data packets with a database of known attack
signatures to identify known threats.

Anomaly-based IDS monitors system behavior and flags deviations from established baselines
as potential intrusions.

A system called an intrusion detection system (IDS) observes network traffic for malicious
transactions and sends immediate alerts when it is observed. It is software that checks a
network or system for malicious activities or policy violations. Each illegal activity or
violation is often recorded either centrally using a SIEM system or notified to an
administration. IDS monitors a network or system for malicious activity and protects a
computer network from unauthorized access from users, including perhaps insiders. The
intrusion detector learning task is to build a predictive model (i.e. a classifier) capable of
distinguishing between ‘bad connections’ (intrusion/attacks) and ‘good (normal) connections’.
How does an IDS work?
• An IDS (Intrusion Detection System) monitors the traffic on a computer network to detect
any suspicious activity.
• It analyzes the data flowing through the network to look for patterns and signs of abnormal
behavior.
• The IDS compares the network activity to a set of predefined rules and patterns to identify
any activity that might indicate an attack or intrusion.
• If the IDS detects something that matches one of these rules or patterns, it sends an alert to
the system administrator.
• The system administrator can then investigate the alert and take action to prevent any
damage or further intrusion.
Classification of Intrusion Detection System

IDS are classified into 5 types:

• Network Intrusion Detection System (NIDS): Network intrusion detection systems
(NIDS) are set up at a planned point within the network to examine traffic from all devices
on the network. It performs an observation of passing traffic on the entire subnet and
matches the traffic that is passed on the subnets to the collection of known attacks. Once an
attack is identified or abnormal behavior is observed, the alert can be sent to the
administrator. An example of a NIDS is installing it on the subnet where firewalls are
located in order to see if someone is trying to crack the firewall.
• Host Intrusion Detection System (HIDS): Host intrusion detection systems (HIDS) run on
independent hosts or devices on the network. A HIDS monitors the incoming and outgoing
packets from the device only and will alert the administrator if suspicious or malicious
activity is detected. It takes a snapshot of existing system files and compares it with the
previous snapshot. If the analytical system files were edited or deleted, an alert is sent to the
administrator to investigate. An example of HIDS usage can be seen on mission-critical
machines, which are not expected to change their layout.
• Protocol-based Intrusion Detection System (PIDS): Protocol-based intrusion detection
system (PIDS) comprises a system or agent that would consistently reside at the front end
of a server, controlling and interpreting the protocol between a user/device and the server. It
is trying to secure the web server by regularly monitoring the HTTPS protocol stream and
accepting the related HTTP protocol. As HTTPS is unencrypted and before instantly
entering its web presentation layer then this system would need to reside in this interface,
between to use the HTTPS.
• Application Protocol-based Intrusion Detection System (APIDS): An application
Protocol-based Intrusion Detection System (APIDS) is a system or agent that generally
resides within a group of servers. It identifies the intrusions by monitoring and interpreting
the communication on application-specific protocols. For example, this would monitor the
SQL protocol explicitly to the middleware as it transacts with the database in the web
server.
• Hybrid Intrusion Detection System: Hybrid intrusion detection system is made by the
combination of two or more approaches to the intrusion detection system. In the hybrid
intrusion detection system, the host agent or system data is combined with network
information to develop a complete view of the network system. The hybrid intrusion
detection system is more effective in comparison to the other intrusion detection system.
Prelude is an example of Hybrid IDS.
Benefits of IDS
• Detects malicious activity: IDS can detect any suspicious activities and alert the system
administrator before any significant damage is done.
• Improves network performance: IDS can identify any performance issues on the network,
which can be addressed to improve network performance.
• Compliance requirements: IDS can help in meeting compliance requirements by
monitoring network activity and generating reports.
• Provides insights: IDS generates valuable insights into network traffic, which can be used
to identify any weaknesses and improve network security.
Detection Method of IDS
1. Signature-based Method: Signature-based IDS detects the attacks on the basis of the
specific patterns such as the number of bytes or a number of 1s or the number of 0s in the
network traffic. It also detects on the basis of the already known malicious instruction
sequence that is used by the malware. The detected patterns in the IDS are known as
signatures. Signature-based IDS can easily detect the attacks whose pattern (signature)
already exists in the system but it is quite difficult to detect new malware attacks as their
pattern (signature) is not known.
2. Anomaly-based Method: Anomaly-based IDS was introduced to detect unknown malware
attacks as new malware is developed rapidly. In anomaly-based IDS there is the use of
machine learning to create a trustful activity model and anything coming is compared with
that model and it is declared suspicious if it is not found in the model. The machine
learning-based method has a better-generalized property in comparison to signature-based
IDS as these models can be trained according to the applications and hardware
configurations.
Comparison of IDS with Firewalls
IDS and firewall both are related to network security but an IDS differs from a firewall as a
firewall looks outwardly for intrusions in order to stop them from happening. Firewalls restrict
access between networks to prevent intrusion and if an attack is from inside the network it
doesn’t signal. An IDS describes a suspected intrusion once it has happened and then signals
an alarm.
Attacks:
1. Malware Attacks: Malicious software designed to infiltrate or damage a computer
system. Examples include viruses, worms, Trojans, and ransomware.
2. Denial of Service (DoS) and Distributed Denial of Service (DDoS) Attacks:
Overload a system or network with excessive traffic, rendering it unavailable to
legitimate users.
3. Intrusions: Unauthorized access to a system or network, often with the intent of
stealing information, causing damage, or disrupting operations.
4. Phishing: Deceptive attempts to obtain sensitive information, such as usernames,
passwords, or financial details, by impersonating a trusted entity.
5. Man-in-the-Middle (MitM) Attacks: Interception and alteration of communication
between two parties without their knowledge or consent.
6. SQL Injection: Exploiting vulnerabilities in web applications to execute malicious
SQL queries, potentially gaining access to sensitive databases.
7. Cross-Site Scripting (XSS): Injecting malicious scripts into web pages viewed by
other users, often leading to session hijacking or theft of cookies.
8. Insider Threats: Malicious activities perpetrated by individuals with authorized access
to the system or network, such as employees or contractors.
Detection Approaches:
1. Signature-based Detection: Compares network traffic or system activity against a
database of known attack signatures. Effective against well-known threats but may miss
new or previously unseen attacks.
2. Anomaly-based Detection: Establishes a baseline of normal behavior and flags
deviations as potential intrusions. Can detect previously unknown attacks but may
produce false positives.
3. Hybrid Detection: Combines signature-based and anomaly-based approaches for
improved accuracy and coverage. Utilizes the strengths of both methods while
mitigating their weaknesses.
4. Specification-based Detection: Defines rules or policies for acceptable behavior and
identifies deviations from these specifications. Useful for detecting policy violations or
unauthorized activities.
5. Behavioral Analysis: Monitors user or system behavior to identify suspicious patterns
or deviations from normal activity. Can detect insider threats and zero-day attacks.
6. Statistical Analysis: Utilizes statistical techniques to analyze network traffic or system
logs for anomalous patterns or trends. Effective for detecting subtle deviations
indicative of potential intrusions.
7. Machine Learning: Applies machine learning algorithms to analyze large volumes of
data and identify patterns associated with malicious activities. Includes techniques such
as neural networks, support vector machines, and clustering.
8. Real-time Monitoring: Constantly monitors network traffic or system activity for signs
of intrusion, enabling rapid detection and response to security incidents.
By deploying a combination of these detection approaches, organizations can enhance their
ability to detect and respond to a wide range of cyber threats effectively.
Misuse Detection:
Misuse detection, also known as signature-based detection, involves comparing network traffic
or system activity against a database of known attack signatures or patterns.
It focuses on identifying predefined patterns associated with known threats, such as specific
sequences of commands or known malicious payloads.
Misuse detection is effective at detecting well-known attacks but may struggle with detecting
novel or previously unseen threats.
Anomaly Detection:
Anomaly detection involves establishing a baseline of normal behavior for a system or network
and flagging deviations from this baseline as potential intrusions.
It identifies activities or patterns that differ significantly from expected behavior, which may
indicate unauthorized or malicious activity.
Anomaly detection is effective at detecting previously unknown attacks but may generate false
positives due to legitimate variations in system behavior.
Specification-based Detection:
Specification-based detection involves defining rules or policies that describe acceptable
behavior within a system or network.
It monitors for deviations from these specifications, such as unauthorized access attempts,
policy violations, or unusual system configurations.
Specification-based detection is useful for detecting insider threats, policy violations, and
unauthorized activities that violate predefined rules.
Hybrid Detection:

Hybrid detection combines multiple detection approaches, such as misuse detection, anomaly
detection, and specification-based detection, for improved accuracy and coverage.
By leveraging the strengths of different detection methods, hybrid detection can effectively
detect a wide range of cyber threats while minimizing false positives and negatives.
It allows organizations to adapt to evolving threat landscapes by dynamically adjusting
detection strategies based on the characteristics of incoming threats.
Each of these detection approaches has its advantages and limitations, and organizations often
employ a combination of techniques to enhance their overall intrusion detection capabilities.

Taxonomy of anomaly detection systems

Taxonomy of anomaly detection systems involves categorizing these systems

based on various criteria. Anomaly detection, also known as outlier detection, is a
critical component in various fields such as cybersecurity, finance, healthcare,
and manufacturing. The taxonomy can be based on several factors, including the
detection technique, data type, application domain, and deployment method. Here
is a detailed explanation of each category:
1. Detection Technique:
- Statistical Methods:
- Description: These methods use statistical models to identify anomalies
based on the assumption that normal data follows a specific statistical
distribution.
- Examples: Z-score analysis, Gaussian distribution-based models, clustering
techniques.

- Machine Learning Approaches:

- Description: Anomaly detection can be framed as a supervised or
unsupervised machine learning problem, using algorithms that learn patterns in
data and identify deviations from those patterns.
- Examples: Support Vector Machines (SVM), Isolation Forests,
Autoencoders, Neural Networks.

- Rule-based Methods:
- Description: These methods define rules that characterize normal behavior
and flag instances that violate these rules as anomalies.
- Examples: Expert systems, threshold-based rules.

2. Data Type:
- Structured Data:
- Description: Involves data organized in tables with predefined fields.
Anomalies in this context might refer to unexpected values or patterns.
- Examples: SQL databases, CSV files.

- Unstructured Data:
- Description: Involves data without a predefined data model, such as text,
images, or videos.
- Examples: Text data, images, videos.

- Time-series Data:
- Description: Data points are recorded over time, and anomalies are
deviations from expected temporal patterns.
- Examples: Stock prices, sensor readings.

3. Application Domain:
- Cybersecurity:
- Description: Detecting unusual activities or patterns in network traffic, log
files, or user behavior to identify potential security threats.
- Examples: Intrusion detection systems, network anomaly detection.

- Finance:
- Description: Identifying fraudulent transactions, unusual trading patterns, or
financial anomalies in banking and trading systems.
- Examples: Credit card fraud detection, algorithmic trading anomaly
detection.

- Healthcare:
- Description: Detecting anomalies in patient data, medical images, or
laboratory results to identify potential health issues.
- Examples: Patient monitoring systems, medical imaging anomaly detection.

- Manufacturing:
- Description: Monitoring equipment, production lines, or quality control
processes to detect deviations from normal operating conditions.
- Examples: Predictive maintenance systems, quality control anomaly
detection.

4. Deployment Method:
- On-premises:
- Description: The anomaly detection system is installed and operated within
the organization's infrastructure.
- Examples: Local servers, private data centers.

- Cloud-based:
- Description: The system is hosted on cloud platforms, providing scalability
and accessibility.
- Examples: AWS anomaly detection services, Azure Machine Learning.

- Hybrid:
- Description: Combining on-premises and cloud-based solutions to leverage
the benefits of both.
- Examples: On-premises data processing with cloud-based machine learning
models.
Fuzzy logic
Fuzzy logic is a mathematical framework that deals with uncertainty and
imprecision. It provides a way to represent and manipulate vague or ambiguous
information in a systematic and structured manner. Developed by Lotfi Zadeh in
the 1960s, fuzzy logic has found applications in various fields, including control
systems, decision-making, artificial intelligence, and data analysis.

Here's a detailed explanation of fuzzy logic:

1. Membership Functions:
- Description: Fuzzy logic uses membership functions to represent the degree of
membership of an element in a fuzzy set. Instead of categorizing an element as
strictly belonging or not belonging to a set (as in classical or crisp logic), fuzzy
logic allows for partial membership.
- Example: In temperature control, a fuzzy set "Cold" might have a membership
function indicating the degree to which a temperature is considered cold.

2. Fuzzy Sets:
- Description: A fuzzy set is an extension of a classical set where elements can
have degrees of membership between 0 and 1. Fuzzy sets are defined by
membership functions that assign a degree of membership to each element.
- Example: A fuzzy set "Tall" might include individuals with varying degrees of
height, represented by different membership values.

3. Fuzzy Rules:
- Description: Fuzzy rules define the relationship between input and output
variables using linguistic terms and conditional statements. These rules capture
human-like reasoning and decision-making processes.
- Example: If "Temperature" is "Cold," then "Heater" should be "On."

4. Fuzzy Inference System (FIS):

- Description: FIS combines fuzzy sets, membership functions, and fuzzy rules
to make decisions or control a system. It involves fuzzification (converting crisp
inputs into fuzzy values), rule evaluation, and defuzzification (converting fuzzy
outputs into crisp values).
- Example: FIS can be used in a washing machine to determine the appropriate
water temperature and duration based on the "Dirtiness" and "Fabric Type"
inputs.
5. Fuzzy Control Systems:
- Description: Fuzzy logic is widely employed in control systems where
conventional methods may be inadequate due to the presence of uncertainties or
nonlinearities. Fuzzy control systems can model complex relationships between
inputs and outputs.
- Example: Fuzzy control in an autonomous vehicle can adapt to varying road
conditions and unexpected obstacles.

6. Fuzzy Clustering:
- Description: Fuzzy clustering techniques, like Fuzzy C-Means (FCM), allow
for the classification of data points into multiple clusters with varying degrees of
membership, providing a more nuanced approach compared to traditional
clustering.
- Example: Fuzzy clustering can be applied in customer segmentation based on
purchasing behavior, allowing customers to belong to multiple segments with
different degrees.

7. Advantages of Fuzzy Logic:

- Handling Uncertainty: Fuzzy logic excels in situations where information is
uncertain, imprecise, or incomplete.
- Linguistic Representation: It allows the incorporation of human-like linguistic
terms into computational models.
- Flexibility: Fuzzy logic provides a flexible and intuitive way to model
complex systems.

8. Limitations of Fuzzy Logic:

- Computational Complexity: Implementing fuzzy logic systems can be
computationally demanding.
- Interpretability: Fuzzy logic systems may be challenging to interpret and fine-
tune due to the subjective nature of linguistic rules.
Bayes' Theorem,

Bayes' Theorem, named after the Reverend Thomas Bayes, is a fundamental

principle in probability theory. It provides a way to update our beliefs about a
hypothesis or event based on new evidence or data. The theorem is particularly
influential in statistics, machine learning, and various fields where uncertainty
and probability play a crucial role. The mathematical expression of Bayes'
Theorem is as follows:

\[ P(A | B) = \frac{P(B | A) \cdot P(A)}{P(B)} \]

Where:
- \( P(A | B) \) is the probability of event A occurring given that event B has
occurred (the posterior probability).
- \( P(B | A) \) is the probability of event B occurring given that event A has
occurred (the likelihood).
- \( P(A) \) is the prior probability of event A occurring.
- \( P(B) \) is the prior probability of event B occurring.

Here's a detailed explanation of Bayes' Theorem:

1. Prior Probability (P(A)):

- Description: The prior probability represents our initial belief or probability of
an event occurring before considering new evidence. It is based on prior
knowledge, experience, or information.
- Example: In medical diagnosis, the prior probability might be the likelihood
of a person having a particular disease based on general population statistics.

2. Likelihood (P(B | A)):

- Description: The likelihood is the probability of observing the new evidence
given that the hypothesis or event is true. It reflects how well the evidence
supports the hypothesis.
- Example: In a diagnostic test, the likelihood might be the probability of
obtaining a positive result given that the patient has the disease.

3. Posterior Probability (P(A | B)):

- Description: The posterior probability is the updated probability of the
hypothesis or event given the new evidence. It combines the prior probability and
the likelihood to reflect the impact of the new information.
- Example: The updated probability of a patient having a disease after
considering the results of a diagnostic test.

4. Evidence or Data (P(B)):

- Description: The probability of observing the evidence or data, regardless of
whether the hypothesis is true or false. It serves as a normalization factor to
ensure that the posterior probability is a valid probability distribution.
- Example: The overall probability of obtaining a positive test result,
considering both cases where the patient has the disease and cases where they
don't.

5. Bayesian Inference:
- Description: Bayes' Theorem is fundamental to Bayesian inference, a
statistical method for updating probabilities as new evidence becomes available.
It allows for a principled way to combine prior knowledge and new data.
- Example: Bayesian inference is widely used in machine learning, particularly
in Bayesian networks and probabilistic programming.

6. Applications of Bayes' Theorem:

- Medical Diagnosis: Assessing the probability of a disease based on symptoms
and diagnostic test results.
- Spam Filtering: Determining the probability that an email is spam based on
certain features.
- Predictive Analytics: Updating predictions based on new data in various fields
such as finance and sports.

7. Bayes' Theorem in Machine Learning:

- Description: Bayes' Theorem is a foundation for various machine learning
algorithms, especially in the field of probabilistic models, such as Naive Bayes
classifiers.
- Example: In a spam classification model, Bayes' Theorem can be used to
calculate the probability that an email is spam given certain features (words,
phrases).
Artificial Neural Networks (ANNs)

Artificial Neural Networks (ANNs) are a class of machine learning models

inspired by the structure and functioning of the human brain. These networks
consist of interconnected nodes, or artificial neurons, organized into layers.
Neural networks have gained prominence due to their ability to learn complex
patterns and relationships from data. Here's a detailed explanation of Artificial
Neural Networks:

1. Neurons (Nodes):
- Description: Neurons are the basic units of an artificial neural network. Each
neuron receives input, processes it using an activation function, and produces an
output. Neurons are organized into layers: input layer, hidden layers, and output
layer.
- Example: In a neural network for image recognition, each neuron in the input
layer may correspond to a pixel value, while neurons in subsequent layers
represent more abstract features.

2. Layers:
- Description: Neural networks are structured in layers: an input layer, one or
more hidden layers, and an output layer. The input layer receives raw data,
hidden layers process this information, and the output layer produces the final
result.
- Example: In a feedforward neural network, data flows from the input layer
through the hidden layers to the output layer.

3. Weights and Biases:

- Description: Neurons are connected by weighted edges, representing the
strength of connections. Each connection has a weight that determines its
influence on the neuron's output. Additionally, each neuron has a bias, a constant
term that adjusts the neuron's activation function.
- Example: In a neural network for predicting house prices, the weights may
represent the importance of features like square footage, and biases may account
for factors like location desirability.

4. Activation Function:
- Description: The activation function determines the output of a neuron based
on its weighted inputs and bias. Common activation functions include sigmoid,
hyperbolic tangent (tanh), and rectified linear unit (ReLU).
- Example: Sigmoid activation function is often used in binary classification
tasks, mapping input values to a range between 0 and 1.

5. Feedforward and Backpropagation:

- Description: In the training process, data is passed through the network in the
forward direction (feedforward). The network's output is compared to the
expected output, and the error is backpropagated to adjust the weights and biases
using optimization algorithms like gradient descent.
- Example: During backpropagation, the weights and biases are adjusted
iteratively to minimize the difference between predicted and actual outputs.

6. Types of Neural Networks:

- Feedforward Neural Networks (FNN): Information flows in one direction
from input to output.
- Recurrent Neural Networks (RNN): Connections form cycles, allowing
information to persist.
- Convolutional Neural Networks (CNN): Specialized for processing grid-like
data such as images.
- Generative Adversarial Networks (GAN): Comprising a generator and a
discriminator to generate realistic data.

7. Deep Learning:
- Description: ANNs with multiple hidden layers are termed deep neural
networks. Deep learning involves training these deep architectures to
automatically learn hierarchical representations.
- Example: Deep learning has excelled in image and speech recognition, natural
language processing, and many other complex tasks.

8. Applications:
- Image and Speech Recognition: ANNs are widely used in recognizing patterns
in images and speech.
- Natural Language Processing (NLP): Neural networks power language models
for tasks like machine translation and sentiment analysis.
- Autonomous Vehicles: Deep learning is employed in the perception and
decision-making systems of self-driving cars.
Support Vector Machine (SVM)
A Support Vector Machine (SVM) is a supervised machine learning algorithm
used for classification and regression tasks. SVMs are particularly effective in
high-dimensional spaces and are widely used in various domains, including
image classification, text classification, and bioinformatics. The primary goal of
an SVM is to find the hyperplane that best separates different classes in the input
space. Here's a detailed explanation of Support Vector Machines:

1. Binary Classification:

- Description: SVM is initially designed for binary classification problems,

where the algorithm learns to differentiate between two classes based on a set of
labeled training examples.

- Example: Given labeled data points representing spam and non-spam emails,
SVM learns a decision boundary to classify new emails as spam or non-spam.

2. Hyperplane:

- Description: In SVM, the decision boundary is represented by a hyperplane,

which is a subspace with one dimension less than the original input space. For a
binary classification problem in a two-dimensional feature space, the hyperplane
is a line.

- Example: In a 2D space, a hyperplane is a line that separates two classes. In a

3D space, it would be a plane, and so on.

3. Support Vectors:

- Description: Support vectors are the data points that lie closest to the decision
boundary (hyperplane). They play a crucial role in defining the optimal
hyperplane and are used to maximize the margin between classes.
- Example: In a scatter plot, the support vectors are the points lying on the
edges of the clusters.

4. Margin:

- Description: The margin is the distance between the decision boundary and
the nearest data point from either class. SVM aims to maximize this margin,
providing a wider gap between classes, which generally leads to better
generalization to new, unseen data.

- Example: In a scatter plot, the margin is the distance between the decision
boundary and the nearest point from either class.

5. Kernel Trick:

- Description: SVM can efficiently handle non-linear decision boundaries by

using kernel functions. These functions transform the input features into a higher-
dimensional space, making it possible to find a linear decision boundary in that
space.

- Example: The polynomial kernel or radial basis function (RBF) kernel can be
applied to capture complex relationships between features.

6. Soft Margin SVM:

- Description: In cases where the data is not perfectly separable, a soft margin
SVM allows for some misclassifications to achieve a balance between
maximizing the margin and minimizing errors.

- Example: In situations where there is noise or outliers in the data, a soft

margin SVM can be more robust.

7. Multi-class Classification:
- Description: SVM can be extended to handle multi-class classification by
training multiple binary classifiers and combining their outputs.

- Example: One-vs-All (OvA) or One-vs-One (OvO) strategies can be

employed for multi-class SVM, where multiple binary classifiers are trained for
each class.

8. Advantages:

- SVMs are effective in high-dimensional spaces.

- They work well even with a limited amount of data.

- The use of a margin makes SVMs less sensitive to outliers.

9. Limitations:

- SVMs can be sensitive to the choice of kernel and hyperparameter tuning.

- Training time can be high for large datasets.

- Interpretability of the model might be challenging, especially in high-

dimensional spaces.

10. Applications:

- Image Recognition: SVMs are used in image classification tasks.

- Text Classification: SVMs are applied to tasks such as spam detection and
sentiment analysis.

- Bioinformatics: SVMs are used for protein classification and gene expression
analysis.
Evolutionary Computation (EC)

Evolutionary Computation (EC) is a family of optimization algorithms inspired

by the principles of natural evolution. These algorithms are used to find
approximate solutions to optimization and search problems by mimicking the
processes of natural selection, recombination, mutation, and survival of the
fittest. Evolutionary computation techniques are widely applied in various fields,
including engineering, computer science, finance, and biology. Here's a detailed
explanation of Evolutionary Computation:

1. Genetic Algorithm (GA):

- Description: Genetic Algorithms are one of the most well-known evolutionary
computation techniques. They use a population of candidate solutions encoded as
chromosomes (genetic strings) and apply genetic operators such as selection,
crossover (recombination), and mutation to evolve solutions over successive
generations.
- Example: Optimizing parameters for a machine learning algorithm or finding
the optimal route for a delivery truck.

2. Genetic Programming (GP):

- Description: Genetic Programming extends the principles of genetic
algorithms to evolve computer programs or mathematical expressions. The
population consists of symbolic structures (syntax trees), and genetic operations
manipulate the trees to improve performance.
- Example: Automatically evolving a mathematical formula to approximate a
given function.

3. Evolutionary Strategies (ES):

- Description: Evolutionary Strategies focus on optimizing a set of parameters
in a continuous domain. They use mutation and recombination operations to
adapt a population of candidate solutions to the environment, without explicitly
representing a chromosome.
- Example: Parameter tuning in machine learning models or optimizing control
policies for robotics.

4. Differential Evolution (DE):

- Description: Differential Evolution is a population-based optimization
algorithm that employs a differential mutation operator. It creates new candidate
solutions by combining differences between randomly chosen members of the
population.
- Example: Optimizing complex functions or parameter tuning for optimization
problems.

5. Ant Colony Optimization (ACO):

- Description: Inspired by the foraging behavior of ants, ACO algorithms
simulate the exploration of solution spaces using artificial ants. They lay
pheromone trails to communicate and reinforce paths leading to better solutions.
- Example: Solving the Traveling Salesman Problem or routing optimization in
computer networks.

6. Particle Swarm Optimization (PSO):

- Description: PSO is inspired by the social behavior of birds or fish. It
maintains a population of particles that traverse the search space, adjusting their
positions based on personal best and global best solutions.
- Example: Function optimization, parameter tuning, or training neural network
weights.

7. Memetic Algorithm:
- Description: Memetic Algorithms combine evolutionary algorithms with local
search techniques. They use evolutionary processes to explore the solution space
globally and apply local search to refine promising solutions.
- Example: Combining genetic algorithms with gradient descent for
optimization tasks.

8. Applications:
- Engineering Design: Optimizing parameters in structural design or circuit
layouts.
- Financial Modeling: Portfolio optimization, risk management, and algorithmic
trading strategy development.
- Machine Learning: Feature selection, hyperparameter tuning, and neural
network architecture optimization.
- Bioinformatics: Protein structure prediction, drug design, and DNA sequence
alignment.

9. Advantages:
- Effective for complex, non-linear, and multimodal optimization problems.
- Robust against local optima due to the population-based nature.
- Parallelizable, allowing for efficient exploration of solution spaces.

10. Challenges:
- Proper parameter tuning is crucial for algorithm performance.
- May require significant computational resources for complex problems.
- Interpretability of results can be challenging.

Association Rules:
Association Rules refer to patterns that highlight relationships or associations
between items in a dataset. These rules are commonly used in data mining and are
particularly useful in market basket analysis. The primary goal is to discover
interesting relationships, dependencies, or correlations among a set of items in a
transactional database. The Apriori algorithm is one of the commonly used
methods for generating association rules.

1. Frequent Itemsets:
- Description: Frequent itemsets are subsets of items that appear together
frequently in the dataset. They are identified based on a minimum support
threshold.
- Example: If "A" and "B" frequently appear together in transactions, {A, B} is
a frequent itemset.

2. Support:
- Description: Support measures how often a particular itemset appears in the
dataset. It is calculated as the ratio of transactions containing the itemset to the
total number of transactions.
- Example: If the support of {A, B} is 0.2, it means that the itemset appears in
20% of the transactions.

3. Confidence:
- Description: Confidence measures the reliability of a rule. It is the conditional
probability of the occurrence of the consequent given the antecedent.
- Example: If the confidence of the rule {A} → {B} is 0.8, it means that "B" is
bought with "A" in 80% of the cases where "A" is present.

4. Lift:
- Description: Lift measures how much more likely the consequent is to occur
when the antecedent is known, compared to its likelihood without the antecedent.
- Example: If the lift of the rule {A} → {B} is 1.2, it indicates that the
likelihood of buying "B" increases by 20% when "A" is present.

5. Applications:
- Market Basket Analysis in retail.
- Recommender systems to suggest related products.
- Web usage mining to understand user behavior on websites.

6. Advantages:
- Helps in revealing hidden patterns in large datasets.
- Provides insights for decision-making and strategy formulation.

7. Limitations:
- Sensitive to noise and outliers in the data.
- May produce a large number of rules, making interpretation challenging.

Clustering:
Clustering is a machine learning technique that involves grouping similar data
points together based on certain features or characteristics. The objective is to
partition a dataset into subsets, or clusters, in a way that points within the same
cluster are more similar to each other than to those in other clusters. There are
various clustering algorithms, including K-Means, Hierarchical Clustering, and
DBSCAN.

1. K-Means Clustering:
- Description: K-Means partitions data into k clusters by iteratively assigning
points to the cluster with the nearest centroid and updating the centroids based on
the mean of the points in each cluster.
- Example: Grouping customers based on their purchasing behavior.

2. Hierarchical Clustering:
- Description: Hierarchical Clustering builds a hierarchy of clusters by either
iteratively merging smaller clusters into larger ones (agglomerative) or dividing
larger clusters into smaller ones (divisive).
- Example: Representing taxonomic relationships in biology.

3. DBSCAN (Density-Based Spatial Clustering of Applications with Noise):

- Description: DBSCAN groups data points based on their density, identifying
clusters as areas with higher point density separated by areas with lower density.
- Example: Identifying dense regions in a geographical dataset.

4. Silhouette Score:
- Description: Silhouette score is a measure of how well-separated clusters are.
It ranges from -1 to 1, where a higher score indicates better-defined clusters.
- Example: Evaluating the quality of clustering results.

5. Applications:
- Customer segmentation for targeted marketing.
- Image segmentation for object recognition.
- Anomaly detection by identifying unusual patterns.

6. Advantages:
- Does not require labeled data for training.
- Useful for exploratory data analysis and pattern discovery.

7. Limitations:
- Sensitive to the choice of parameters.
- May struggle with non-convex or irregularly shaped clusters.

8. Differences and Relationship:

- Overlap: Clustering identifies groups of similar items, while association rules
reveal associations between items irrespective of their groupings.
- Interconnectedness: Association rules focus on relationships between items
within transactions, whereas clustering is concerned with grouping entire
instances.
- Application: Association rules are often applied in market analysis, while
clustering has broader applications, including pattern discovery and anomaly
detection.

Intrusion Detection System
No ratings yet
Intrusion Detection System
3 pages
Chapter 3
No ratings yet
Chapter 3
4 pages
InfoSec Lec06 Oct14-18
No ratings yet
InfoSec Lec06 Oct14-18
5 pages
EH Sagar Debnath
No ratings yet
EH Sagar Debnath
18 pages
Intrusion Detection
No ratings yet
Intrusion Detection
4 pages
Intrusion Detection System (IDS)
No ratings yet
Intrusion Detection System (IDS)
15 pages
Intrusion Detection System
No ratings yet
Intrusion Detection System
2 pages
Cyber Security
No ratings yet
Cyber Security
77 pages
An Intrusion Detection System
No ratings yet
An Intrusion Detection System
5 pages
Intrusion Prevention System Guide
No ratings yet
Intrusion Prevention System Guide
6 pages
Evading IDS Firewall and Honeypots - 12
No ratings yet
Evading IDS Firewall and Honeypots - 12
22 pages
What Are Intrusion Detection Systems
No ratings yet
What Are Intrusion Detection Systems
5 pages
IDS
No ratings yet
IDS
36 pages
Idt Seminar
No ratings yet
Idt Seminar
14 pages
Network Security: IDS Explained
No ratings yet
Network Security: IDS Explained
31 pages
Section C Lecture 3
No ratings yet
Section C Lecture 3
16 pages
Unit 5 IDS
No ratings yet
Unit 5 IDS
11 pages
FALL2020-21 CSE4026 ETH AP2020211000450 Reference Material I 29-Aug-2020 MODULE-2 PART - 6 INTRUSION DETECTION SYSTEM
No ratings yet
FALL2020-21 CSE4026 ETH AP2020211000450 Reference Material I 29-Aug-2020 MODULE-2 PART - 6 INTRUSION DETECTION SYSTEM
13 pages
CN Unit V Notes
No ratings yet
CN Unit V Notes
23 pages
Intruders
No ratings yet
Intruders
25 pages
Subject Review Intrusion Detection System IDS and
No ratings yet
Subject Review Intrusion Detection System IDS and
4 pages
Intrusion Detection and Prevention System: Varun Gupta
No ratings yet
Intrusion Detection and Prevention System: Varun Gupta
5 pages
Unit 4
No ratings yet
Unit 4
23 pages
Unit IV
No ratings yet
Unit IV
48 pages
Speech
No ratings yet
Speech
3 pages
Dcit 418-Slide 2
No ratings yet
Dcit 418-Slide 2
42 pages
KFA SCT 115 Security Computing Week 18 IDS and IPS
No ratings yet
KFA SCT 115 Security Computing Week 18 IDS and IPS
17 pages
Intrusion Detection Systems
No ratings yet
Intrusion Detection Systems
11 pages
Meat of Fowl
No ratings yet
Meat of Fowl
15 pages
Intrusion Detection System: By:-Saumitya Srivastava Intern
No ratings yet
Intrusion Detection System: By:-Saumitya Srivastava Intern
9 pages
Intrusion Detection ZU
No ratings yet
Intrusion Detection ZU
34 pages
Intrusion Detection System
No ratings yet
Intrusion Detection System
4 pages
An Intrusion Detection System
No ratings yet
An Intrusion Detection System
2 pages
Lec 1 IDS
No ratings yet
Lec 1 IDS
49 pages
Unit Iv Intrusion Detection
No ratings yet
Unit Iv Intrusion Detection
23 pages
Ids & Ips
No ratings yet
Ids & Ips
6 pages
Intrusion Detection System - Wikipedia
No ratings yet
Intrusion Detection System - Wikipedia
15 pages
Module 4
No ratings yet
Module 4
64 pages
Intrusion Detection
No ratings yet
Intrusion Detection
4 pages
CC Project Report
No ratings yet
CC Project Report
26 pages
Network Security: IDS & IPS Insights
100% (1)
Network Security: IDS & IPS Insights
5 pages
Intrusion Detection System-Types and Prevention
No ratings yet
Intrusion Detection System-Types and Prevention
6 pages
Intrusion Detection System
No ratings yet
Intrusion Detection System
13 pages
Intrusion Detection
100% (1)
Intrusion Detection
13 pages
Intrusion Detection System Seminar
No ratings yet
Intrusion Detection System Seminar
38 pages
Computer Security: Vinod Ganapathy
No ratings yet
Computer Security: Vinod Ganapathy
63 pages
10 PNJ Ids
No ratings yet
10 PNJ Ids
33 pages
Module 5
No ratings yet
Module 5
19 pages
Intrusion Detection System
100% (1)
Intrusion Detection System
23 pages
Intrusion Detection System: Survey & Analysis of Nids: Keywords
No ratings yet
Intrusion Detection System: Survey & Analysis of Nids: Keywords
7 pages
Intrusion Detection System
No ratings yet
Intrusion Detection System
10 pages
Ids (En)
No ratings yet
Ids (En)
32 pages
Lec 7
No ratings yet
Lec 7
40 pages
Intrusion Detection Primer
No ratings yet
Intrusion Detection Primer
5 pages
Lecture 13 Ips-Ids
No ratings yet
Lecture 13 Ips-Ids
17 pages
Unit 4
No ratings yet
Unit 4
40 pages
Intrusion Detection System
No ratings yet
Intrusion Detection System
44 pages
Ethical Hacking Module 4
No ratings yet
Ethical Hacking Module 4
40 pages
Forest Report 2023
No ratings yet
Forest Report 2023
2 pages
Major & Minor Relief of Oceans
No ratings yet
Major & Minor Relief of Oceans
2 pages
Unit 1-Ehpm
No ratings yet
Unit 1-Ehpm
27 pages
Intrusion Detection and Prevention System May 2024
No ratings yet
Intrusion Detection and Prevention System May 2024
2 pages
Unit 2 Idps
No ratings yet
Unit 2 Idps
12 pages
Unit 3 Second Half Ehpm
No ratings yet
Unit 3 Second Half Ehpm
10 pages
Chieftain 2100X Illustrated Parts Catalog (Hosur) Revision 3.3
No ratings yet
Chieftain 2100X Illustrated Parts Catalog (Hosur) Revision 3.3
357 pages
Python MySQL Fashion Store Management
No ratings yet
Python MySQL Fashion Store Management
17 pages
APC AP8853 Metered Rack PDU Data Sheet
No ratings yet
APC AP8853 Metered Rack PDU Data Sheet
2 pages
Usenet
No ratings yet
Usenet
5 pages
GSM - Mobility - Important Paramaters
No ratings yet
GSM - Mobility - Important Paramaters
12 pages
Deep Learning Quran NLP Guide
No ratings yet
Deep Learning Quran NLP Guide
11 pages
Medium Error During Reconstruction
No ratings yet
Medium Error During Reconstruction
3 pages
SPM Lecture 2 ScopeManagement
No ratings yet
SPM Lecture 2 ScopeManagement
40 pages
NUPCOs Medical Equipment Catalogue December 2024
No ratings yet
NUPCOs Medical Equipment Catalogue December 2024
1,033 pages
SAP Integration Hub Guide 2022
No ratings yet
SAP Integration Hub Guide 2022
458 pages
Zero Trust Architecture
50% (2)
Zero Trust Architecture
12 pages
Chapter 1 - Process Deadlocks
No ratings yet
Chapter 1 - Process Deadlocks
48 pages
Vpls Configuration Ios XR With BGP and LDP Autodiscovery
No ratings yet
Vpls Configuration Ios XR With BGP and LDP Autodiscovery
104 pages
Hargadon-1997-Technology Brokering and Innovation in A Product Development Firm
No ratings yet
Hargadon-1997-Technology Brokering and Innovation in A Product Development Firm
35 pages
Extended Essay
No ratings yet
Extended Essay
18 pages
OBD Data Collection: Methodology and Protocol in Accident Cases
No ratings yet
OBD Data Collection: Methodology and Protocol in Accident Cases
10 pages
Cot Arts 6 Q4
No ratings yet
Cot Arts 6 Q4
10 pages
TV User Guide for Setup & Features
No ratings yet
TV User Guide for Setup & Features
129 pages
Dcx4s-Blower Replacement
No ratings yet
Dcx4s-Blower Replacement
2 pages
"Ayacucho:hambre y Esperanza", de Antonio Díaz Martínez (1969)
100% (1)
"Ayacucho:hambre y Esperanza", de Antonio Díaz Martínez (1969)
291 pages
Smart Glasses: Farhana Abdullah, Arjun Vishwakarma
No ratings yet
Smart Glasses: Farhana Abdullah, Arjun Vishwakarma
5 pages
VOLTE Introduction and Solutio N: Security Level
100% (5)
VOLTE Introduction and Solutio N: Security Level
35 pages
CHAPTER 2 & 3 Comp Graph
No ratings yet
CHAPTER 2 & 3 Comp Graph
10 pages
Java Programming Basics Guide
No ratings yet
Java Programming Basics Guide
10 pages
6 Diagnostic Stago
No ratings yet
6 Diagnostic Stago
6 pages
Lab #2 - Data Analysis With NumPy and Pandas
No ratings yet
Lab #2 - Data Analysis With NumPy and Pandas
7 pages
MIS for Students and Professionals
No ratings yet
MIS for Students and Professionals
3 pages
Wa 2a Product Manual
No ratings yet
Wa 2a Product Manual
16 pages
Loan Management System Report
No ratings yet
Loan Management System Report
30 pages
User Guide: Thinkpad T430S and T430Si
No ratings yet
User Guide: Thinkpad T430S and T430Si
192 pages

Idps-UNIT 1 - Notes

Uploaded by

Idps-UNIT 1 - Notes

Uploaded by

CS E 19-INTRUSION DETECTION AND PREVENTION SYSTEM

UNIT- I: INTRODUCTION: Understanding Intrusion Detection – Intrusion detection and

Understanding Intrusion Detection:

IDS and IPS Analysis Schemes:

IDS typically operates using either signature-based (misuse detection) or anomaly-based

IDS are classified into 5 types:

Taxonomy of anomaly detection systems

Taxonomy of anomaly detection systems involves categorizing these systems

- Machine Learning Approaches:

Here's a detailed explanation of fuzzy logic:

4. Fuzzy Inference System (FIS):

7. Advantages of Fuzzy Logic:

8. Limitations of Fuzzy Logic:

Bayes' Theorem, named after the Reverend Thomas Bayes, is a fundamental

\[ P(A | B) = \frac{P(B | A) \cdot P(A)}{P(B)} \]

Here's a detailed explanation of Bayes' Theorem:

1. Prior Probability (P(A)):

2. Likelihood (P(B | A)):

3. Posterior Probability (P(A | B)):

4. Evidence or Data (P(B)):

6. Applications of Bayes' Theorem:

7. Bayes' Theorem in Machine Learning:

Artificial Neural Networks (ANNs) are a class of machine learning models

3. Weights and Biases:

5. Feedforward and Backpropagation:

6. Types of Neural Networks:

- Description: SVM is initially designed for binary classification problems,

- Description: In SVM, the decision boundary is represented by a hyperplane,

- Example: In a 2D space, a hyperplane is a line that separates two classes. In a

- Description: SVM can efficiently handle non-linear decision boundaries by

6. Soft Margin SVM:

- Example: In situations where there is noise or outliers in the data, a soft

- Example: One-vs-All (OvA) or One-vs-One (OvO) strategies can be

- SVMs are effective in high-dimensional spaces.

- They work well even with a limited amount of data.

- The use of a margin makes SVMs less sensitive to outliers.

- SVMs can be sensitive to the choice of kernel and hyperparameter tuning.

- Training time can be high for large datasets.

- Interpretability of the model might be challenging, especially in high-

- Image Recognition: SVMs are used in image classification tasks.

Evolutionary Computation (EC) is a family of optimization algorithms inspired

1. Genetic Algorithm (GA):

2. Genetic Programming (GP):

3. Evolutionary Strategies (ES):

4. Differential Evolution (DE):

5. Ant Colony Optimization (ACO):

6. Particle Swarm Optimization (PSO):

3. DBSCAN (Density-Based Spatial Clustering of Applications with Noise):

8. Differences and Relationship:

You might also like