
Department of Artificial Intelligence and Data Science

Phishing URL Detection using DRL Algorithms with Knowledge Infusion

GUIDED BY:
Mrs. S. Priya
Assistant Professor (CSE)

PRESENTED BY:
ARKAT SREEKANTH (621721243005)
KAMINENI LAKSHMIDHAR (621721243025)
KOSURI BHARGAV KUMAR (621721243028)
PULI SAI CHARAN (621721243042)
PROBLEM STATEMENT

• Over 3 million phishing URLs are created every month, with attackers constantly evolving their strategies to
bypass traditional detection systems (Google Transparency Report, 2024).

• Existing static supervised models lack adaptability and struggle to generalize across newly crafted or low-
reputation URLs.

• Fewer than 15% of detection systems integrate real-time user feedback or external threat intelligence (e.g.,
VirusTotal) into their classification loop.

• This highlights a critical gap in interactive, reinforcement-driven models that can learn continuously and
refine predictions on-the-fly.
ABSTRACT
• This research proposes a Reinforcement Learning (RL) approach using Proximal Policy Optimization
(PPO) for multi-class URL classification, targeting Harmless (0), Suspicious (1), and Malicious (2)
categories.

• The model leverages rich feature engineering, incorporating VirusTotal analytics, WHOIS data, Shannon
entropy, and URL structure metrics, followed by feature normalization to enhance learning efficiency.
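As a concrete illustration of the Shannon entropy and URL structure metrics mentioned above, the sketch below computes a few of them with the standard library. The function names and the exact feature set are illustrative assumptions, not the project's actual code.

```python
import math
from collections import Counter
from urllib.parse import urlparse

def shannon_entropy(s: str) -> float:
    """Shannon entropy of a string in bits per character."""
    if not s:
        return 0.0
    counts = Counter(s)
    n = len(s)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())

def url_structure_features(url: str) -> dict:
    """A few simple URL structure metrics; the full pipeline would
    append WHOIS and VirusTotal fields to this vector."""
    host = urlparse(url).netloc
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": host.count("."),
        "num_hyphens": host.count("-"),
        "has_ip_host": host.replace(".", "").isdigit(),
        "entropy": shannon_entropy(url),
    }
```

In the full pipeline these values would be concatenated with the API-derived fields and normalized (e.g., with scikit-learn's `StandardScaler`) before being fed to the agent, as the normalization step above describes.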

• A custom Gym environment is developed where each observation (URL feature vector) is associated with a
discrete classification action, and the agent receives a reward signal (+1/-1) based on the correctness of the
prediction.

• Unlike static Supervised Learning (SL) methods such as Random Forest, SVC, and XGBoost, the PPO
model supports real-time inference with online feedback integration, enabling it to adapt dynamically in
production environments.

• Experimental results show that the RL model achieves 98% accuracy with competitive F1-scores across all
classes and significantly outperforms SL models in terms of interactivity, robustness, and adaptability to
unseen threats.
EXISTING SYSTEM
 Traditional URL classification systems heavily rely on supervised learning models such as Random Forest, SVM,
or XGBoost, which perform well on static, labeled datasets but lack adaptability in evolving threat landscapes.

 Recent deep learning (DL) approaches primarily use Convolutional Neural Networks (CNNs) to learn URL text
patterns, but these models often treat URLs as sequences without incorporating semantic or contextual threat
intelligence (like WHOIS or VirusTotal data).

 Most existing solutions are offline, requiring retraining with labeled data and do not adapt to live user feedback
or environmental changes post-deployment.

 Additionally, current models fail to dynamically adjust to newly emerging, low-reputation domains, especially
when the feature distribution shifts in real-time scenarios.

 Our project directly addresses these limitations by implementing a Reinforcement Learning-based system
(PPO) that uses reward shaping, API-derived intelligence, and an interactive feedback loop.
DISADVANTAGES

 Existing models rely heavily on static supervised learning, making them ineffective when faced with new or
evolving URL threats that were not present in the training data.

 Most systems lack real-time adaptability, meaning they cannot adjust predictions based on user feedback or
integrate updated threat intelligence dynamically.

 Deep learning models, particularly those based on CNNs, often treat URLs as mere character sequences and
fail to leverage semantic context or external threat sources like VirusTotal or WHOIS data.

 There is a significant absence of feedback loops in traditional approaches, preventing them from learning
from false positives or misclassifications post-deployment.

 Many models are optimized solely for offline accuracy metrics, resulting in poor robustness and
generalization to zero-day attacks or domain spoofing scenarios in real environments.
PROPOSED SYSTEM

 Reinforcement Learning-Based Detection

 Real-Time Feedback Integration

 Multi-Class Threat Classification

 Threat Intelligence Augmentation

 Adaptive Learning Environment


ADVANTAGES

 Continuously Learns from Feedback

 Adapts to New Threats

 Real-Time URL Analysis

 Integrates External Intelligence

 Minimizes False Predictions


BLOCK DIAGRAM:
HARDWARE REQUIREMENTS
 Computer or Laptop – For model development and implementation.

 Processor – Intel i5/i7 or AMD Ryzen 5/7 for efficient computation.

 RAM – Minimum 8GB (16GB recommended) for handling large datasets.

 Storage – At least 256GB SSD for faster data processing.

 Internet Connection – Required for data collection, API integration, and model deployment.
SOFTWARE REQUIREMENTS
 Python 3.8+ : Primary language for implementation and scripting.

 Stable-Baselines3: Library for Proximal Policy Optimization (PPO) reinforcement learning.

 Gymnasium: For building custom RL environments.

 Scikit-learn: Used for preprocessing, feature scaling, and supervised model benchmarking.

 Requests and WHOIS: To fetch real-time threat intelligence from VirusTotal and domain registrars.
MODULES

 Data Preprocessing Module
– Handles cleaning, feature extraction (length, entropy, WHOIS), and normalization.
 Threat Intelligence Module
– Integrates VirusTotal API and WHOIS data to enrich each URL with contextual metadata.
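A hedged sketch of the VirusTotal lookup this module performs: the public v3 API identifies a URL by its unpadded URL-safe base64 encoding and authenticates with an `x-apikey` header. The helper names are illustrative, and the request is only constructed here, not sent.

```python
import base64

VT_URL_ENDPOINT = "https://www.virustotal.com/api/v3/urls/{}"

def vt_url_id(url: str) -> str:
    """VirusTotal v3 identifies a URL by its unpadded URL-safe base64 form."""
    return base64.urlsafe_b64encode(url.encode()).decode().rstrip("=")

def build_vt_request(url, api_key):
    """Return the (endpoint, headers) pair for a VirusTotal URL report lookup.
    Sending it is one line with requests: requests.get(endpoint, headers=headers).
    Registrar metadata can be fetched similarly, e.g. with the python-whois
    package: whois.whois("example.com")."""
    return VT_URL_ENDPOINT.format(vt_url_id(url)), {"x-apikey": api_key}
```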
 RL Environment Module
– Defines custom Gym environment, actions (0, 1, 2), reward logic, and state transitions.
 PPO Agent Module
– Implements and trains the Proximal Policy Optimization model to classify URLs.

 Feedback and Evaluation Module
– Allows user feedback, updates reward logic, and provides real-time performance evaluation.
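A minimal sketch of how confirm/correct user feedback could be turned into the +1/-1 reward signal and a running evaluation metric; the class and method names are hypothetical, not the project's actual code.

```python
def feedback_reward(predicted: int, user_label: int) -> float:
    """Reward emitted when a user confirms or corrects a prediction (+1 / -1)."""
    return 1.0 if predicted == user_label else -1.0

class EvaluationTracker:
    """Rolling counters behind a real-time performance readout (illustrative)."""

    def __init__(self):
        self.correct = 0
        self.total = 0

    def record(self, predicted: int, user_label: int) -> float:
        reward = feedback_reward(predicted, user_label)
        self.correct += reward > 0
        self.total += 1
        return reward

    @property
    def accuracy(self) -> float:
        return self.correct / self.total if self.total else 0.0
```

In deployment, each returned reward would also be fed back into the agent's replay of recent decisions to keep the policy current.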
OUTPUT
FUTURE ENHANCEMENTS

 Real-Time Deployment at Scale: Integrate the PPO model into a live browser extension or network gateway for proactive threat detection.

 Active Learning Loop: Incorporate an automated feedback retraining mechanism where high-confidence user input updates the policy continuously.

 Multi-Modal Feature Expansion: Combine URL features with HTML, DNS, and SSL certificate analysis to further strengthen detection accuracy.

 Federated Threat Intelligence: Enable secure, privacy-preserving collaboration across organizations to share real-time malicious indicators.

 Explainable RL Outputs: Add interpretability layers to help cybersecurity analysts understand why a URL was marked suspicious or malicious.
CONCLUSION

 Developed a pure PPO-based reinforcement learning model for multi-class URL classification: Harmless, Suspicious, Malicious.

 Integrated real-time feedback and external intelligence from VirusTotal and WHOIS to enhance decision accuracy.

 Outperformed static supervised models in terms of adaptability, resilience, and real-time threat handling.

 Demonstrated robust training performance with structured reward signals and policy optimization techniques.

 Enabled live URL prediction and user interaction, paving the way for self-improving cybersecurity systems.
REFERENCES
 M. Bhattacharya et al., "Random Forest for Phishing Detection," IEEE Access, 2020.

 Y. Li et al., "Deep Reinforcement Learning for Cyber Anomaly Detection," IEEE TNNLS, 2022.

 S. Khanzadeh, E. C. P. Neto, S. Iqbal, M. Alalfi, S. Buffett, "An Exploratory Study on Domain Knowledge Infusion in Deep Learning for Automated Threat Defense," published online 28 Jan 2025. © Crown 2025.

 C. Zhang, P. Sun, Y. Fu, "RL4Security: Reinforcement Learning for Cybersecurity," arXiv preprint arXiv:2103.06665, 2021.

 L. Li, D. Wu, "Malicious URL Detection via Deep Learning and Feature Fusion," Computers & Security, Vol. 114, 2022.


THANK YOU
