🛡️ PhishGuard — Hybrid Phishing Detection Platform

PhishGuard is a hybrid phishing detection platform that combines rule-based heuristics and machine learning to detect malicious URLs and phishing emails. Each verdict comes with an explainable risk score, so users can see why an item was flagged.

📊 Results

Target	Accuracy
Malicious URL Detection	87.8%
Phishing Email Detection	99.2%

🏗️ Architecture

PhishGuard runs two independent detection pipelines — one for URLs, one for emails — each combining a rule engine with a trained ML model.

         ┌────────────────────────────────────────┐
         │              Flask Web App              │
         │          app/app.py + templates/        │
         └───────────────┬────────────────────────┘
                         │
          ┌──────────────┴──────────────┐
          │                             │
   ┌──────▼──────┐               ┌──────▼──────┐
   │  URL Input  │               │ Email Input │
   └──────┬──────┘               └──────┬──────┘
          │                             │
   ┌──────▼──────┐               ┌──────▼──────┐
   │  url_rules  │               │ email_rules │   ← Rule-based heuristics
   └──────┬──────┘               └──────┬──────┘     (rules/)
          │                             │
   ┌──────▼──────┐               ┌──────▼──────┐
   │url_features │               │email_features│  ← Feature extraction
   └──────┬──────┘               └──────┬──────┘     (features/)
          │                             │
   ┌──────▼────────────┐    ┌───────────▼─────────────┐
   │phishing_url_model │    │ phishing_email_model    │  ← Trained ML models
   │      .pkl         │    │ + email_tfidf_vectorizer│    (model/)
   └──────┬────────────┘    └───────────┬─────────────┘
          │                             │
          └──────────────┬──────────────┘
                         │
                  ┌──────▼──────┐
                  │ Risk Score  │
                  │  + Verdict  │
                  └─────────────┘

How Each Pipeline Works

URL Detection

1. url_rules.py applies rule-based heuristic checks to the URL structure
2. url_features.py extracts numerical features from the URL
3. phishing_url_model.pkl (trained ML model) classifies the URL based on those features
4. A combined risk score and a verdict are returned
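
The URL steps above can be sketched roughly as follows. This is an illustration only, not the code in the repository: the individual rules, the feature names, the equal weighting, the 0.5 threshold, and the stand-in `toy_model_probability` function are all assumptions made for the example. In the real project the probability would come from phishing_url_model.pkl.

```python
import re
from typing import Dict, List
from urllib.parse import urlparse


def url_rule_hits(url: str) -> List[str]:
    """Hypothetical heuristics; the real checks live in rules/url_rules.py."""
    parsed = urlparse(url)
    host = parsed.hostname or ""
    hits = []
    if re.fullmatch(r"\d{1,3}(\.\d{1,3}){3}", host):
        hits.append("host is a raw IP address")
    if "@" in parsed.netloc:
        hits.append("'@' in the authority part of the URL")
    if parsed.scheme != "https":
        hits.append("not served over HTTPS")
    if host.count(".") >= 4:
        hits.append("unusually many subdomains")
    return hits


def url_features(url: str) -> Dict[str, float]:
    """Hypothetical numeric features; the real ones live in features/url_features.py."""
    host = urlparse(url).hostname or ""
    return {
        "url_length": float(len(url)),
        "num_dots": float(host.count(".")),
        "has_at": float("@" in url),
    }


def toy_model_probability(features: Dict[str, float]) -> float:
    """Stand-in for the trained classifier (assumption, not the real model)."""
    score = (
        0.004 * features["url_length"]
        + 0.1 * features["num_dots"]
        + 0.3 * features["has_at"]
    )
    return min(1.0, score)


def score_url(url: str, rule_weight: float = 0.5) -> dict:
    """Blend the rule score and the model score into one risk value."""
    hits = url_rule_hits(url)
    rule_score = min(1.0, len(hits) / 3)  # three or more hits saturate the score
    ml_score = toy_model_probability(url_features(url))
    risk = rule_weight * rule_score + (1 - rule_weight) * ml_score
    return {
        "risk": round(risk, 3),
        "verdict": "phishing" if risk >= 0.5 else "legitimate",
        "reasons": hits,  # the triggered rules double as the explanation
    }
```

Returning the triggered rules alongside the score is one simple way to make a verdict explainable: `score_url("http://192.168.0.1/login@verify")["reasons"]` lists the checks that fired for that URL.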

Email Detection

1. email_rules.py applies heuristic checks to email headers and content
2. email_features.py extracts features from the email body and metadata
3. email_tfidf_vectorizer.pkl vectorizes the text content
4. phishing_email_model.pkl classifies the vectorized input
5. A combined risk score and a verdict are returned
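
To make the header checks and the TF-IDF step more concrete, here is a small standard-library sketch. It is not the project's implementation: the two rules, the `URGENT_WORDS` list, the tokenizer, and all function names are assumptions. The IDF uses the smoothed form ln((1 + n) / (1 + df)) + 1 followed by L2 normalization, which is what scikit-learn documents as the TfidfVectorizer default, but the tokenization here is simplified. The classification step is left out; the resulting vector is what a classifier would receive.

```python
import math
import re
from collections import Counter
from email import message_from_string
from email.utils import parseaddr
from typing import Dict, List

URGENT_WORDS = {"urgent", "verify", "suspended", "password"}  # assumed word list


def tokenize(text: str) -> List[str]:
    """Lowercase and split into alphanumeric tokens (simplified tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def email_rule_hits(raw_email: str) -> List[str]:
    """Hypothetical heuristics; the real checks live in rules/email_rules.py."""
    msg = message_from_string(raw_email)
    hits = []
    from_domain = parseaddr(msg.get("From", ""))[1].rpartition("@")[2].lower()
    reply_domain = parseaddr(msg.get("Reply-To", ""))[1].rpartition("@")[2].lower()
    if reply_domain and reply_domain != from_domain:
        hits.append("Reply-To domain differs from From domain")
    body = msg.get_payload()  # a plain string for non-multipart messages
    if URGENT_WORDS & set(tokenize(body)):
        hits.append("urgent or credential-related wording")
    return hits


def fit_idf(corpus: List[str]) -> Dict[str, float]:
    """Learn smoothed inverse document frequencies from training texts."""
    n_docs = len(corpus)
    doc_freq = Counter(term for doc in corpus for term in set(tokenize(doc)))
    return {t: math.log((1 + n_docs) / (1 + df)) + 1.0 for t, df in doc_freq.items()}


def tfidf_vector(text: str, idf: Dict[str, float]) -> Dict[str, float]:
    """Weight known terms by count * idf, then L2-normalize the vector."""
    counts = Counter(t for t in tokenize(text) if t in idf)
    weights = {t: c * idf[t] for t, c in counts.items()}
    norm = math.sqrt(sum(w * w for w in weights.values()))
    return {t: w / norm for t, w in weights.items()} if norm else {}
```

Terms that were not seen when the IDF table was fitted are dropped at prediction time, which is why the vectorizer has to be saved and shipped together with the model.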

🧰 Tech Stack

Layer	Technology
Web Framework	Python, Flask
Frontend	HTML, CSS (served via Flask templates)
ML Models	scikit-learn (`.pkl` — separate models for URL and email)
Text Vectorization	TF-IDF (`email_tfidf_vectorizer.pkl`)
Feature Engineering	Custom Python modules (`url_features.py`, `email_features.py`)
Rule Engine	Custom Python modules (`url_rules.py`, `email_rules.py`)

📁 Project Structure

phishguard/
├── app/
│   ├── app.py               # Flask application — routes & detection logic
│   ├── static/              # CSS, JS, and static assets
│   └── templates/           # HTML templates
├── features/
│   ├── url_features.py      # Feature extraction for URLs
│   ├── email_features.py    # Feature extraction for emails
│   └── feature_order.txt    # Defines feature vector ordering
├── model/
│   ├── phishing_url_model.pkl        # Trained URL classifier
│   ├── phishing_email_model.pkl      # Trained email classifier
│   ├── email_tfidf_vectorizer.pkl    # TF-IDF vectorizer for email text
│   ├── load_model.py                 # URL model loader
│   └── load_email_model.py           # Email model loader
├── rules/
│   ├── url_rules.py         # Heuristic rules for URL detection
│   └── email_rules.py       # Heuristic rules for email detection
├── utils/                   # Shared utility functions
├── requirements.txt
└── runtime.txt
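
A file like feature_order.txt matters because a trained model expects its input columns in the same order at prediction time as during training. The sketch below shows how such a file could be applied, assuming one feature name per line; the actual file format is not documented here, and `parse_feature_order` and `to_vector` are hypothetical helpers.

```python
from typing import Dict, List


def parse_feature_order(text: str) -> List[str]:
    """Read feature names, one per line, skipping blank lines (assumed format)."""
    return [line.strip() for line in text.splitlines() if line.strip()]


def to_vector(features: Dict[str, float], order: List[str]) -> List[float]:
    """Arrange extracted features in the order the model was trained on."""
    missing = [name for name in order if name not in features]
    if missing:
        raise KeyError(f"missing features: {missing}")
    # Features not listed in `order` are ignored on purpose.
    return [float(features[name]) for name in order]
```

Failing loudly on a missing feature is a deliberate choice in this sketch: silently filling in a default could shift the model's input without anyone noticing.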

🚀 Getting Started

Prerequisites

Python 3.x
pip

Installation

git clone https://github.com/Adnan-commits/phishguard.git
cd phishguard
pip install -r requirements.txt

Running the App

cd app
python app.py

Then open http://localhost:5000 in your browser.

👤 Author

Adnan Bardgujar LinkedIn · GitHub

⚠️ PhishGuard is an academic project. It is not intended as a production-grade security tool without further dataset validation and hardening.
