
Available online at www.sciencedirect.com

ScienceDirect
AASRI Procedia 4 (2013) 125 – 131

2013 AASRI Conference on Intelligent Systems and Control

Detecting Malicious URLs in E-Mail – An Implementation

Dhanalakshmi Ranganayakulu (a), Chellappan C (b)*

(a) Adhiparasakthi Engineering College, Melmaruvathur 603319, INDIA
(b) Anna University, Chennai 600 025, INDIA

Abstract

The World Wide Web has become the most essential medium for information communication and knowledge
dissemination. It helps to transact information in a timely, rapid and easy manner. Identity theft and
identity fraud are referred to as two sides of cybercrime, in which hackers and malicious users obtain the
personal data of existing legitimate users to attempt fraud or deception for financial gain. E-mails are
used as phishing tools: legitimate-looking e-mails are sent that assume the identity of genuine users and
combine legitimate content with malicious URLs. They help to steal consumers' personal data such as user
names, account numbers, passwords and other financial account credentials. Spam e-mails emerge as, or
transform into, phishing mails. Spoofed mails play a vital role: the hacker pretends to be a legitimate
sender from a legitimate organization and induces the user to give up his personal credentials. The content
may escape content-based filters, or the e-mail may have no message body at all except a malicious URL.
This paper identifies malicious URLs in e-mail through a reduced feature set method.

© 2013 The Authors. Published by Elsevier B.V. Open access under CC BY-NC-ND license.
Selection and/or peer review under responsibility of American Applied Science Research Institute

Keywords: Age of Domain; Host based features; Lexical features; Malicious URLs; Page rank; Phishing.

* E-mail address: ddhanalakshmisai@gmail.com.

2212-6716 © 2013 The Authors. Published by Elsevier B.V. Open access under CC BY-NC-ND license.
Selection and/or peer review under responsibility of American Applied Science Research Institute
doi:10.1016/j.aasri.2013.10.020

1. Introduction

The Web serves as a convenient medium for a large number of malicious activities such as spam attacks,
phishing attacks and DDoS attacks, most of them financially motivated. These attacks entice ordinary
users to click links attached to legitimate-looking or spam e-mails and so lead them to malicious sites,
where they are urged to give their personal information. Phishing attacks are described in terms of a Lure,
a Hook and a Catch (Jacobsson and Myers 2007). Spoofed e-mails pose as coming from a legitimate company
seeking sensitive information. These e-mails are called the 'Lure'. E-mails with malicious URLs may have
legitimate content in the body of the mail, which content-based spam filters are unable to detect. The
URLs lead to the actual phishing sites, which are clones of legitimate websites and lure the users into
entering sensitive information. The actual phishing websites are the 'Hook', which obtains the private
information from the user. The malicious user invokes various critical conditions such as account
suspension or a failed transaction, or forces the user to upgrade to a newly installed security feature.
The links in the e-mail lead to a fake phishing site, referred to as the 'Catch'. The browser may fail to
indicate that a website is illegitimate, so the phishing website appears legitimate. In some cases, the
user also overrides the browser's decision.

2. Existing solutions

Blacklists take the form of lists of IP addresses or websites; e-mail filters use them to block users from
reaching the listed IP addresses or websites. PhishNet (Pawan et al 2010) enhances existing blacklists by
discovering related malicious URLs. One major problem with blacklists is that they fail to identify phishing
URLs in the early hours of a phishing attack because their update process is insufficiently fast. Phishing
campaigns have an average life of less than two hours (Sheng et al 2009), and by the time a phishing website
is positively identified and blacklisted, the attack has largely run its course. Various features are
extracted from URLs, including suspicious characters, the number of dots in the URL, hexadecimal characters,
IP addresses and the length of the URL. Colin Whittaker et al (2010) discussed a scalable machine learning
algorithm that automatically classifies phishing pages by training the classifier on a noisy dataset. The
false positive rate is below 1% and the classifier is based on Google's phishing blacklist URLs. Justin Ma
et al (2009) discuss a method to detect malicious websites by analyzing lexical and host-based features with
a passive-aggressive algorithm. Further improvement can be obtained by analyzing the features of page content
and page rank. Zhang et al (2007) proposed a content-based method using a linear classifier and achieved
89% TP (true positive) and 1% FP (false positive); the test case was demonstrated for 100 phishing URLs and
100 legitimate URLs. CANTINA+ (2010) classifies phishing URLs with a more exhaustive feature set and obtained
a classification accuracy of 92.3%. Several related studies and case studies have analyzed which feature set
is required in order to reduce exhaustiveness and time consumption. Maher Aburrous et al (2010) conducted a
survey to identify the required features that help to improve the accuracy and the precision of detecting
malicious URLs. Various sources of phishing attacks are obtained from the APWG archive (2011) and the
Phishtank archive (2012). The features are listed in Table 1.

Table 1 High impact features on URL phishing instances [Maher Aburrous et al (2010)]

S.No.  Phishing Features                         No. of appearances  Appearance %
1.     Using the IP address                      14                  46.66
2.     Abnormal request URL                      30                  100
3.     Abnormal URL of anchor                    7                   23.33
4.     Abnormal DNS record                       2                   06.66
5.     Abnormal URL                              5                   16.66
6.     Using SSL certificate                     17                  56.66
7.     Certification authority                   4                   13.33
8.     Abnormal cookie                           2                   06.66
9.     Distinguished Names Certificate (DN)      4                   13.33
10.    Redirect pages                            3                   10.00
11.    Straddling attack                         2                   06.66
12.    Pharming attack                           4                   13.33
13.    Using onMouseOver to hide the link        6                   20.00
14.    Server Form Handler (SFH)                 2                   06.66
15.    Spelling errors                           24                  80.00
16.    Copying website                           5                   16.66
17.    Using forms with "Submit" button          6                   20.00
18.    Using pop-up windows                      8                   26.66
19.    Disabling right click                     2                   06.66
20.    Long URL address                          22                  73.33
21.    Replacing similar characters for URL      16                  53.33
22.    Adding prefix or suffix                   9                   30.00
23.    Using the @ symbol to confuse             6                   20.00
24.    Using hexadecimal character codes         8                   26.66
25.    Much emphasis on security and response    5                   16.66
26.    Buying time to access accounts            3                   10.00
The selected features display high impact in various studies reported in the literature; hence the feature
set comprises features whose impact is greater than 20%. It covers the host-based features, lexical
features, page rank and suspicious keywords in the mail for better performance.
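
The 20% selection rule can be sketched in a few lines of Python. This is only an illustration: the dictionary holds a subset of the rows of Table 1, the total of 30 instances is an assumption inferred from the row that reaches 100%, and the function names are hypothetical.

```python
# Illustrative subset of Table 1: feature name -> number of appearances.
# TOTAL_INSTANCES = 30 is an assumption inferred from the 100% row.
TOTAL_INSTANCES = 30

appearances = {
    "Using the IP address": 14,
    "Abnormal request URL": 30,
    "Abnormal DNS record": 2,
    "Spelling errors": 24,
    "Long URL address": 22,
    "Redirect pages": 3,
}


def appearance_percent(count, total=TOTAL_INSTANCES):
    """Percentage truncated to two decimals, as Table 1 appears to do."""
    # Integer arithmetic avoids floating-point surprises when truncating.
    return (count * 10000 // total) / 100


def high_impact(features, threshold=20.0):
    """Return the names of features whose percentage exceeds the threshold."""
    return sorted(name for name, count in features.items()
                  if appearance_percent(count) > threshold)
```

With a strict "greater than" comparison, a feature sitting at exactly 20.00% would be excluded; whether the paper intends a strict or inclusive cut-off is not stated.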

3. URL analyzer

Phishing URLs can be analyzed based on the lexical features and host-based features of the URL. Lexical
features describe the format of the URL. A URL contains a host name and a path. For example, in
'www.annauniv.edu/emmrc/emmrc.html', the host name is www.annauniv.edu and the path is emmrc/emmrc.html.
The proposed methodology analyses host-based features such as page rank and age of domain, and lexical
features such as URL encoding, the presence of suspicious characters, and hexadecimal characters or IP
addresses used to hide the real URL. It also analyses word probabilities to determine whether the e-mail
contains any suspicious links, so that end users avoid falling for phishing attacks, as illustrated in
Fig 1. This is useful because illegitimate users spoof their identities and pass authentication tests, and
their mails may also escape content analysis by avoiding spam keywords. Some e-mails contain no message in
the body except malicious links, urging the users to click them and leading to fraudulent websites.
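
The host/path split described above can be sketched with Python's standard library. This is a minimal illustration rather than the authors' implementation; the function name is hypothetical, and the 'http://' prefix is added only because the example URL in the text has no scheme.

```python
from urllib.parse import urlsplit


def split_host_and_path(url):
    """Return (host, path) for a URL, adding a scheme if it is missing."""
    # urlsplit only recognises the host when a scheme or '//' is present,
    # so a bare 'www.example.org/a' is prefixed with 'http://' first.
    if "://" not in url:
        url = "http://" + url
    parts = urlsplit(url)
    return parts.hostname, parts.path.lstrip("/")


host, path = split_host_and_path("www.annauniv.edu/emmrc/emmrc.html")
```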

Fig 1 URL feature extraction

3.1 Lexical Features (F1)

Lexical features describe the format of the URL. They include the length of the host name, the length of
the URL, the number of dots, and the presence of suspicious characters such as the '@' symbol, hexadecimal
characters and other special characters (such as '.', '=', '$' and '^') in either the host name or the
path. IP addresses and hexadecimal characters are used to hide the actual URL. For example, the URL
"http://www.bankingcompany.com/online/transaction/website/phishing.html" can be shortened to the IP address
form http://132.115.201.115, which looks legitimate rather than suspicious. A URL can also be written using
hexadecimal values prefixed with a '%' symbol, which can represent any special character. SpoofGuard (Neil
Chou et al 2004) identified the '@' and '-' symbols as the most prominent in phishing URLs. An '@' symbol in
a URL causes the part to its left, which looks like a legitimate URL, to be discarded, and the part to its
right, the phishing site, to be visited. For example, the URL "http://www.citibank.com@phishingsite.com"
leads to "phishingsite.com" and discards "www.citibank.com". Such techniques disguise the actual phishing
website so that it poses as a legitimate site.
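
A few of the lexical checks listed above can be sketched as follows. This is an illustrative subset under stated assumptions: the function and key names are hypothetical, and the paper does not specify how each feature is scored.

```python
import ipaddress
import re
from urllib.parse import urlsplit


def lexical_features(url):
    """Extract a few lexical features of the kind described in Section 3.1."""
    parts = urlsplit(url)
    # hostname is the part after any '@', which is where the browser goes.
    host = parts.hostname or ""
    try:
        ipaddress.ip_address(host)
        host_is_ip = True
    except ValueError:
        host_is_ip = False
    return {
        "url_length": len(url),
        "host_length": len(host),
        "num_dots": url.count("."),
        "has_at_symbol": "@" in url,
        # '%' followed by two hex digits marks a percent-encoded character.
        "has_hex_encoding": re.search(r"%[0-9a-fA-F]{2}", url) is not None,
        "host_is_ip": host_is_ip,
    }


spoofed = lexical_features("http://www.citibank.com@phishingsite.com")
numeric = lexical_features("http://132.115.201.115")
```

For the spoofed example from the text, the parsed host is the part to the right of the '@', which matches the behaviour the section describes.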

3.2 Host Based Features

Host-based features identify the location and owner of malicious sites and how they are hosted and managed.
Some of the features are as follows.

3.2.1 Age of domain (F2)

The age of the domain indicates when a website was first hosted. Malicious websites set up to obtain user
credentials tend to be young, that is, relatively new. They are typically registered recently and send a
large volume of mail, and some domains may no longer be available at the time of checking. The age is
obtained as a number of months, and in some cases in years. A WHOIS lookup on the WHOIS server retrieves the
domain registration date; if the domain registration entry is not found on the WHOIS server, this feature
simply returns -1, deeming the domain suspicious.
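
The age computation can be sketched without performing a live WHOIS query. In this illustration the registration date is passed in by the caller (None when no WHOIS entry exists); the function names and the 12-month cut-off are assumptions, since the paper does not state the threshold it uses.

```python
from datetime import date


def age_of_domain_months(registered_on, today):
    """Return the domain age in whole months, or -1 if no WHOIS record exists."""
    if registered_on is None:
        return -1
    months = (today.year - registered_on.year) * 12 + (today.month - registered_on.month)
    # Do not count the current month until the registration day is reached.
    if today.day < registered_on.day:
        months -= 1
    return max(months, 0)


def is_suspicious_age(age_months, min_months=12):
    """Flag young domains; min_months is a hypothetical cut-off."""
    # A missing record (-1) falls below any positive cut-off and is flagged.
    return age_months < min_months
```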
3.2.2 Page rank (F3)

Page rank assigns a rank to a webpage: the higher the page rank, the more important the page. Phishing web
pages have a low age of domain and are short-lived; hence they obtain a very low page rank, or no page rank
at all. Page rank is a link analysis algorithm first used by Google, in which each document on the web is
assigned a numerical weight from 0 to 10, with 0 indicating least popular and 10 meaning most popular. A
score value of 1 is assigned when the page rank value for a particular webpage is not available. After
examining the dataset of 1000 phishing mails and 1000 legitimate mails, the percentage of e-mails matching
the lexical and host-based features is listed in Table 2.

Table 2 Characterizing lexical and host based features matching with Phishing Mails

Feature                                Legitimate   Phishing
Has IP Address                         0%           0.04%
Has "Hexadecimal" Character            0%           0.01%
Has suspicious character '@' symbol    0%           0.01%
More No. of Dots                       0.01%        0.06%
Suspicious Age of Domain               35%          75%
Page rank < 3 feature                  1.2%         88%
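
The page rank feature (F3) can be expressed as a small helper. This is a sketch under assumptions: the function names are hypothetical, the page rank value is supplied by the caller, and the default score of 1 for a missing value follows the rule stated in Section 3.2.2.

```python
def page_rank_score(page_rank):
    """Map a page rank (0-10, or None when unavailable) to the score used for F3."""
    # The text assigns a score of 1 when no page rank is available.
    return 1 if page_rank is None else page_rank


def low_page_rank(page_rank, cutoff=3):
    """True when the page matches the 'Page rank < 3' feature of Table 2."""
    return page_rank_score(page_rank) < cutoff
```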

3.3 Number of Sensitive Words in URL

3.3.1 Individual occurrences (F4) and co-occurrences of suspicious phishing keywords (F5)

Abu-Nimeh et al (2007) used the "bag-of-words" approach with a list of the 43 most frequent words as
features in a machine learning approach. Garera et al (2007) used a set of eight sensitive words (secure,
account, update, login, sign-in, banking, confirm and verify) that frequently appear in phishing URLs. The
system is trained with 1000 phishing e-mails to give weights to the suspicious words found in the phishing
e-mails. The most frequently occurring words that are counted include Secure, Account, Update, Login,
Verify, Signin, Banking, Notify, Click, Inconvenient and Password, together with their co-occurrences in
the phishing mail.
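
Counting individual keywords (F4) and keyword pairs (F5) might look like the sketch below. The keyword list is taken from the text; the tokenization, the function names and the definition of a co-occurrence as any pair of distinct keywords present in the same mail are assumptions.

```python
import re
from collections import Counter
from itertools import combinations

# Keyword list taken from the text; matching is case-insensitive on whole words.
KEYWORDS = {"secure", "account", "update", "login", "verify", "signin",
            "banking", "notify", "click", "inconvenient", "password"}


def keyword_counts(text):
    """F4: count individual occurrences of each suspicious keyword."""
    words = re.findall(r"[a-z]+", text.lower())
    return Counter(w for w in words if w in KEYWORDS)


def keyword_cooccurrences(text):
    """F5: unordered pairs of distinct keywords that appear in the same mail."""
    present = sorted(keyword_counts(text))
    return list(combinations(present, 2))


sample = "Please login to verify your account. Click here to login."
counts = keyword_counts(sample)
pairs = keyword_cooccurrences(sample)
```

Note that this simple tokenizer splits hyphenated forms such as "sign-in" into two words, so such variants would need to be normalized first.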

3.4 Approach - Bayes Classifier

The Bayes classifier is adapted from spam filters, under the assumption that the individual features of
URLs are distributed independently of the values of other features. Bayes theorem is used to calculate the
probability of the hypothesis B, given the training data A:

$$P(B \mid A) = \frac{P(A \mid B)\,P(B)}{P(A)}$$    (1.1)

It is often easier to calculate the probabilities P(A|B), P(A) and P(B) than the probability P(B|A) that is
required. Extrapolating Bayes' rule, assume that legitimate and phishing websites occur in equal numbers and
hence with equal probability; then the posterior probability that the feature vector X belongs to a
malicious URL is such that

(1.2)

(1.3)

(1.4)

where P(A) = probability of feature F in the phishing and legitimate datasets, and

P(B(Phishing)) = P(B'(Legitimate)) = 0.5
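
As a hedged illustration of the equal-prior assumption (a standard expansion of Bayes' theorem, not necessarily the authors' exact form of (1.2) to (1.4)), the posterior for a single feature A can be written with B' denoting the legitimate class:

```latex
\begin{align*}
P(B \mid A) &= \frac{P(A \mid B)\,P(B)}{P(A \mid B)\,P(B) + P(A \mid B')\,P(B')} \\
            &= \frac{P(A \mid B)}{P(A \mid B) + P(A \mid B')}
               \qquad \text{since } P(B) = P(B') = 0.5 .
\end{align*}
```

For instance, taking the page rank < 3 rates from Table 2 as P(A|B) = 0.88 and P(A|B') = 0.012 gives P(B|A) = 0.88 / 0.892 ≈ 0.987.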

The classifier has a training dataset of malicious phishing URLs and legitimate URLs. The probability of
occurrence of each feature in the dataset is calculated and the respective scores are obtained; that is,
the occurrences of the features in the dataset are counted and the cumulative score is calculated. If the
cumulative score > threshold, the URL is considered a phishing URL, otherwise a legitimate URL, as
illustrated in Fig 2.

Fig 2 Phishing URL classifications

a) How many times does feature F (F1, F2, F3, F4, F5, F6) appear in the phishing dataset?

b) How many times does feature F (F1, F2, F3, F4, F5, F6) appear in the legitimate dataset?

Let
F1 = Lexical features
F2 = Age of the domain factor of URLs
F3 = Occurrence of page rank < 3 in the phishing and legitimate datasets
F4 = Individual occurrence of suspicious keywords
F5 = Co-occurrences of suspicious keywords
F6 = Login form detection
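
The counting and thresholding steps can be sketched as below. The paper does not define how the cumulative score is formed, so summing per-feature probabilities is an assumption made here for illustration, as are the function names and the threshold of 1.0. The example counts are derived from the Table 2 percentages applied to 1000 mails per class.

```python
def feature_probability(times_in_phishing, times_in_legitimate):
    """P(phishing | feature) under equal priors and equally sized datasets."""
    total = times_in_phishing + times_in_legitimate
    # An unseen feature carries no evidence either way.
    return 0.5 if total == 0 else times_in_phishing / total


def cumulative_score(observed, counts):
    """Sum the per-feature probabilities for the features observed in a URL."""
    return sum(feature_probability(*counts[name]) for name in observed)


def classify(observed, counts, threshold):
    """Label a URL 'phishing' when its cumulative score exceeds the threshold."""
    return "phishing" if cumulative_score(observed, counts) > threshold else "legitimate"


# (phishing count, legitimate count) per feature, from Table 2 and 1000 mails each.
counts = {"F2": (750, 350), "F3": (880, 12)}
label_both = classify(["F2", "F3"], counts, threshold=1.0)
label_one = classify(["F2"], counts, threshold=1.0)
```

A product of likelihood ratios, or a sum of log-odds, would be the more conventional naive Bayes combination; the additive score is used here only because the text speaks of a cumulative score compared with a threshold.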

4. Conclusion and Results

Hackers bypass anti-spam filtering techniques by embedding malicious URLs in the content of the messages.
The URL analyzer method therefore identifies the malicious URLs in e-mails with the help of a minimized
phishing feature set. The datasets are obtained from two sources, viz. the DMOZ Open Directory Project and
Phishtank (2012). Phishtank is a source of blacklisted phishing URLs which accepts user submissions that
are also verified by users. An e-mail server named SSE Mail Server has been configured with hMail for
testing purposes. The false positive rate refers to the proportion of legitimate e-mails classified as
phishing e-mails, and the false negative rate refers to the proportion of phishing e-mails classified as
legitimate. Table 3 shows the results obtained on 1000 phishing mails with malicious URLs when identifying
the various lexical and host-based features.
Table 3 Performance analysis with the existing systems

Technique                                     Number of features (n)   TPR (%)   FPR (%)   Time Complexity
Cantina (Existing) (with n1 features)         n1 (20)                  89        1         O(n1)
Cantina+ (Existing) (with n2 features)        n2 (27)                  92.54     0.407     O(n2) (n1 < n2)
URL Classifier (Proposed) (with m features)   m (14)                   92.8      0.4       O(m) (m < n2)
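
The rates in Table 3 follow the usual confusion-matrix definitions, sketched below. The function name is hypothetical, and the example counts are not reported in the paper: they are chosen only because, with 1000 mails per class, they reproduce the TPR and FPR of the proposed classifier's row.

```python
def rates(true_pos, false_neg, false_pos, true_neg):
    """Return (TPR %, FPR %, FNR %) from confusion-matrix counts."""
    tpr = 100.0 * true_pos / (true_pos + false_neg)
    fpr = 100.0 * false_pos / (false_pos + true_neg)
    # The false negative rate is the complement of the true positive rate.
    return tpr, fpr, 100.0 - tpr


# Hypothetical counts: 928 of 1000 phishing mails caught, 4 of 1000 legitimate mails flagged.
tpr, fpr, fnr = rates(true_pos=928, false_neg=72, false_pos=4, true_neg=996)
```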

References

1. Colin Whittaker, Brian Ryner and Marria Nazif, "Large-Scale Automatic Classification of Phishing
Pages", In Proceedings of NDSS, 2010.
2. Fette, I., Sadeh, N. and Tomasic, A. "Learning to Detect Phishing Emails", In WWW: Proceedings
of the 16th International Conference on World Wide Web, pp. 649-656, 2007.
3. Garera, S., Provos, N., Rubin, A.D. and Chew, M. "A Framework for Detection and Measurement of
Phishing Attacks", In Proceedings of the 2007 ACM Workshop on Recurring Malcode, pp. 1-8, 2007.
4. Justin Ma, Lawrence K. Saul, Stefan Savage and Geoffrey M. Voelker, "Beyond Blacklists: Learning
to Detect Malicious Web Sites from Suspicious URLs", Proceedings of the 15th ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, pp. 1245-1254, 2009.
5. Jacobsson, M. and Myers, S. "Phishing and Countermeasures - Understand the Increasing Problem
of Electronic Identity Theft", New Jersey: Wiley, 2007.
6. Justin Ma, Lawrence K. Saul, Stefan Savage and Geoffrey M. Voelker, "Identifying Suspicious URLs:
An Application of Large-Scale Online Learning", In ICML '09: Proceedings of the 26th Annual International
Conference on Machine Learning, pp. 681-688, 2009.
7. Maher Aburrous, Hossain, M.A., Keshav Dahal and Fadi Thabtah, "Experimental Case Studies for
Investigating E-Banking Phishing Techniques and Attack Strategies", Cognitive Computation,
DOI 10.1007/s12559-010-9042-7, Vol. 2, pp. 242-253, 2010.
8. Neil Chou, Robert Ledesma, Yuka Teraguchi, Dan Boneh and John Mitchell, "Client-side defense
against web-based identity theft", In 11th Annual Network and Distributed System Security Symposium
(NDSS '04), San Diego, 2004.
9. Pawan Prakash, Manish Kumar, Ramana Rao Kompella and Minaxi Gupta, "PhishNet: Predictive
Blacklisting to Detect Phishing Attacks", Proceedings of the IEEE Infocom, pp. 1-5, 2010.
10. Sheng, S., Wardman, B., Warner, G., Cranor, L., Hong, J. and Zhang, C. "An empirical analysis of
phishing blacklists", In Proceedings of the CEAS'09, 2009.
11. Xiang, G., Hong, J., Rose, C. P. and Cranor, L. "CANTINA+: A feature-rich machine learning
framework for detecting phishing Web sites", ACM Trans. Inf. Syst. Secur. Vol. 14, No. 2, pp. 1-21, 2011.
12. Zhang, Y., Hong, J. and Cranor, L. "Cantina: A Content-Based Approach to Detecting Phishing
Web Sites", In Proceedings of the 16th International Conference on World Wide Web, pp. 639-648, 2007.
