2015
DOI: 10.5120/21813-5191
|View full text |Cite
|
Sign up to set email alerts
|

On Determining the Most Effective Subset of Features for Detecting Phishing Websites

Abstract: Phishing websites are a form of mimicking the legitimate ones for the purpose of stealing user 's confidential information such as usernames, passwords and credit card information. Recently machine learning and data mining techniques have been a promising approach for detection of phishing websites by distinguishing between phishing and legitimate ones. The detection process in this approach is preceded by extracting various features from a website dataset to train the classifier to correctly identify phishing… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1

Citation Types

0
3
0

Year Published

2018
2018
2023
2023

Publication Types

Select...
4
2

Relationship

0
6

Authors

Journals

citations
Cited by 8 publications
(3 citation statements)
references
References 18 publications
0
3
0
Order By: Relevance
“…The breakdown of these features is as follows, the total number of features in each section has been added in the parentheses: Body (76), Header (68), URL (24). At the next level, the totals are: Semantic (30), Syntactic (79), and Pragmatic (34). One exemplar feature from each class is given below.…”
Section: Email Featuresmentioning
confidence: 99%
“…The breakdown of these features is as follows, the total number of features in each section has been added in the parentheses: Body (76), Header (68), URL (24). At the next level, the totals are: Semantic (30), Syntactic (79), and Pragmatic (34). One exemplar feature from each class is given below.…”
Section: Email Featuresmentioning
confidence: 99%
“…Random Forest (RF) has also been successful in distinguishing phishing attacks from normal websites, with Subasi et al [40] achieving an exceptional classification performance of 97.36% using the random forest classifier. Feature selection has been a focus in another study, with characteristics grouped to identify the most effective ones for accurate phishing attack detection [41].…”
Section: Related Workmentioning
confidence: 99%
“…In any Machine Learning based model, 'Feature Selection [106], [107] and Feature Engineering' is a really crucial task as it is used to derive new and novel features from the existing ones to better facilitate the subsequent learning and generalization steps if a Machine Learning based algorithm is deployed to build a model [108]. The performance of the built model can often drastically improve if an intelligent and intuitive Feature Selection and Engineering phase can be executed beforehand.…”
Section: ) Feature Selection and Engineeringmentioning
confidence: 99%