2012
DOI: 10.1007/978-3-642-33167-1_47
|View full text |Cite
|
Sign up to set email alerts
|

Detecting Phishing Emails the Natural Language Way

Abstract: Phishing causes billions of dollars in damage every year and poses a serious threat to the Internet economy. Email is still the most commonly used medium to launch phishing attacks [1]. In this paper, we present a comprehensive natural language based scheme to detect phishing emails using features that are invariant and fundamentally characterize phishing. Our scheme utilizes all the information present in an email, namely, the header, the links and the text in the body. Although it is obvious that a phishing … Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
2
2
1

Citation Types

0
54
0

Year Published

2014
2014
2021
2021

Publication Types

Select...
6
2
1

Relationship

2
7

Authors

Journals

citations
Cited by 70 publications
(54 citation statements)
references
References 25 publications
0
54
0
Order By: Relevance
“…Other checks can also be carried out. For example, to look at differences between displayed and actual domain names [13] or carry out an NLP analysis of the actual email text [38]. If the email is delivered and the person clicks, post-click detection can also occur.…”
Section: Related Workmentioning
confidence: 99%
“…Other checks can also be carried out. For example, to look at differences between displayed and actual domain names [13] or carry out an NLP analysis of the actual email text [38]. If the email is delivered and the person clicks, post-click detection can also occur.…”
Section: Related Workmentioning
confidence: 99%
“…This work differs from theirs in several respects: we consider many other classifiers than they do and we analyze more lexical features, including the character distributions of the URLs. More on phishing email detection can be found in [24,23].…”
Section: Related Researchmentioning
confidence: 99%
“…Previous researchers [14,27] have considered URL classification based on link analysis only by proposing systems which use search engine based features as well as web site popularity and blacklists as major attributes for URL detection. However, with search engines rate-limiting queries and charging for them, methods based on Web search are becoming infeasible.…”
Section: Introductionmentioning
confidence: 99%
“…However, with search engines rate-limiting queries and charging for them, methods based on Web search are becoming infeasible. There have been some phishing URL detection attempts in the past using special symbols [20], domain information [27,29], or single-character distributions [26], but few have investigated character N-grams as features for phishing URL detection. Moreover, mostly batch machine learning algorithms have been used for phishing URL classification and analysis.…”
Section: Introductionmentioning
confidence: 99%