2020
DOI: 10.3390/app10145011
|View full text |Cite
|
Sign up to set email alerts
|

A Discrete Hidden Markov Model for SMS Spam Detection

Abstract: Many machine learning methods have been applied for short messaging service (SMS) spam detection, including traditional methods such as naïve Bayes (NB), vector space model (VSM), and support vector machine (SVM), and novel methods such as long short-term memory (LSTM) and the convolutional neural network (CNN). These methods are based on the well-known bag of words (BoW) model, which assumes documents are unordered collection of words. This assumption overlooks an important piece of information, i.e., word or… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
4
1

Citation Types

0
23
0

Year Published

2020
2020
2024
2024

Publication Types

Select...
6
3
1

Relationship

0
10

Authors

Journals

citations
Cited by 53 publications
(23 citation statements)
references
References 53 publications
0
23
0
Order By: Relevance
“…Accuracy, precision, recall, and F-measure (F1) were applied as the evaluation indexes of the model in this paper. Accuracy is the score of sentiment Scientific Programming correctly predicted in all microblog comments [35], which is the percentage of examples that the classifier obtains from the total number of examples predicted by a given label. e precision is the fraction of relevant instances among all retrieved instances.…”
Section: Evaluation Indexmentioning
confidence: 99%
“…Accuracy, precision, recall, and F-measure (F1) were applied as the evaluation indexes of the model in this paper. Accuracy is the score of sentiment Scientific Programming correctly predicted in all microblog comments [35], which is the percentage of examples that the classifier obtains from the total number of examples predicted by a given label. e precision is the fraction of relevant instances among all retrieved instances.…”
Section: Evaluation Indexmentioning
confidence: 99%
“…Chen et al [8] designed a model with knowledge powered attention mechanisms for classifying short texts according to their semantics. A further example of short text analysis is spam identification in social media posts, emails, and short messaging services [9,10]. However, typical short texts are structured and segmented character strings such as "Jay and Jolin are born in Taiwan" in [8] while domain names are unstructured and unsegmented character strings such as incometaxindiaefiling.gov.in, teacherspayteachers.com, and adnet workperformance.com in the top one million sites provided by Alexa [11].…”
Section: Introductionmentioning
confidence: 99%
“…Other works employ traditional classifiers to such a task (Fernandes et al, 2015;Fattahi and Mejri, 2021;Xia and Chen, 2020), including diverse models like Support Vector Machines (SVM), Hidden Markov Models (HMM), Optimum-Path Forest (OPF), k-Nearest Neighbors (KNN), decision trees, and ensembling approaches. Gupta et al (2018) provide a comparative study using CNN and traditional machine learning architectures.…”
Section: Introductionmentioning
confidence: 99%