2013
DOI: 10.5120/13145-0549

Designing Spam Model - Classification Analysis using Decision Trees

Abstract: Spam has diluted the message pool and causes frustration, so automatic processing of emails is required. This study constructs a spam model using a classification technique from data mining. To accomplish this, experiments were conducted on a spam dataset downloaded from the UCI Machine Learning Repository, which was classified using the popular data mining tool WEKA. The final classification result is "1" if a message is spam and "0" otherwise. Email is a popular mode of communication, and its…
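To make the setup concrete, here is a minimal sketch of the experiment in Python with scikit-learn rather than WEKA. C4.5/J48 is not available in scikit-learn, so a CART tree with entropy splits stands in for it, and the local filename spambase.data is an assumption; the UCI Spambase file has 57 numeric features per row with the class label (1 = spam, 0 = not spam) in the last column.

import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
from sklearn.metrics import accuracy_score

# Assumed local copy of the UCI Spambase data: 57 features, label in last column.
data = np.loadtxt("spambase.data", delimiter=",")
X, y = data[:, :-1], data[:, -1].astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, random_state=42, stratify=y)

# CART with entropy splits as a rough analog of WEKA's J48 (C4.5).
clf = DecisionTreeClassifier(criterion="entropy", random_state=42)
clf.fit(X_train, y_train)

pred = clf.predict(X_test)  # 1 = spam, 0 = legitimate mail
print("accuracy:", accuracy_score(y_test, pred))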

Cited by 7 publications (5 citation statements)
References 7 publications
“…We can see how the number of leaves and the tree size decrease as the confidence factor converges to a much smaller value, while the accuracy remains almost the same. However, lowering the confidence factor means we have less confidence in our training data (Rajput & Arora, 2013); therefore, the confidence factor was fixed at 0.001. In addition, as mentioned in Patel and Upadhyay (2012), increasing minNumObj decreases the size of the tree and the number of leaves dramatically with a very small compromise in accuracy, as can be seen in Table 3.…”
Section: Results
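To illustrate the trade-off this statement describes, here is a small hypothetical sweep. WEKA's confidenceFactor has no direct scikit-learn equivalent, so cost-complexity pruning (ccp_alpha) stands in for it, and a synthetic dataset replaces Spambase for brevity; the expected pattern is the same: stronger pruning shrinks the tree while accuracy stays nearly flat.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Synthetic stand-in for the Spambase data (57 features, binary labels).
X, y = make_classification(n_samples=2000, n_features=57, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Larger ccp_alpha = heavier post-pruning = fewer leaves and nodes.
for alpha in [0.0, 0.0005, 0.001, 0.005, 0.01]:
    clf = DecisionTreeClassifier(criterion="entropy", ccp_alpha=alpha,
                                 random_state=0).fit(X_tr, y_tr)
    print(f"alpha={alpha:<7} leaves={clf.get_n_leaves():<4}"
          f" nodes={clf.tree_.node_count:<4} acc={clf.score(X_te, y_te):.3f}")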
“…The minimum number of objects prevents the creation of a new branch unless the number of instances in the branch is equal to or greater than the specified threshold; thus, this is a pre-pruning strategy (Drazin & Montag, 2012; Han et al., 2011; Rajput & Arora, 2013; Witten et al., 2016). Apart from the above three options, all the remaining options were left at their defaults.…”
Section: Developing the Decision Tree with J48
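A sketch of the pre-pruning effect described above, again on synthetic data: WEKA's minNumObj corresponds roughly to scikit-learn's min_samples_leaf, which refuses a split unless every resulting leaf would retain at least that many training instances.

from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

X, y = make_classification(n_samples=2000, n_features=57, random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, random_state=0)

# Raising the per-leaf minimum prunes the tree before it is ever grown.
for m in [1, 5, 10, 25, 50]:
    clf = DecisionTreeClassifier(criterion="entropy", min_samples_leaf=m,
                                 random_state=0).fit(X_tr, y_tr)
    print(f"minNumObj~{m:<3} leaves={clf.get_n_leaves():<4}"
          f" acc={clf.score(X_te, y_te):.3f}")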
“…Examples of text processing techniques are stopword removal and tokenization. Common classification techniques for document analysis include Support Vector Machines (Elmurngi and Gherbi, 2017), Naive Bayes (Zhang and Li, 2007), Logistic Regression (Cheng and Hüllermeier, 2009), and Decision Trees (Rajput and Arora, 2013).…”
Section: Related Work
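As a toy illustration of the pipeline this passage names, the sketch below runs tokenization and stopword removal through a CountVectorizer and fits one classifier from each of the four named families; the documents and labels are invented for demonstration.

from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.linear_model import LogisticRegression
from sklearn.svm import LinearSVC
from sklearn.tree import DecisionTreeClassifier

docs = ["win a free prize now", "meeting agenda for monday",
        "free offer claim your prize", "project status and agenda"]
labels = [1, 0, 1, 0]  # 1 = spam, 0 = not spam (toy data)

# Tokenization and English stopword removal happen inside the vectorizer.
vec = CountVectorizer(stop_words="english")
X = vec.fit_transform(docs)

for model in (MultinomialNB(), LogisticRegression(),
              LinearSVC(), DecisionTreeClassifier()):
    model.fit(X, labels)
    print(type(model).__name__,
          model.predict(vec.transform(["free prize offer"])))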
“…Lisong Pei, Jakob Schütte, and Carlos Simon (2007-10-07) explained that there are two basic, complementary trends in intrusion detection, one of which is knowledge-based: in a knowledge-based approach, knowledge about known attacks is used to detect them [17].…”
Section: Related Research
“…Decision trees are built using learning samples, which are historical data with pre-assigned classes. The resulting decision tree is pruned by cost-complexity pruning, and its splits are selected using the Gini index [17]. Here, the data are split into two subsets in such a way that each subset contains more homogeneous records than its parent.…”
Section: Classification and Regression Tree
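A worked example of the Gini index mentioned above: a node's impurity is 1 minus the sum of squared class proportions, and a candidate binary split is scored by the size-weighted impurity of its two children; the class counts below are invented for illustration.

def gini(counts):
    # Gini impurity: 1 - sum of squared class proportions.
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

parent = [50, 50]               # 50 spam, 50 non-spam records at the node
left, right = [45, 5], [5, 45]  # candidate binary split

weighted = (sum(left) * gini(left) + sum(right) * gini(right)) / sum(parent)
print(f"parent Gini = {gini(parent):.3f}")  # 0.500
print(f"split Gini  = {weighted:.3f}")      # 0.180 -> children more homogeneous

Since the weighted child impurity (0.180) is well below the parent's (0.500), CART would favor this split; among all candidate splits it chooses the one with the lowest weighted child impurity.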