2017
DOI: 10.1002/cpe.4181
|View full text |Cite
|
Sign up to set email alerts
|

Comprehensive analysis of network traffic data

Abstract: With the large volume of network traffic flow, it is necessary to preprocess raw data before classification to gain the accurate results speedily. Feature selection is an essential approach in preprocessing phase. The principal component analysis (PCA) is recognized as an effective and efficient method. In this paper, we classify network traffic flows by using the PCA technique together with 6 machine learning algorithms-Naive Bayes, decision tree, 1-nearest neighbor, random forest, support vector machine, and… Show more

Help me understand this report

Search citation statements

Order By: Relevance

Paper Sections

Select...
1
1
1
1

Citation Types

0
9
0

Year Published

2018
2018
2024
2024

Publication Types

Select...
6
2
1

Relationship

0
9

Authors

Journals

citations
Cited by 15 publications
(9 citation statements)
references
References 21 publications
0
9
0
Order By: Relevance
“…-Random Forest is a collection of decision trees applied to avoid the instability and risk of overtraining that can occur with a single tree. It consists in suppressing the decision nodes without reducing the overall precision of the tree [49]. Characterized by an adjustment of only two parameters which are the number of trees and the set of attributes to be chosen during the construction of each node, which simplifies the generation of decision forests [50] -Support vector machines are a classification method that transforms a linear problem into a higher dimensional space entity.…”
Section: Methodsmentioning
confidence: 99%
“…-Random Forest is a collection of decision trees applied to avoid the instability and risk of overtraining that can occur with a single tree. It consists in suppressing the decision nodes without reducing the overall precision of the tree [49]. Characterized by an adjustment of only two parameters which are the number of trees and the set of attributes to be chosen during the construction of each node, which simplifies the generation of decision forests [50] -Support vector machines are a classification method that transforms a linear problem into a higher dimensional space entity.…”
Section: Methodsmentioning
confidence: 99%
“…The work in [20] defined a method to classify network traffic flows by using principal component analysis (PCA) technique together with six ML algorithms. They paid attention to the pre-processing phase as they adopted the 20 features presented in [21], and used the feature of server responding duration in their classification experiment.…”
Section: Streamingmentioning
confidence: 99%
“…Miao et al [ 29 ] compared six machine learning algorithms for traffic classification: Naive Bayes, RF, SVM, H O, KNN, and DT. They used principal component analysis for feature extraction and analyzed its influence on the classification results.…”
Section: Related Workmentioning
confidence: 99%